JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3059656074

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

About this item

Full title

Author / Creator

Zhou, Kun , Zhang, Beichen , Wang, Jiapeng , Chen, Zhipeng , Wayne Xin Zhao , Sha, Jing , Sheng, Zhichao , Wang, Shijin and Ji-Rong, Wen

Publisher

Ithaca: Cornell University Library, arXiv.org

Journal title

arXiv.org, 2024-05

Language

English

Formats

Articles

Publication information

Publisher

Ithaca: Cornell University Library, arXiv.org

Subjects

Subjects and topics

More information

Scope and Contents

Contents

Mathematical reasoning is an important capability of large language models~(LLMs) for real-world applications. To enhance this capability, existing work either collects large-scale math-related texts for pre-training, or relies on stronger LLMs (\eg GPT-4) to synthesize massive math problems. Both types of work generally lead to large costs in training or synthesis. To reduce the cost, based on open-source available texts, we propose an efficient way that trains a small LLM for math problem synthesis, to efficiently generate sufficient high-quality pre-training data. To achieve it, we create a dataset using GPT-4 to distill its data synthesis capability into the small LLM. Concretely, we craft a set of prompts based on human education stages to guide GPT-4, to synthesize problems covering diverse math knowledge and difficulty levels. Besides, we adopt the gradient-based influence estimation method to select the most valuable math-related texts. The both are fed into GPT-4 for creating the knowledge distillation dataset to train the small LLM. We leverage it to synthesize 6 million math problems for pre-training our JiuZhang3.0 model, which only needs to invoke GPT-4 API 9.3k times and pre-train on 4.6B data. Experimental results have shown that JiuZhang3.0 achieves state-of-the-art performance on several mathematical reasoning datasets, under both natural language reasoning and tool manipulation settings. Our code and data will be publicly released in \url{https://github.com/RUCAIBox/JiuZhang3.0}....

Alternative Titles

Full title

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

Authors, Artists and Contributors

Author / Creator

Zhou, Kun
Zhang, Beichen
Wang, Jiapeng
Chen, Zhipeng
Wayne Xin Zhao
Sha, Jing
Sheng, Zhichao
Wang, Shijin
Ji-Rong, Wen

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_3059656074

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3059656074

Other Identifiers

E-ISSN

2331-8422

How to access this item

Full text available

View in old catalogue

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models

About this item

Publication information

Subjects

More information

Scope and Contents

Alternative Titles

Authors, Artists and Contributors

Identifiers

Primary Identifiers

Other Identifiers

How to access this item

Connecting people and collections

Indigenous engagement

Learning

Stories