From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting

Nuo Chen, Hongguang Li, Baoyuan Wang, Jia Li

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

2 Citations (Scopus)

Abstract

This paper investigates the performance of Large Language Models (LLMs) and Tool-augmented LLMs in tackling complex mathematical reasoning tasks. We introduce IMR-TIP: Improving Math Reasoning with Tool-augmented Interleaf Prompting, a framework that combines the strengths of both LLMs and Tool-augmented LLMs. IMR-TIP follows the “From Good to Great” concept, collecting multiple potential solutions from both LLMs and their Tool-Augmented counterparts for the same math problem, and then selecting or regenerating the most accurate answer after crosschecking these solutions via tool-augmented interleaf prompting. The framework incorporates two key aspects: self-prompt and tool-augmented interleaf prompting (TIP). The former allows LLMs to autonomously refine and improve an initial prompt related to tool usage, while the latter enables LLMs to derive the final answer by dynamically analyzing the problem, cross-checking potential solutions, and revising previous reasoning hints in an interleaved manner. Experimental analysis shows that IMR-TIP achieves enhanced mathematical capabilities and outperforms traditional LLMs and tool-augmented LLMs in accuracy and reasoning diversity on math reasoning tasks. For instance, IMR-TIP can improve Tool-augmented ChatGPT on GSM8K-Hard from 56.0% to 65.2 %.

Original languageEnglish
Title of host publication2nd Workshop on Natural Language Reasoning and Structured Explanations, NLRSE 2024 at ACL 2024 - Proceedings of the Workshop
EditorsBhavana Dalvi Mishra, Greg Durrett, Peter Jansen, Ben Lipkin, Danilo Neves Ribeiro, Lionel Wong, Xi Ye, Wenting Zhao
PublisherAssociation for Computational Linguistics (ACL)
Pages64-79
Number of pages16
ISBN (Electronic)9798891761421
Publication statusPublished - 2024
Externally publishedYes
Event2nd Workshop on Natural Language Reasoning and Structured Explanations, NLRSE 2024, co-located with ACL 2024 - Bangkok, Thailand
Duration: 15 Aug 2024 → …

Publication series

Name2nd Workshop on Natural Language Reasoning and Structured Explanations, NLRSE 2024 at ACL 2024 - Proceedings of the Workshop

Conference

Conference2nd Workshop on Natural Language Reasoning and Structured Explanations, NLRSE 2024, co-located with ACL 2024
Country/TerritoryThailand
CityBangkok
Period15/08/24 → …

Bibliographical note

Publisher Copyright:
© 2024 Association for Computational Linguistics.

Fingerprint

Dive into the research topics of 'From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting'. Together they form a unique fingerprint.

Cite this