Li, Y., Ubaidali, D. M., Wang, L., & Zhang, W. (2025). Step-by-Step Correction of LLM-based Math Word Problems Solutions. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5. https://doi.org/10.1109/icassp49660.2025.10889273
Abstract:
Following the success of Large Language Models (LLMs) in language tasks, LLMs have been adapted for reasoning in math word problems (MWPs). MWP solving is a complex task that requires both semantic understanding of the text and mathematical reasoning, so achieving high accuracy remains a challenge. We find that MWP performance can be improved by step-by-step reasoning, where the LLM is trained to generate smaller and more manageable steps. We further propose a post-processing correction model to edit the initial solutions given by the LLM. Our correction model, designed to detect and rectify mistakes in these steps, is first pretrained on heuristically generated, model-agnostic error data and then finetuned on model-specific errors generated through self-supervised augmentation. The correction model iteratively refines the solution step by step: it analyzes the problem statement and the steps up to the current one, makes corrections as needed, and repeats the process until all steps in the solution are processed. Experimental results demonstrate that step-by-step reasoning significantly improves MWP performance compared to one-step solutions. The combination of pretraining and finetuning effectively aligns the correction model with the error patterns of the reasoning model, resulting in further accuracy improvements through error correction.
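The iterative refinement loop described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: `correct_step` is a hypothetical stand-in for the trained correction model, and the sample problem, steps, and the hard-coded fix are invented for demonstration.

```python
# Sketch of the step-by-step correction loop from the abstract:
# walk through the solution steps in order, conditioning each
# correction on the problem statement and the steps so far.

def correct_step(problem: str, prior_steps: list[str], step: str) -> str:
    """Placeholder for the trained correction model.

    A real model would detect and rectify errors in `step` given the
    problem and the (already corrected) prior steps. Here we hard-code
    one fix for a deliberately wrong arithmetic step, for illustration.
    """
    if step == "3 + 4 = 8":
        return "3 + 4 = 7"
    return step

def refine_solution(problem: str, steps: list[str]) -> list[str]:
    corrected: list[str] = []
    for step in steps:
        # Each step sees only the problem and the corrected steps before it.
        corrected.append(correct_step(problem, corrected, step))
    return corrected

problem = "Tom has 3 apples and buys 4 more. How many apples does he have?"
steps = ["3 + 4 = 8", "Answer: 8"]
print(refine_solution(problem, steps))
```

Because each correction conditions on the already-corrected prefix, an early fix can propagate to later steps once `correct_step` is a genuine model rather than this placeholder.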
License type:
Publisher Copyright
Funding Info:
This research / project is supported by the Ministry of Education, Singapore - Science of Learning Grant
Grant Reference no. : MOE-MOESOL2021-0006