Vietnamese Elementary Math Reasoning Using Large Language Model with Refined Translation and Dense-Retrieved Chain-of-Thought

被引:1
作者
Nguyen-Khang Le [1 ]
Dieu-Hien Nguyen [1 ]
Dinh-Truong Do [1 ]
Chau Nguyen [1 ]
Minh Le Nguyen [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Nomi, Ishikawa, Japan
来源
NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2024 | 2024年 / 14741卷
关键词
Large language model; Low-resource language; Mathematics reasoning;
D O I
10.1007/978-981-97-3076-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art large language models (LLMs) have succeeded in various tasks but still show limitations in solving math reasoning problems. Although this problem is actively studied in the English language, a scarcity of work has been conducted to explore LLMs in math reasoning in low-resource languages. Recent advances in LLMs show their ability to obtain cross-lingual knowledge. However, a systematical approach to bridge the language gap and employ these LLMs to math reasoning in low-resource language has yet to be studied. This study proposes a pipeline to solve math problems in Vietnamese by integrating the chain-of-thought technique with high-quality in-context learning exemplars obtained by multilingual dense retrieval. The pipeline is modelagnostic and capable of adapting to any language without fine-tuning. Empirical results show that the proposed pipeline obtains remarkable performance gains compared to competitive baseline LLMs, paving the way for future research on employing English-focus LLMs to solve complex reasoning tasks in low-resource languages.
引用
收藏
页码:260 / 268
页数:9
相关论文
共 31 条
[1]  
An Shengnan, 2024, Learning from mistakes makes llm better reasoner
[2]  
Bai JZ, 2023, Arxiv, DOI [arXiv:2309.16609, DOI 10.48550/ARXIV.2309.16609]
[3]  
Chen W., 2023, Trans. Mach. Learn. Res.
[4]  
Chiang TR, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2656
[5]  
Chiang Wei-Lin, 2023, Vicuna: An Open -Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality
[6]  
Cobbe K, 2021, Arxiv, DOI [arXiv:2110.14168, DOI 10.48550/ARXIV.2110.14168]
[7]   UNDERSTANDING AND SOLVING ARITHMETIC WORD-PROBLEMS - A COMPUTER-SIMULATION [J].
FLETCHER, CR .
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1985, 17 (05) :565-571
[8]  
Huang D., 2017, P 2017 C EMPIRICAL, P805, DOI [DOI 10.18653/V1/D17-1084, 10.18653/v1/d17-1084]
[9]  
Huang D., 2018, P 27 INT C COMPUTATI, P213
[10]  
Kaplan J., 2020, Scaling Laws for Neural Language Models