Vietnamese Elementary Math Reasoning Using Large Language Model with Refined Translation and Dense-Retrieved Chain-of-Thought

被引：1

作者：

Nguyen-Khang Le ^{[1
]}

Dieu-Hien Nguyen ^{[1
]}

Dinh-Truong Do ^{[1
]}

Chau Nguyen ^{[1
]}

Minh Le Nguyen ^{[1
]}

机构：

[1] Japan Adv Inst Sci & Technol, Nomi, Ishikawa, Japan

来源：

NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2024 | 2024年 / 14741卷

关键词：

Large language model; Low-resource language; Mathematics reasoning;

D O I：

10.1007/978-981-97-3076-6_18

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

State-of-the-art large language models (LLMs) have succeeded in various tasks but still show limitations in solving math reasoning problems. Although this problem is actively studied in the English language, a scarcity of work has been conducted to explore LLMs in math reasoning in low-resource languages. Recent advances in LLMs show their ability to obtain cross-lingual knowledge. However, a systematical approach to bridge the language gap and employ these LLMs to math reasoning in low-resource language has yet to be studied. This study proposes a pipeline to solve math problems in Vietnamese by integrating the chain-of-thought technique with high-quality in-context learning exemplars obtained by multilingual dense retrieval. The pipeline is modelagnostic and capable of adapting to any language without fine-tuning. Empirical results show that the proposed pipeline obtains remarkable performance gains compared to competitive baseline LLMs, paving the way for future research on employing English-focus LLMs to solve complex reasoning tasks in low-resource languages.

引用

页码：260 / 268

页数：9

共 31 条

[1]

An Shengnan, 2024, Learning from mistakes makes llm better reasoner

[2]

Bai JZ, 2023, Arxiv, DOI [arXiv:2309.16609, DOI 10.48550/ARXIV.2309.16609]

[3]

Chen W., 2023, Trans. Mach. Learn. Res.

[4]

Chiang TR, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2656

[5]

Chiang Wei-Lin, 2023, Vicuna: An Open -Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality

[6]

Cobbe K, 2021, Arxiv, DOI [arXiv:2110.14168, DOI 10.48550/ARXIV.2110.14168]

[7] UNDERSTANDING AND SOLVING ARITHMETIC WORD-PROBLEMS - A COMPUTER-SIMULATION [J].

FLETCHER, CR .

BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1985, 17 (05) :565-571

[8]

Huang D., 2017, P 2017 C EMPIRICAL, P805, DOI [DOI 10.18653/V1/D17-1084, 10.18653/v1/d17-1084]

[9]

Huang D., 2018, P 27 INT C COMPUTATI, P213

[10]

Kaplan J., 2020, Scaling Laws for Neural Language Models

← 1 2 3 4 →