Improving Machine Translation Capabilities by Fine-Tuning Large Language Models and Prompt Engineering with Domain-Specific Data

被引:0
作者
Laki, Laszlo Janos [1 ]
Yang, Zijian Gyozo [2 ]
机构
[1] Glohalese GmbH, Trier, Germany
[2] HUN REN Hungarian Res Ctr Linguist, Budapest, Hungary
来源
2024 IEEE 3RD CONFERENCE ON INFORMATION TECHNOLOGY AND DATA SCIENCE, CITDS 2024 | 2024年
关键词
machine translation; large language models; prompt; fine-tuning; retrieval augmented generation; Ilamaindex; domain-specific data;
D O I
10.1109/CITDS62610.2024.10791375
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study examines the applicability and performance of large language models (LLMs) in the field of machine translation for in-domain texts, with a particular focus on tine-tuning LLMs, few-shot prompting, and word vector based example sentence search methods. The aim of the study is to determine the extent to which the few-shot technique can improve translation quality for domain-specific texts. Our results indicate that the few-shot learning approach consistently improved translation quality across all examined LLM systems, with performance enhancements ranging from 10% to 25% in BLEU scores. Surprisingly, the word vector-based method, which uses the vectorial representation of words to select translation examples, did not perform as well as the character similarity based fuzzy matching technique. The study discusses the performance of various systems, highlighting significant advancements achieved through tine-tuning and few-shot prompting.
引用
收藏
页码:129 / 133
页数:5
相关论文
共 20 条
[1]  
Amin Farajian M., 2017, C MACH TRANSL
[2]  
Aranberri N., P 24 ANN C EUR ASS M
[3]  
Brown TB, 2020, ADV NEUR IN, V33
[4]  
Chung Hyung Won, 2022, Scaling Instruction-Finetuned Language Models
[5]  
Csaki Z., 2024, Sambalingo: Teaching large language models new languages
[6]   The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation [J].
Goyal, Naman ;
Gao, Cynthia ;
Chaudhary, Vishrav ;
Chen, Peng-Jen ;
Wenzek, Guillaume ;
Ju, Da ;
Krishnan, Sanjana ;
Ranzato, Marc'Aurelio ;
Guzman, Francisco ;
Fan, Angela .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 :522-538
[7]  
Goyal T., 2023, News summarization and evaluation in the era of gpt-3
[8]  
Kaplan J, 2020, Arxiv, DOI [arXiv:2001.08361, 10.48550/arXiv.2001.08361]
[9]  
Liu J., 2022, GitHub repository
[10]  
Oravecz Csaba, 2022, P 7 C MACH TRANSL WM, P346