Pre-Training on Mixed Data for Low-Resource Neural Machine Translation

Cited by: 7
Authors
Zhang, Wenbo [1, 2, 3]
Li, Xiao [1, 2, 3]
Yang, Yating [1, 2, 3]
Dong, Rui [1, 2, 3]
Affiliations
[1] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830011, Peoples R China
Keywords
neural machine translation; pre-training; low resource; word translation
DOI
10.3390/info12030133
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The pre-training and fine-tuning paradigm has been shown to be effective for low-resource neural machine translation. In this paradigm, models pre-trained on monolingual data are used to initialize translation models, transferring knowledge from the monolingual data into them. Recent pre-training models usually take sentences with randomly masked words as input and are trained to predict the masked words from the unmasked ones. In this paper, we propose a new pre-training method that still predicts masked words but randomly replaces some of the unmasked words in the input with their translations in another language. The translations come from bilingual data, so the pre-training data contains both monolingual and bilingual data. We conduct experiments on a Uyghur-Chinese corpus to evaluate our method. The experimental results show that our method gives the pre-training model better generalization ability and helps the translation model achieve better performance. Through a word translation task, we also demonstrate that our method enables the embeddings of the translation model to acquire more alignment knowledge.
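The input construction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `make_mixed_example`, the `[MASK]` symbol, the two probabilities, and the form of the word-level `lexicon` (assumed here to have been extracted from bilingual data beforehand) are all hypothetical choices made for illustration.

```python
import random

MASK = "[MASK]"  # assumed mask symbol; the abstract does not name one


def make_mixed_example(tokens, lexicon, mask_prob=0.15, replace_prob=0.15, rng=None):
    """Build one pre-training example from a monolingual sentence.

    tokens:  list of source-language words.
    lexicon: dict mapping a source word to a list of candidate translations
             (hypothetical; assumed to come from bilingual data).
    Returns (inputs, targets). targets[i] holds the original word at masked
    positions and None elsewhere, so only masked words are predicted.
    The default probabilities are illustrative, not values from the paper.
    """
    rng = rng or random.Random()
    inputs, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            # Masked position: the model must recover the original word.
            inputs.append(MASK)
            targets.append(tok)
        elif tok in lexicon and rng.random() < replace_prob:
            # Unmasked position: swap in a translation word as context.
            inputs.append(rng.choice(lexicon[tok]))
            targets.append(None)
        else:
            # Unmasked position left unchanged.
            inputs.append(tok)
            targets.append(None)
    return inputs, targets
```

In this sketch, replaced words act only as context and are never prediction targets, which matches the abstract's statement that the method "still predicts masked words". Whether the original method also treats replaced positions differently during training is not stated in the abstract.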
Pages: 10