Low resource neural machine translation model optimization based on semantic confidence weighted alignment

被引:2
作者
Zhuang, Xuhui [1 ,2 ]
Gao, Shengxiang [1 ,2 ]
Yu, Zhengtao [1 ,2 ]
Guo, Junjun [1 ,2 ]
Wang, Xiaocong [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming 650500, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural machine translation; Semantic confidence-weighted alignment; Low-resource languages; Data noise mitigation; CORPUS;
D O I
10.1007/s13042-024-02148-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of neural machine translation models based on the Transformer architecture is contingent upon the quality of the data. When the training data contains a high proportion of noise, the performance of the model deteriorates. This paper addresses the issue of diminished model capability in the presence of noisy datasets by proposing an optimization method based on semantic confidence-weighted alignment. This method integrates alignment metrics and model parameter confidence adjustments to recalibrate loss weights, thereby enhancing the model's ability to identify and process noisy data. Experimental results demonstrate that this approach significantly improves the performance of translation models, particularly in low-resource language pairs such as Malay-Chinese, especially when dealing with noisy datasets. Compared to traditional methods, there is a notable increase in BLEU scores.
引用
收藏
页码:4325 / 4340
页数:16
相关论文
共 28 条
[1]  
Abdelali A, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1856
[2]  
Abdul-Rauf S., 2009, Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics, P16
[3]  
Acarcicek H, 2020, P 5 C MACHINE TRANSL, P940
[4]   Pseudotext Injection and Advance Filtering of Low-Resource Corpus for Neural Machine Translation [J].
Adjeisah, Michael ;
Liu, Guohua ;
Nyabuga, Douglas Omwenga ;
Nortey, Richard Nuetey ;
Song, Jinling .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6]  
Batheja A, 2022, P 2022 C EMPIRICAL M, P5395
[7]  
Batheja A, 2023, ARXIV
[8]  
Dou Q, 2022, ARXIV
[9]  
Gandrabur Simona., 2003, P 7 C NATURAL LANGUA, P95
[10]   Learning to refine source representations for neural machine translation [J].
Geng, Xinwei ;
Wang, Longyue ;
Wang, Xing ;
Yang, Mingtao ;
Feng, Xiaocheng ;
Qin, Bing ;
Tu, Zhaopeng .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) :2199-2212