Low resource neural machine translation model optimization based on semantic confidence weighted alignment

被引:2
作者
Zhuang, Xuhui [1 ,2 ]
Gao, Shengxiang [1 ,2 ]
Yu, Zhengtao [1 ,2 ]
Guo, Junjun [1 ,2 ]
Wang, Xiaocong [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming 650500, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural machine translation; Semantic confidence-weighted alignment; Low-resource languages; Data noise mitigation; CORPUS;
D O I
10.1007/s13042-024-02148-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of neural machine translation models based on the Transformer architecture is contingent upon the quality of the data. When the training data contains a high proportion of noise, the performance of the model deteriorates. This paper addresses the issue of diminished model capability in the presence of noisy datasets by proposing an optimization method based on semantic confidence-weighted alignment. This method integrates alignment metrics and model parameter confidence adjustments to recalibrate loss weights, thereby enhancing the model's ability to identify and process noisy data. Experimental results demonstrate that this approach significantly improves the performance of translation models, particularly in low-resource language pairs such as Malay-Chinese, especially when dealing with noisy datasets. Compared to traditional methods, there is a notable increase in BLEU scores.
引用
收藏
页码:4325 / 4340
页数:16
相关论文
共 28 条
[21]  
Utiyama M, 2003, 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P72
[22]  
Vaswani A, 2017, ADV NEUR IN, V30
[23]  
Wang S, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P791
[24]  
Wang Weilun, 2022, arXiv
[25]  
Wang Weiyue, 2018, arXiv
[26]   Chinese-Vietnamese Pseudo-Parallel Sentences Extraction Based on Image Information Fusion [J].
Wen, Yonghua ;
Guo, Junjun ;
Yu, Zhiqiang ;
Yu, Zhengtao .
INFORMATION, 2023, 14 (05)
[27]  
Xu JH, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P419
[28]   Domain Adaptation for SMT Using Sentence Weight [J].
Zhou, Xinpeng ;
Cao, Hailong ;
Zhao, Tiejun .
CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 :153-163