Joint pairwise learning and masked language models for neural machine translation of English

被引:0
作者
Yang, Shuhan [1 ]
Yang, Qun [1 ]
机构
[1] Guangzhou Huashang Coll, Guangzhou 511300, Peoples R China
关键词
Adversarial training; Corpus; Machine translation; Pairwise learning; Masked languages; Decoding;
D O I
10.1007/s10015-025-01008-2
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The translation activity of language is a link and bridge for the integration of politics, economy, and culture in various countries. However, manual translation requires high quality of professional translators and takes a long time. The study attempts to introduce dual learning on the basis of traditional neural machine translation models. The improved neural machine translation model includes decoding of the source language and target language. With the help of the source language encoder, forward translation, backward backtranslation, and parallel decoding can be achieved; At the same time, adversarial training is carried out using a corpus containing noise to enhance the robustness of the model, enriching the technical and theoretical knowledge of existing neural machine translation models. The test results show that compared with the training speed of the baseline model, the training speed of the constructed model is 115 K words/s and the decoding speed is 2647 K words/s, which is 7.65 times faster than the decoding speed, and the translation quality loss is within the acceptable range. The mean bilingual evaluation score for the "two-step" training method was 16.51, an increase of 3.64 points from the lowest score, and the K-nearest-neighbor algorithm and the changing-character attack ensured the semantic integrity of noisy source language utterances to a greater extent. The translation quality of the changing character method outperformed that of the unrestricted noise attack method, with the highest bilingual evaluation study score value improving by 3.34 points and improving the robustness of the model. The translation model constructed by the study has been improved in terms of training speed and robustness performance, and is of practical use in many translation domains.
引用
收藏
页码:342 / 353
页数:12
相关论文
共 28 条
[1]   Improved neural machine translation using Natural Language Processing (NLP) [J].
Ahammad, Sk Hasane ;
Kalangi, Ruth Ramya ;
Nagendram, S. ;
Inthiyaz, Syed ;
Priya, P. Poorna ;
Faragallah, Osama S. ;
Mohammad, Alsharef ;
Eid, Mahmoud M. A. ;
Rashed, Ahmed Nabih Zaki .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) :39335-39348
[2]   The explanatory power of descriptive translation studies in the machine translation era [J].
Asscher, Omri .
PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2024, 32 (02) :261-277
[3]  
Blanka K., 2022, ED INFORM TECHNOL, V28, P663
[4]   SemSyn: Semantic-Syntactic Similarity Based Automatic Machine Translation Evaluation Metric [J].
Chauhan, Shweta ;
Kumar, Rahul ;
Saxena, Shefali ;
Kaur, Amandeep ;
Daniel, Philemon .
IETE JOURNAL OF RESEARCH, 2024, 70 (04) :3823-3834
[5]   Multilingual Neural Machine Translation for Indic to Indic Languages [J].
Das, Sudhansu Bala ;
Panda, Divyajyoti ;
Mishra, Tapas Kumar ;
Patra, Bidyut Kr. ;
Ekbal, Asif .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (05)
[6]   Improving Multilingual Neural Machine Translation System for Indic Languages [J].
Das, Sudhansu Bala ;
Biradar, Atharv ;
Mishra, Tapas Kumar ;
Patra, Bidyut Kr. .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (06)
[7]   Machine translation from signed to spoken languages: state of the art and challenges [J].
De Coster, Mathieu ;
Shterionov, Dimitar ;
Van Herreweghe, Mieke ;
Dambre, Joni .
UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2024, 23 (03) :1305-1331
[8]   Improving neural machine translation for low resource languages through non-parallel corpora: a case study of Egyptian dialect to modern standard Arabic translation [J].
Faheem, Mohamed Atta ;
Wassif, Khaled Tawfik ;
Bayomi, Hanaa ;
Abdou, Sherif Mahdy .
SCIENTIFIC REPORTS, 2024, 14 (01)
[9]  
Guo Y., 2022, J Comput Cogn Eng, V2, P5, DOI DOI 10.47852/BONVIEWJCCE2202192
[10]   Neural Machine Translation by Fusing Key Information of Text [J].
Hu, Shijie ;
Li, Xiaoyu ;
Bai, Jiayu ;
Lei, Hang ;
Qian, Weizhong ;
Hu, Sunqiang ;
Zhang, Cong ;
Kofi, Akpatsa Samuel ;
Qiu, Qian ;
Zhou, Yong ;
Yang, Shan .
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02) :2803-2815