Improving Neural Machine Translation by Retrieving Target Translation Template

被引:2
作者
Li, Fuxue [1 ,2 ]
Chi, Chuncheng [3 ]
Yan, Hong [2 ]
Zhang, Zhen [2 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang, Peoples R China
[2] Yingkou Inst Technol, Coll Elect Engn, Yingkou, Peoples R China
[3] Shenyang Univ Chem Technol, Coll Comp Sci & Technol, Shenyang, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV | 2023年 / 14089卷
关键词
Translation template; Neural machine translation; Fuzzy matching strategy;
D O I
10.1007/978-981-99-4752-2_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the neural machine translation (NMT) paradigm, transformer-based NMT has achieved great progress in recent years. It is based on the standard end-to-end structure, and acquires translation knowledge through the attention mechanism from the parallel corpus automatically without human intervention. Inspired by the process of translating sentences by human translators and the successful application of translation template in statistical machine translation, this paper proposes a novel approach to incorporate the target translation template into the Transformer-based NMT model. Firstly, the template extraction method derives the parallel templates corpus from the constituency parse tree. Secondly, given a sentence to be translated, a fuzzy matching strategy is proposed to calculate the most possible target translation template from the parallel template corpus. Finally, an effective method is proposed to incorporate the target translate template into the Transformer-based NMT model. Experimental results on three translation tasks demonstrate the effectiveness of the proposed approach and it improves the translation quality significantly.
引用
收藏
页码:658 / 669
页数:12
相关论文
共 29 条
[1]  
Anastasopoulos A, 2018, Arxiv, DOI arXiv:1802.06655
[2]  
[Anonymous], 2005, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL '05, DOI DOI 10.3115/1219840.1219874
[3]  
[Anonymous], 2007, P 45 ANN M ASS COMPU
[4]  
Ba J. L., 2016, Layer normalization
[5]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6]  
Carl M., 1999, P MACH TRANSL SUMM 7, P250
[7]  
Dinu G, 2019, Arxiv, DOI arXiv:1906.01105
[8]  
Duan N., 2017, EMNLP, DOI DOI 10.18653/V1/D17-1090
[9]  
Duan SF, 2020, Arxiv, DOI arXiv:2004.14200
[10]  
Gehring J, 2017, PR MACH LEARN RES, V70