Neural Machine Translation with Diversity-Enabled Translation Memory

被引:0
作者
Quang Chieu Nguyen [1 ,2 ]
Xuan Dung Doan [1 ]
Van-Vinh Nguyen [2 ]
Khac-Hoai Nam Bui [1 ]
机构
[1] Viettel Grp, Viettel Cyberspace Ctr, Hanoi, Vietnam
[2] Vietnam Natl Univ Hanoi, Hanoi, Vietnam
来源
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT I | 2023年 / 13995卷
关键词
Neural Machine Translation; Translation Memory; Maximal Marginal Relevance; Low Resource Language;
D O I
10.1007/978-981-99-5834-4_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural machine translation (NMT) using translation memory (TM) has been introduced as an emergent technique for improving machine translation systems (MTS). In this study, we propose an end-to-end NMT model with TM by exploiting the diversity of the retrieval-augmented phase using maximal marginal relevance (MMR). In particular, the proposed model is designed with monolingual TM, which is able to support low-resource scenarios. Furthermore, the memory retriever and translation models are jointly trained to improve translation performance. For the experiment, we use IWSLT15 (En <-> Vi) as a benchmark dataset to evaluate the performance of the proposed method. Accordingly, the experiential results show the effectiveness of the proposed method compared with strong baselines in this research field.
引用
收藏
页码:322 / 333
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 2009, Proceedings of the Second ACM International Conference on Web Search and Data Mining, WSDM '09, DOI 10.1145/1498759.1498766
[2]  
Artetxe Mikel., 2018, Unsupervised neural machine translation, DOI DOI 10.18653/V1/D18-1399
[3]  
Borodin A., 2012, P 31 ACM SIGMOD SIGA, P155
[4]  
Cai D., 2021, P 59 ANN M ASS COMP, V1, P7307
[5]   Learning to Reuse Translations: Guiding Neural Machine Translation with Examples [J].
Cao, Qian ;
Kuang, Shaohui ;
Xiong, Deyi .
ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 :1982-1989
[6]  
Cao Q, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P3042
[7]  
Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
[8]  
Cettolo M., 2015, INT WORKSH SPOK LANG, P2
[9]  
Cheng X., 2022, P 2022 C EMPIRICAL M, P3591, DOI [10.18653/v1/2022.emnlp, DOI 10.18653/V1/2022.EMNLP]
[10]  
Cheng Y, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3974