Addressing Troublesome Words in Neural Machine Translation

被引:0
作者
Zhao, Yang [1 ,2 ]
Zhang, Jiajun [1 ,2 ]
He, Zhongjun [4 ]
Zone, Chengqing [1 ,2 ,3 ]
Wu, Hua [4 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
[4] Baidu Inc, Beijing, Peoples R China
来源
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the weaknesses of Neural Machine Translation (NMT) is in handling lowfrequency and ambiguous words, which we refer as troublesome words. To address this problem, we propose a novel memoryenhanced NMT method. First, we investigate different strategies to define and detect the troublesome words. Then, a contextual memory is constructed to memorize which target words should be produced in what situations. Finally, we design a hybrid model to dynamically access the contextual memory so as to correctly translate the troublesome words. The extensive experiments on Chineseto-English and English-to-German translation tasks demonstrate that our method significantly outperforms the strong baseline models in translation quality, especially in handling troublesome words.
引用
收藏
页码:391 / 400
页数:10
相关论文
共 36 条
[1]  
[Anonymous], 2015, P 3 INT C LEARN REPR
[2]  
[Anonymous], 2017, P EMNLP 2017
[3]  
[Anonymous], 2017, P WMT 2017
[4]  
[Anonymous], 2013, P ACL 2013
[5]  
[Anonymous], 2016, EMNLP 2016
[6]  
[Anonymous], 2017, P EMNLP 2017
[7]  
Arthur P., 2016, P 2016 C EMP METH NA, P1557, DOI [DOI 10.18653/V1/D16-1162, 10.18653/V1/D16-1162]
[8]  
Carpuat Marine, 2007, P 2007 JOINT C EMPIR, P61
[9]  
Cho K., 2014, P 2014 C EMP METH NA, P1724
[10]  
Dyer Chris, 2013, SIMPLE FAST EFFECTIV