Multi-way, multilingual neural machine translation

被引:41
作者
Firat, Orhan [1 ]
Cho, Kyunghyun [2 ]
Sankaran, Baskaran [3 ]
Vural, Fatos T. Yarman [1 ]
Bengio, Yoshua [4 ]
机构
[1] Middle East Tech Univ, Ankara, Turkey
[2] NYU, New York, NY 10003 USA
[3] IBM TJ Watson Res Ctr, Cambridge, MA USA
[4] Univ Montreal, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Neural machine translation; Multi-lingual; Low resource translation;
D O I
10.1016/j.csl.2016.10.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs. (C) 2016 Elsevier Ltd. All rights reserved
引用
收藏
页码:236 / 252
页数:17
相关论文
共 33 条
[1]  
[Anonymous], HLT NAACL IN PRESS
[2]  
[Anonymous], P 2008 C EMP METH NA
[3]  
[Anonymous], 2013, P 2013 C EMPIRICAL M
[4]  
[Anonymous], 2012, P 16 EAMT C TRENT IT
[5]  
[Anonymous], ICLR 2015
[6]  
[Anonymous], 2015, P INT C NEUR INF PRO
[7]  
[Anonymous], MULTITASK LEARNING M
[8]  
[Anonymous], 2012, ARXIV E PRINTS
[9]  
[Anonymous], 2015, P ACL
[10]  
[Anonymous], ABS160100710 CORR