Multi-way, multilingual neural machine translation

被引:40
|
作者
Firat, Orhan [1 ]
Cho, Kyunghyun [2 ]
Sankaran, Baskaran [3 ]
Vural, Fatos T. Yarman [1 ]
Bengio, Yoshua [4 ]
机构
[1] Middle East Tech Univ, Ankara, Turkey
[2] NYU, New York, NY 10003 USA
[3] IBM TJ Watson Res Ctr, Cambridge, MA USA
[4] Univ Montreal, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Neural machine translation; Multi-lingual; Low resource translation;
D O I
10.1016/j.csl.2016.10.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs. (C) 2016 Elsevier Ltd. All rights reserved
引用
收藏
页码:236 / 252
页数:17
相关论文
共 50 条
  • [11] An empirical study of low-resource neural machine translation of manipuri in multilingual settings
    Salam Michael Singh
    Thoudam Doren Singh
    Neural Computing and Applications, 2022, 34 : 14823 - 14844
  • [12] Multi-coverage Model for Neural Machine Translation
    Liu J.-P.
    Huang K.-Y.
    Li J.-Y.
    Song D.-X.
    Huang D.-G.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 1141 - 1152
  • [13] Adaptive Adapters: An Efficient Way to Incorporate BERT Into Neural Machine Translation
    Guo, Junliang
    Zhang, Zhirui
    Xu, Linli
    Chen, Boxing
    Chen, Enhong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1740 - 1751
  • [14] Multilingual sequence to sequence convolutional machine translation
    Bansal, Mani
    Lobiyal, D. K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (25) : 33701 - 33726
  • [15] Multilingual sequence to sequence convolutional machine translation
    Mani Bansal
    D. K. Lobiyal
    Multimedia Tools and Applications, 2021, 80 : 33701 - 33726
  • [16] Neural Machine Translation as a Novel Approach to Machine Translation
    Benkova, Lucia
    Benko, Lubomir
    DIVAI 2020: 13TH INTERNATIONAL SCIENTIFIC CONFERENCE ON DISTANCE LEARNING IN APPLIED INFORMATICS, 2020, : 499 - 508
  • [17] Neural Name Translation Improves Neural Machine Translation
    Li, Xiaoqing
    Yan, Jinghui
    Zhang, Jiajun
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 93 - 100
  • [18] The Event/Machine of Neural Machine Translation?
    Regnauld, Arnaud
    JOURNAL OF AESTHETICS AND PHENOMENOLOGY, 2022, 9 (02) : 141 - 154
  • [19] Neural machine translation: Challenges, progress and future
    Zhang, JiaJun
    Zong, ChengQing
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (10) : 2028 - 2050
  • [20] Neural machine translation: Challenges, progress and future
    JiaJun Zhang
    ChengQing Zong
    Science China Technological Sciences, 2020, 63 : 2028 - 2050