Multi-way, multilingual neural machine translation

Cited by: 40
Authors
Firat, Orhan [1 ]
Cho, Kyunghyun [2 ]
Sankaran, Baskaran [3 ]
Vural, Fatos T. Yarman [1 ]
Bengio, Yoshua [4 ]
Affiliations
[1] Middle East Tech Univ, Ankara, Turkey
[2] NYU, New York, NY 10003 USA
[3] IBM TJ Watson Res Ctr, Cambridge, MA USA
[4] Univ Montreal, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Neural machine translation; Multi-lingual; Low resource translation;
DOI
10.1016/j.csl.2016.10.006
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs. © 2016 Elsevier Ltd. All rights reserved.
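The linear-growth claim in the abstract can be illustrated with a rough parameter count. The sketch below is not taken from the paper: it assumes one encoder per source language, one decoder per target language, and a single shared attention module, and it compares that against a hypothetical baseline that trains a separate model for every directed language pair. The function names and the component sizes passed to them are placeholders chosen for illustration only.

```python
def multiway_params(n_langs: int, enc: int, dec: int, attn: int) -> int:
    """Parameter count under the assumed multi-way layout.

    Assumption: one encoder and one decoder per language, plus a single
    attention module shared by all pairs, so the total is linear in n_langs.
    """
    return n_langs * enc + n_langs * dec + attn


def pairwise_params(n_langs: int, enc: int, dec: int, attn: int) -> int:
    """Parameter count for a hypothetical baseline with one full model
    (encoder + decoder + attention) per directed language pair, which
    grows quadratically in n_langs."""
    n_pairs = n_langs * (n_langs - 1)
    return n_pairs * (enc + dec + attn)


if __name__ == "__main__":
    # Placeholder component sizes, not figures from the paper.
    ENC, DEC, ATTN = 10, 10, 1
    for n in (2, 4, 8):
        print(n, multiway_params(n, ENC, DEC, ATTN), pairwise_params(n, ENC, DEC, ATTN))
```

Under these assumptions, doubling the number of languages roughly doubles the multi-way count, while the pairwise count grows by about a factor of four.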
Pages: 236-252
Number of pages: 17
Related papers
  • [1] A Survey of Multilingual Neural Machine Translation
    Dabre, Raj
    Chu, Chenhui
    Kunchukuttan, Anoop
    ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [2] Survey on Neural Machine Translation for multilingual translation system
    Basmatkar, Pranjali
    Holani, Hemant
    Kaushal, Shivani
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 443 - 448
  • [3] Synchronous Inference for Multilingual Neural Machine Translation
    Wang, Qian
    Zhang, Jiajun
    Zong, Chengqing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1827 - 1839
  • [4] Incorporating Typological Features into Language Selection for Multilingual Neural Machine Translation
    Mi, Chenggang
    Zhu, Shaolin
    Fan, Yi
    Xie, Lei
    WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 348 - 357
  • [5] Pruning Residual Networks in Multilingual Neural Machine Translation to Improve Zero-Shot Translation
    Lu, Kaiwen
    Yang, Yating
    Dong, Rui
    Ma, Bo
    Wang, Lei
    Zhou, Xi
    Ahmat, Ahtamjan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 280 - 292
  • [6] An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages
    Mueller, Aaron
    Nicolai, Garrett
    McCarthy, Arya D.
    Lewis, Dylan
    Wu, Winston
    Yarowsky, David
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3710 - 3718
  • [7] Multilingual Neural Machine Translation for Low Resourced Languages: Ometo-English
    Yigezu, Mesay Gemeda
    Woldeyohannis, Michael Melese
    Tonja, Atnafu Lambebo
    2021 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR DEVELOPMENT FOR AFRICA (ICT4DA), 2021, : 89 - 94
  • [8] Multilingual Machine Translation: An Analytical Study
    Phadke, Madhura Mandar
    Devane, Satish R.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 881 - 884
  • [9] An empirical study of low-resource neural machine translation of Manipuri in multilingual settings
    Singh, Salam Michael
    Singh, Thoudam Doren
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17) : 14823 - 14844
  • [10] Extremely Low-resource Multilingual Neural Machine Translation for Indic Mizo Language
    Lalrempuii, C.
    Soni, B.
    International Journal of Information Technology, 2023, 15 (8) : 4275 - 4282