Multi-way, multilingual neural machine translation

Cited by: 40
Authors
Firat, Orhan [1]
Cho, Kyunghyun [2]
Sankaran, Baskaran [3]
Vural, Fatos T. Yarman [1]
Bengio, Yoshua [4]
Affiliations
[1] Middle East Tech Univ, Ankara, Turkey
[2] NYU, New York, NY 10003 USA
[3] IBM TJ Watson Res Ctr, Cambridge, MA USA
[4] Univ Montreal, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Neural machine translation; Multi-lingual; Low resource translation
DOI
10.1016/j.csl.2016.10.006
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs. © 2016 Elsevier Ltd. All rights reserved.
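The linear-growth claim in the abstract can be illustrated with a rough parameter count. The sketch below is not taken from the paper: it assumes one encoder and one decoder per language plus a single shared attention module, and compares that with training a separate model for every directed language pair. The function names and the sizes (10, 10, and 1, in arbitrary units) are hypothetical placeholders.

```python
def pairwise_params(n_langs, enc=10, dec=10, att=1):
    """Separate model per directed language pair (assumed baseline).

    Each of the n * (n - 1) directed pairs gets its own encoder,
    decoder, and attention module, so the total grows quadratically.
    """
    return n_langs * (n_langs - 1) * (enc + dec + att)


def multiway_params(n_langs, enc=10, dec=10, att=1):
    """One encoder and one decoder per language, one shared attention.

    The attention module is counted once regardless of how many
    languages there are, so the total grows linearly.
    """
    return n_langs * (enc + dec) + att


if __name__ == "__main__":
    # Sizes are arbitrary units chosen for illustration only.
    for n in (2, 4, 8):
        print(n, pairwise_params(n), multiway_params(n))
```

Under these assumed sizes, each added language costs the multi-way setup one more encoder and one more decoder, while the per-pair setup gains a full model for every new directed pair.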
Pages: 236-252
Number of pages: 17
Related papers
50 in total
  • [41] Unsupervised dialectal neural machine translation
    Farhan, Wael
    Talafha, Bashar
    Abuammar, Analle
    Jaikat, Ruba
    Al-Ayyoub, Mahmoud
    Tarakji, Ahmad Bisher
    Toma, Anas
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [42] Neural Machine Translation for English to Hindi
    Saini, Sandeep
    Sahula, Vineet
    2018 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2018, : 25 - 30
  • [43] Neural Machine Translation for Indian Languages
    Pathak, Amarnath
    Pakray, Partha
    JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) : 465 - 477
  • [44] NEURAL MACHINE TRANSLATION WITH ACOUSTIC EMBEDDING
    Kano, Takatomo
    Sakti, Sakriani
    Nakamura, Satoshi
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 578 - 584
  • [45] Improving Neural Machine Translation with Neural Sentence Rewriting
    Wu, Tian
    He, Zhongjun
    Chen, Enhong
    Wang, Haifeng
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 147 - 152
  • [46] Synchronous Bidirectional Neural Machine Translation
    Zhou, Long
    Zhang, Jiajun
    Zong, Chengqing
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 : 91 - 105
  • [47] Ancient Korean Neural Machine Translation
    Park, Chanjun
    Lee, Chanhee
    Yang, Yeongwook
    Lim, Heuiseok
    IEEE ACCESS, 2020, 8 : 116617 - 116625
  • [48] Multi-granularity Knowledge Sharing in Low-resource Neural Machine Translation
    Mi, Chenggang
    Xie, Shaoliang
    Fan, Yi
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
  • [49] An error analysis for image-based multi-modal neural machine translation
    Calixto, Iacer
    Liu, Qun
    MACHINE TRANSLATION, 2019, 33 (1-2) : 155 - 177
  • [50] Improving Neural Machine Translation by Retrieving Target Translation Template
    Li, Fuxue
    Chi, Chuncheng
    Yan, Hong
    Zhang, Zhen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 658 - 669