Multi-way, multilingual neural machine translation

被引:40
|
作者
Firat, Orhan [1 ]
Cho, Kyunghyun [2 ]
Sankaran, Baskaran [3 ]
Vural, Fatos T. Yarman [1 ]
Bengio, Yoshua [4 ]
机构
[1] Middle East Tech Univ, Ankara, Turkey
[2] NYU, New York, NY 10003 USA
[3] IBM TJ Watson Res Ctr, Cambridge, MA USA
[4] Univ Montreal, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Neural machine translation; Multi-lingual; Low resource translation;
D O I
10.1016/j.csl.2016.10.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs. (C) 2016 Elsevier Ltd. All rights reserved
引用
收藏
页码:236 / 252
页数:17
相关论文
共 50 条
  • [31] Exploring Multi-Stage Information Interactions for Multi-Source Neural Machine Translation
    Lu, Ziyao
    Li, Xiang
    Liu, Yang
    Zhou, Chulun
    Cui, Jianwei
    Wang, Bin
    Zhang, Min
    Su, Jinsong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 562 - 570
  • [32] Neural Machine Translation Based on Multi-task Learning of Discourse Structure
    Kang X.-M.
    Zong C.-Q.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (10): : 3806 - 3818
  • [33] Multi-mechanism neural machine translation framework for automatic program repair
    Cao H.
    Han D.
    Chu Y.
    Tian F.
    Wang Y.
    Liu Y.
    Jia J.
    Ge H.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04) : 7859 - 7873
  • [34] Learning to decode to future success for multi-modal neural machine translation
    Huang, Yan
    Zhang, TianYuan
    Xu, Chun
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (02):
  • [35] Incorporating bilingual translation templates into neural machine translation
    Li, Fuxue
    Liu, Beibei
    Yan, Hong
    Xie, Peijun
    Li, Jiarui
    Zhang, Zhen
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [36] Neural Machine Translation for Amharic-English Translation
    Gezmu, Andargachew Mekonne
    Nuernberger, Andreas
    Bati, Tesfaye Bayu
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2021, : 526 - 532
  • [37] An Enhanced Method for Mongolian-Chinese Neural Machine Translation Using Multilingual Datastores and Chinese-Centric Methods
    Wang, Bailun
    Ji, Yatu
    Wu, Nier
    Liu, Xu
    Wang, Yanli
    Mao, Rui
    Zhou, Chao
    Jia, Yepai
    Zhao, Chen
    Ren, Qing-Dao-Er-Ji
    Liu, Na
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT IV, NLPCC 2024, 2025, 15362 : 159 - 170
  • [38] The Impact of Named Entity Translation for Neural Machine Translation
    Yan, Jinghui
    Zhang, Jiajun
    Xu, JinAn
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 63 - 73
  • [39] Improvements of Google Neural Machine Translation
    李瑞
    蒋美佳
    海外英语, 2017, (15) : 132 - 134
  • [40] Improving Neural Machine Translation Using Rule-Based Machine Translation
    Singh, Muskaan
    Kumar, Ravinder
    Chana, Inderveer
    2019 7TH INTERNATIONAL CONFERENCE ON SMART COMPUTING & COMMUNICATIONS (ICSCC), 2019, : 8 - 12