Incorporating Typological Features into Language Selection for Multilingual Neural Machine Translation

Cited by: 1
Authors
Mi, Chenggang [1]
Zhu, Shaolin [2]
Fan, Yi [3]
Xie, Lei [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[2] Zhengzhou Univ Light Ind, Sch Software, Zhengzhou, Peoples R China
[3] Northwestern Polytech Univ, Sch Aeronaut, Xian, Peoples R China
Source
WEB AND BIG DATA, APWEB-WAIM 2021, PT I | 2021 / Vol. 12858
Funding
National Natural Science Foundation of China
Keywords
Language selection; Neural machine translation; Typological feature;
DOI
10.1007/978-3-030-85896-4_27
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose using rich semantic and typological information about languages to improve language selection for multilingual NMT. In particular, we first use a graph-based model to output the most semantically similar languages; then we build a random forest model that integrates features such as data size, language family, word formation, morpheme overlap, word order, POS tags, and syntactic similarity to predict the final target language(s). Experimental results on several datasets show that our method achieves consistent improvements over existing approaches on both language selection and multilingual NMT.
Pages: 348-357
Page count: 10
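
The two-stage selection outlined in the abstract (a graph-based shortlist, then a random forest over typological features) can be illustrated with a small sketch. This is not the authors' implementation: the similarity graph, the training rows, the feature values, and every function name below are hypothetical, and the "forest" is a deliberately tiny ensemble of depth-1 trees (stumps) trained on bootstrap samples with random feature subsets, written in plain Python so it runs without extra libraries.

```python
import random

# Feature order follows the abstract; all numeric values below are made up.
FEATURES = ["data_size", "language_family", "word_formation",
            "morpheme_overlap", "word_order", "pos_tag", "syntax"]

def top_k_similar(graph, lang, k):
    """Stage 1: keep the k neighbours with the largest similarity weight."""
    neighbours = graph.get(lang, {})
    return sorted(neighbours, key=neighbours.get, reverse=True)[:k]

def fit_stump(rows, labels, feature_ids):
    """Pick the (feature, threshold) pair with the fewest training errors.
    The stump predicts 'helpful' when row[feature] >= threshold."""
    best = None
    for f in feature_ids:
        for t in sorted({r[f] for r in rows}):
            errors = sum((r[f] >= t) != bool(y) for r, y in zip(rows, labels))
            if best is None or errors < best[0]:
                best = (errors, f, t)
    return best[1], best[2]

def fit_forest(rows, labels, n_trees=25, seed=0):
    """Stage 2 training: bagged stumps, each on a random feature subset."""
    rng = random.Random(seed)
    n_feat = max(1, int(len(rows[0]) ** 0.5))
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]    # bootstrap sample
        feats = rng.sample(range(len(rows[0])), n_feat)   # feature subset
        forest.append(fit_stump([rows[i] for i in idx],
                                [labels[i] for i in idx], feats))
    return forest

def forest_score(forest, row):
    """Fraction of stumps voting that the candidate language is helpful."""
    return sum(row[f] >= t for f, t in forest) / len(forest)

def select_languages(graph, lang, features, forest, k=3, n=1):
    """Shortlist by graph similarity, then rank the shortlist by forest vote."""
    shortlist = top_k_similar(graph, lang, k)
    ranked = sorted(shortlist,
                    key=lambda c: forest_score(forest, features[c]),
                    reverse=True)
    return ranked[:n]

# Hypothetical toy data: label 1 means the auxiliary language helped.
TRAIN_X = [
    [0.9, 1.0, 0.8, 0.7, 0.9, 0.8, 0.7],
    [0.8, 1.0, 0.7, 0.6, 0.8, 0.9, 0.8],
    [0.7, 1.0, 0.9, 0.8, 0.7, 0.7, 0.9],
    [0.3, 0.0, 0.2, 0.1, 0.3, 0.2, 0.1],
    [0.2, 0.0, 0.3, 0.2, 0.1, 0.3, 0.2],
    [0.4, 0.0, 0.1, 0.3, 0.2, 0.1, 0.3],
]
TRAIN_Y = [1, 1, 1, 0, 0, 0]

SIM_GRAPH = {"tur": {"aze": 0.9, "uzb": 0.8, "kor": 0.4, "deu": 0.2}}
CANDIDATE_FEATURES = {
    "aze": [0.6, 1.0, 0.9, 0.8, 0.9, 0.8, 0.8],
    "uzb": [0.5, 1.0, 0.8, 0.6, 0.9, 0.7, 0.7],
    "kor": [0.9, 0.0, 0.6, 0.1, 0.8, 0.4, 0.5],
}

if __name__ == "__main__":
    forest = fit_forest(TRAIN_X, TRAIN_Y)
    print(select_languages(SIM_GRAPH, "tur", CANDIDATE_FEATURES, forest))
```

Filtering by semantic similarity first keeps the second stage cheap, since the feature-based model only has to score a short list of candidates. In practice a library implementation such as a standard random forest classifier would replace the stump ensemble used here.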