Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation

被引:1
|
作者
Dugonik, Jani [1 ]
Maucec, Mirjam Sepesy [1 ]
Verber, Domen [1 ]
Brest, Janez [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SI-2000 Maribor, Slovenia
关键词
neural machine translation; statistical machine translation; sentence embedding; similarity; classification; hybrid machine translation;
D O I
10.3390/math11112484
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper proposes a hybrid machine translation (HMT) system that improves the quality of neural machine translation (NMT) by incorporating statistical machine translation (SMT). Therefore, two NMT systems and two SMT systems were built for the Slovenian-English language pair, each for translation in one direction. We used a multilingual language model to embed the source sentence and translations into the same vector space. From each vector, we extracted features based on the distances and similarities calculated between the source sentence and the NMT translation, and between the source sentence and the SMT translation. To select the best possible translation, we used several well-known classifiers to predict which translation system generated a better translation of the source sentence. The proposed method of combining SMT and NMT in the hybrid system is novel. Our framework is language-independent and can be applied to other languages supported by the multilingual language model. Our experiment involved empirical applications. We compared the performance of the classifiers, and the results demonstrate that our proposed HMT system achieved notable improvements in the BLEU score, with an increase of 1.5 points and 10.9 points for both translation directions, respectively.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] MACHINE TRANSLATION: A CRITICAL LOOK AT THE PERFORMANCE OF RULE-BASED AND STATISTICAL MACHINE TRANSLATION
    Banitz, Brita
    CADERNOS DE TRADUCAO, 2020, 40 (01): : 54 - 71
  • [32] Neural Machine Translation for Amharic-English Translation
    Gezmu, Andargachew Mekonne
    Nuernberger, Andreas
    Bati, Tesfaye Bayu
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2021, : 526 - 532
  • [33] The Impact of Named Entity Translation for Neural Machine Translation
    Yan, Jinghui
    Zhang, Jiajun
    Xu, JinAn
    Zong, Chengqing
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 63 - 73
  • [34] Survey on Neural Machine Translation for multilingual translation system
    Basmatkar, Pranjali
    Holani, Hemant
    Kaushal, Shivani
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 443 - 448
  • [35] Online learning for effort reduction in interactive neural machine translation
    Peris, Alvaro
    Casacuberta, Francisco
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 98 - 126
  • [36] Paraphrase Lattice for Statistical Machine Translation
    Onishi, Takashi
    Utiyama, Masao
    Sumita, Eiichiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (06) : 1299 - 1305
  • [37] Statistical machine translation for Indic languages
    Das, Sudhansu Bala
    Panda, Divyajyoti
    Mishra, Tapas Kumar
    Patra, Bidyut Kr.
    NATURAL LANGUAGE PROCESSING, 2025, 31 (02): : 328 - 345
  • [38] Migrating Code with Statistical Machine Translation
    Anh Tuan Nguyen
    Tung Thanh Nguyen
    Nguyen, Tien N.
    36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE COMPANION 2014), 2014, : 544 - 547
  • [39] Spatial Ontology in Statistical Machine Translation
    Skadins, Raivis
    DATABASES AND INFORMATION SYSTEMS, 2010, : 409 - 421
  • [40] Bilingual phrases for statistical machine translation
    Garcia-Varea, I.
    Nevado, F.
    Ortiz, D.
    Tomas, J.
    Casacuberta, F.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 93 - 100