Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation

被引:1
|
作者
Dugonik, Jani [1 ]
Maucec, Mirjam Sepesy [1 ]
Verber, Domen [1 ]
Brest, Janez [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, SI-2000 Maribor, Slovenia
关键词
neural machine translation; statistical machine translation; sentence embedding; similarity; classification; hybrid machine translation;
D O I
10.3390/math11112484
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper proposes a hybrid machine translation (HMT) system that improves the quality of neural machine translation (NMT) by incorporating statistical machine translation (SMT). Therefore, two NMT systems and two SMT systems were built for the Slovenian-English language pair, each for translation in one direction. We used a multilingual language model to embed the source sentence and translations into the same vector space. From each vector, we extracted features based on the distances and similarities calculated between the source sentence and the NMT translation, and between the source sentence and the SMT translation. To select the best possible translation, we used several well-known classifiers to predict which translation system generated a better translation of the source sentence. The proposed method of combining SMT and NMT in the hybrid system is novel. Our framework is language-independent and can be applied to other languages supported by the multilingual language model. Our experiment involved empirical applications. We compared the performance of the classifiers, and the results demonstrate that our proposed HMT system achieved notable improvements in the BLEU score, with an increase of 1.5 points and 10.9 points for both translation directions, respectively.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [2] Incorporating bilingual translation templates into neural machine translation
    Li, Fuxue
    Liu, Beibei
    Yan, Hong
    Xie, Peijun
    Li, Jiarui
    Zhang, Zhen
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [3] Neural and statistical machine translation: perception and productivity
    Lopez Pereira, Ariana
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2019, (17): : 1 - 19
  • [4] Preventing translation quality deterioration caused by beam search decoding in neural machine translation using statistical machine translation
    Satir, Emre
    Bulut, Hasan
    INFORMATION SCIENCES, 2021, 581 : 791 - 807
  • [5] Analysing terminology translation errors in statistical and neural machine translation
    Haque, Rejwanul
    Hasanuzzaman, Mohammed
    Way, Andy
    MACHINE TRANSLATION, 2020, 34 (2-3) : 149 - 195
  • [6] English-Basque Statistical and Neural Machine Translation
    Unanue, Inigo Jauregi
    Garmendia Arratibel, Lierni
    Borzeshi, Ehsan Zare
    Piccardi, Massimo
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 880 - 885
  • [7] MTIL2017: Machine Translation Using Recurrent Neural Network on Statistical Machine Translation
    Mahata, Sainik Kumar
    Das, Dipankar
    Bandyopadhyay, Sivaji
    JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) : 447 - 453
  • [8] Neural Machine Translation as a Novel Approach to Machine Translation
    Benkova, Lucia
    Benko, Lubomir
    DIVAI 2020: 13TH INTERNATIONAL SCIENTIFIC CONFERENCE ON DISTANCE LEARNING IN APPLIED INFORMATICS, 2020, : 499 - 508
  • [9] Statistical Machine Translation
    Vatsa, Mukesh G. S.
    Joshi, Nikita
    Goswami, Sumit
    DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2010, 30 (04): : 25 - 32
  • [10] Statistical machine translation method based on improved neural network
    Yang, Lingxing
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (01): : 1715 - 1719