Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
来源
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PT II | 2024年 / 2166卷
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [41] The Use of the Survey to Find Out About the Perception of Machine Translation and Post-Editing in Spain
    Perez Macias, Lorena
    TRANS-REVISTA DE TRADUCTOLOGIA, 2020, (24): : 375 - 400
  • [42] Multi-modal indicators for estimating perceived cognitive load in post-editing of machine translation
    Herbig, Nico
    Pal, Santanu
    Vela, Mihaela
    Krueger, Antonio
    van Genabith, Josef
    MACHINE TRANSLATION, 2019, 33 (1-2) : 91 - 115
  • [43] Estimating post-editing time using a gold-standard set of machine translation errors
    Tezcan, Arda
    Hoste, Veronique
    Macken, Lieve
    COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 120 - 144
  • [44] The role of syntactic variation in translation and post-editing
    Bangalore, Srinivas
    Behrens, Bergljot
    Carl, Michael
    Ghankot, Maheshwar
    Heilmann, Arndt
    Nitzke, Jean
    Schaeffer, Moritz
    Sturm, Annegret
    TRANSLATION SPACES, 2015, 4 (01) : 119 - 143
  • [45] Translators' perceptions of literary post-editing using statistical and neural machine translation
    Moorkens, Joss
    Toral, Antonio
    Castilho, Sheila
    Way, Andy
    TRANSLATION SPACES, 2018, 7 (02) : 240 - 262
  • [46] Comparison of human, machine and Post-Editing Translation from Slovak into German by means of automatic Evaluation
    Munkova, Dasa
    Wrede, Ol'ga
    Absolon, Jakub
    ZEITSCHRIFT FUR SLAWISTIK, 2019, 64 (02): : 231 - 261
  • [47] Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation
    Culo, Oliver
    Nitzke, Jean
    BALTIC JOURNAL OF MODERN COMPUTING, 2016, 4 (02): : 106 - 114
  • [49] Devised Guidelines of Rapid Post-Editing in Machine Translation Output of Mark on Some Aspects
    屈亚媛
    王庆怡
    海外英语, 2012, (21) : 130 - 133
  • [50] Is post-editing really faster than human translation?
    Terribile, Silvia
    TRANSLATION SPACES, 2024, 13 (02) : 171 - 199