Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
来源
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PT II | 2024年 / 2166卷
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [31] Cognitive effort in human translation and machine translation post-editing processes A holistic and phased view
    Wang, Yu
    Jalalian Daghigh, Ali
    FORUM-REVUE INTERNATIONALE D INTERPRETATION ET DE TRADUCTION-INTERNATIONAL JOURNAL OF INTERPRETATION AND TRANSLATION, 2023, 21 (01): : 139 - 162
  • [32] Can college students be post-editors? An investigation into employing language learners in machine translation plus post-editing settings
    Yamada, Masaru
    MACHINE TRANSLATION, 2015, 29 (01) : 49 - 67
  • [33] Identifying the Machine Translation Error Types with the Greatest Impact on Post-editing Effort
    Daems, Joke
    Vandepitte, Sonia
    Hartsuiker, Robert J.
    Macken, Lieve
    FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [34] Translation Quality and Error Recognition in Professional Neural Machine Translation Post-Editing
    Vardaro, Jennifer
    Schaeffer, Moritz
    Hansen-Schirra, Silvia
    INFORMATICS-BASEL, 2019, 6 (03):
  • [35] POST-EDITING PRACTICE IN SPECIALIZED TRANSLATION TRAINING
    Alvarez Garcia, Carmen
    CARACTERES-ESTUDIOS CULTURALES Y CRITICOS DE LA ESFERA DIGITAL, 2019, 8 (02): : 67 - 91
  • [36] Man or machine? Comparing the difficulty of human translation versus neural machine translation post-editing
    Jia, Yanfang
    Sun, Sanjun
    PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2023, 31 (05): : 950 - 968
  • [37] What Do Translators Think About Post-Editing? A Mixed-Methods Study Of Translators' Fears, Worries And Preferences On Machine Translation Post-Editing
    Perez Macias, Lorena
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2020, (18): : 11 - 32
  • [38] Using the TED Talks to Evaluate Spoken Post-editing of Machine Translation
    Liyanapathirana, Jeevanthi
    Popescu-Belis, Andrei
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2232 - 2239
  • [39] Public service interpreting and translation training: a path towards digital adaptation to machine translation and post-editing
    Sanchez Ramos, Maria del Mar
    INTERPRETER AND TRANSLATOR TRAINER, 2022, 16 (03): : 294 - 308
  • [40] The impact of Google Neural Machine Translation on Post-editing by student translators
    Yamada, Masaru
    JOURNAL OF SPECIALISED TRANSLATION, 2019, (31): : 87 - 106