Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
来源
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PT II | 2024年 / 2166卷
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [21] MACHINE TRANSLATION AND POST-EDITING IN WILDLIFE DOCUMENTARIES: CHALLENGES AND POSSIBLE SOLUTIONS
    Ortiz-Boix, Carla
    HERMENEUS, 2016, (18): : 269 - 313
  • [22] MACHINE TRANSLATION AND POST-EDITING: PROFILES AND COMPETENCES IN TRANSLATOR TRAINING PROGRAMMES
    Cid-Leal, Pilar
    Espin-Garcia, Maria-Carmen
    Presas, Marisa
    MONTI, 2019, 11 : 187 - 212
  • [23] Post-editing of machine translation while reading on English proficiency levels
    Kim, Hea-Suk
    Cha, Yoonjung
    LINGUISTIC RESEARCH, 2023, 40 : 89 - 126
  • [24] Perspectives and use of machine translation and post-editing in audiovisual translation: The point of view of professionals
    Martin, Jose Fernando Carrero
    Oliver, Beatriz Reverter
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2024, (22): : 302 - 322
  • [25] The Trials and Tribulations of Predicting Machine Translation Post-Editing Productivity
    Marg, Lena
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 23 - 26
  • [26] Multi-Modal Approaches for Post-Editing Machine Translation
    Herbig, Nico
    Pal, Santanu
    van Genabith, Josef
    Krueger, Antonio
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [27] Comparing the Quality of Neural Machine Translation and Professional Post-Editing
    Vardaro, Jennifer
    Schaeffer, Moritz
    Hansen-Schirra, Silvia
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2019,
  • [28] Re-thinking Machine Translation Post-Editing Guidelines
    Perez, Celia Rico
    JOURNAL OF SPECIALISED TRANSLATION, 2024, (41): : 26 - 47
  • [29] The Role of Machine Translation Quality Estimation in the Post-Editing Workflow
    Bechara, Hannah
    Orasan, Constantin
    Escartin, Carla Parra
    Zampieri, Marcos
    Lowe, William
    INFORMATICS-BASEL, 2021, 8 (03):
  • [30] USING POST-EDITING IN TRANSLATION AND LSP COURSES
    Udina, Natalia
    PROCEEDINGS OF INTCESS 2019- 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND SOCIAL SCIENCES, 2019, : 1097 - 1101