Maximum Entropy Model of Synonym Selection in Post-editing Machine Translation into Kazakh Language

被引:0
|
作者
Shormakova, Assem [1 ]
Tukeyev, Ualsher [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Alma Ata, Kazakhstan
来源
ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2024, PT II | 2024年 / 2166卷
关键词
Maximum entropy model; synonym selection; post-editing; machine translation; Kazakh;
D O I
10.1007/978-3-031-70259-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work presents a model, algorithm, and experimental studies for selecting synonyms of incorrectly translated words from English into Kazakh within the framework of machine translation post-editing technology. As a model for choosing synonyms, a maximum entropy model has been developed, the distinctive feature of which is the consideration of contextual words located at any distance from the translated word in a sentence (non-consecutive collocations), which takes into account the peculiarities of the Kazakh language. A feature of the proposed solution algorithm for this model is the use of the semantic cube model proposed by the authors. The developed maximum entropy model was learned on the 250 0000 sentences parallel Kazakh-English corpus, and a test set containing 25,000 sentences was conducted. Experiments on post-editing of machine translation of the Kazakh language compared with machine translation of Google Translate showed an improvement in the BLEU metric by 6 positions.
引用
收藏
页码:111 / 123
页数:13
相关论文
共 50 条
  • [1] Semantic Connections in the Complex Sentences for Post-Editing Machine Translation in the Kazakh Language
    Turganbayeva, Aliya
    Rakhimova, Diana
    Karyukin, Vladislav
    Karibayeva, Aidana
    Turarbek, Asem
    INFORMATION, 2022, 13 (09)
  • [2] On the correctness of machine translation: A machine translation post-editing task
    Koponen, Maarit
    Salmi, Leena
    JOURNAL OF SPECIALISED TRANSLATION, 2015, (23): : 117 - 135
  • [3] Second language learners' post-editing strategies for machine translation errors
    Shin, Dongkawang
    Chon, Yuah V.
    LANGUAGE LEARNING & TECHNOLOGY, 2023, 27 (01):
  • [4] The Task of Post-Editing Machine Translation for the Low-Resource Language
    Rakhimova, Diana
    Karibayeva, Aidana
    Turarbek, Assem
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [5] Development and Study of a Post-editing Model for Russian-Kazakh and English-Kazakh Translation Based on Machine Learning
    Rakhimova, Diana
    Sagat, Kamila
    Zhakypbaeva, Kamila
    Zhunussova, Aliya
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 1463 : 525 - 534
  • [6] Post-Editing Machine Translation As an FSL Exercise
    Kliffer, Michael D.
    PORTA LINGUARUM, 2008, (09) : 53 - 67
  • [7] Training in machine translation post-editing for foreign language students
    Zhang, Hong
    Torres-Hostench, Olga
    LANGUAGE LEARNING & TECHNOLOGY, 2022, 26 (01):
  • [8] Multidimensional strategy for the selection of machine translation candidates for post-editing
    de Gibert, Ona
    Aranberri, Nora
    LINGUAMATICA, 2019, 11 (02): : 3 - 16
  • [9] Post-editing Machine Translation in MateCat: a classroom experiment
    Herget, Katrin
    7TH INTERNATIONAL CONFERENCE ON HIGHER EDUCATION ADVANCES (HEAD'21), 2021, : 1003 - 1009
  • [10] Machine translation post-editing through emotional narratives
    Ramos, Maria del Mar Sanchez
    TRANSLATION AND TRANSLANGUAGING IN MULTILINGUAL CONTEXTS, 2025, 11 (01) : 31 - 47