The Task of Post-Editing Machine Translation for the Low-Resource Language

被引:4
|
作者
Rakhimova, Diana [1 ,2 ]
Karibayeva, Aidana [1 ,2 ]
Turarbek, Assem [1 ]
机构
[1] Al Farabi Kazakh Natl Univ, Dept Informat Syst, Alma Ata 050040, Kazakhstan
[2] Inst Informat & Comp Technol, Alma Ata 050010, Kazakhstan
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 02期
关键词
machine translation; post-editing machine translation; light post-editing; full post-editing; BRNN; transformer; English; Kazakh; Uzbek; Russian; HANDLING UNKNOWN WORDS; PRODUCT;
D O I
10.3390/app14020486
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In recent years, machine translation has made significant advancements; however, its effectiveness can vary widely depending on the language pair. Languages with limited resources, such as Kazakh, Uzbek, Kalmyk, Tatar, and others, often encounter challenges in achieving high-quality machine translations. Kazakh is an agglutinative language with complex morphology, making it a low-resource language. This article addresses the task of post-editing machine translation for the Kazakh language. The research begins by discussing the history and evolution of machine translation and how it has developed to meet the unique needs of languages with limited resources. The research resulted in the development of a machine translation post-editing system. The system utilizes modern machine learning methods, starting with neural machine translation using the BRNN model in the initial post-editing stage. Subsequently, the transformer model is applied to further edit the text. Complex structural and grammatical forms are processed, and abbreviations are replaced. Practical experiments were conducted on various texts: news publications, legislative documents, IT sphere, etc. This article serves as a valuable resource for researchers and practitioners in the field of machine translation, shedding light on effective post-editing strategies to enhance translation quality, particularly in scenarios involving languages with limited resources such as Kazakh and Uzbek. The obtained results were tested and evaluated using specialized metrics-BLEU, TER, and WER.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] When Student Translators Meet With Machine Translation: The Impacts of Machine Translation Quality and Perceived Self-Efficacy on Post-Editing Performance
    Peng, Xinyang
    Wang, Xiangling
    Li, Xiaoye
    SAGE OPEN, 2024, 14 (04):
  • [42] Decoding Strategies for Improving Low-Resource Machine Translation
    Park, Chanjun
    Yang, Yeongwook
    Park, Kinam
    Lim, Heuiseok
    ELECTRONICS, 2020, 9 (10) : 1 - 15
  • [43] A Survey on Machine Translation of Low-Resource Arabic Dialects
    Abdul-Nabi, Razan
    Obeidat, Rasha
    Bsoul, Anas
    2024 15TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS, ICICS 2024, 2024,
  • [44] What Do Translators Think About Post-Editing? A Mixed-Methods Study Of Translators' Fears, Worries And Preferences On Machine Translation Post-Editing
    Perez Macias, Lorena
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2020, (18): : 11 - 32
  • [45] Public service interpreting and translation training: a path towards digital adaptation to machine translation and post-editing
    Sanchez Ramos, Maria del Mar
    INTERPRETER AND TRANSLATOR TRAINER, 2022, 16 (03): : 294 - 308
  • [46] Devised Guidelines of Rapid Post-Editing in Machine Translation Output of Mark on Some Aspects
    屈亚媛
    王庆怡
    海外英语, 2012, (21) : 130 - 133
  • [47] The Use of the Survey to Find Out About the Perception of Machine Translation and Post-Editing in Spain
    Perez Macias, Lorena
    TRANS-REVISTA DE TRADUCTOLOGIA, 2020, (24): : 375 - 400
  • [48] Post-editing strategies in automatic translation of proverbs by FFL and translation learners
    Kandeel, Rana
    TEXTO LIVRE-LINGUAGEM E TECNOLOGIA, 2021, 14 (03):
  • [49] Low-Resource Neural Machine Translation with Neural Episodic Control
    Wu, Nier
    Hou, Hongxu
    Sun, Shuo
    Zheng, Wei
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [50] Low-Resource Machine Translation with Different Granularity Image Features
    Tayir, Turghun
    Li, Lin
    Maimaiti, Mieradilijiang
    Muhtar, Yusnur
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 260 - 273