Translation Quality and Error Recognition in Professional Neural Machine Translation Post-Editing

Cited by: 21
Authors
Vardaro, Jennifer [1]
Schaeffer, Moritz [1]
Hansen-Schirra, Silvia [1]
Affiliations
[1] Johannes Gutenberg Univ Mainz, English Linguist & Translat Studies, D-76726 Mainz, Germany
Source
INFORMATICS-BASEL | 2019 / Vol. 6 / Issue 03
Keywords
neural machine translation; post-editing; revision; error annotations; Hjerson; MQM; European Commission (DGT); eye-tracking; key-logging; post-editing effort; ENGLISH
DOI
10.3390/informatics6030041
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This study aims to analyse how translation experts from the German department of the European Commission's Directorate-General for Translation (DGT) identify and correct different error categories in neural machine translated texts (NMT) and their post-edited versions (NMTPE). The term translation expert encompasses translator, post-editor as well as revisor. Even though we focus on neural machine-translated segments, translator and post-editor are used synonymously because of the combined workflow using CAT-Tools as well as machine translation. Only the distinction between post-editor, which refers to a DGT translation expert correcting the neural machine translation output, and revisor, which refers to a DGT translation expert correcting the post-edited version of the neural machine translation output, is important and made clear whenever relevant. Using an automatic error annotation tool and the more fine-grained manual error annotation framework to identify characteristic error categories in the DGT texts, a corpus analysis revealed that quality assurance measures by post-editors and revisors of the DGT are most often necessary for lexical errors. More specifically, the corpus analysis showed that, if post-editors correct mistranslations, terminology or stylistic errors in an NMT sentence, revisors are likely to correct the same error type in the same post-edited sentence, suggesting that the DGT experts were being primed by the NMT output. Subsequently, we designed a controlled eye-tracking and key-logging experiment to compare participants' eye movements for test sentences containing the three identified error categories (mistranslations, terminology or stylistic errors) and for control sentences without errors. We examined the three error types' effect on early (first fixation durations, first pass durations) and late eye movement measures (e.g., total reading time and regression path durations). 
Linear mixed-effects regression models predict what kind of behaviour of the DGT experts is associated with the correction of different error types during the post-editing process.
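The early and late eye movement measures named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' analysis pipeline. It assumes the definitions conventionally used in reading research, and the `Fixation` type, the `reading_measures` function, and the sample data are hypothetical names and values introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Fixation:
    region: int    # index of the fixated word/region, counted left to right (assumption)
    duration: int  # fixation duration in milliseconds


def reading_measures(fixations: List[Fixation], target: int) -> Optional[Dict[str, Optional[int]]]:
    """Compute four common reading measures for the region `target`.

    Returns None if the region was never fixated. The three first-pass-based
    measures are None if a later region was fixated before `target`
    (the region was skipped on the first pass).
    """
    first = next((i for i, f in enumerate(fixations) if f.region == target), None)
    if first is None:
        return None

    skipped = any(f.region > target for f in fixations[:first])
    first_fixation = first_pass = regression_path = None

    if not skipped:
        # Early measure: duration of the very first fixation on the region.
        first_fixation = fixations[first].duration

        # Early measure: sum of fixations from first entry until the region is left.
        first_pass = 0
        for f in fixations[first:]:
            if f.region != target:
                break
            first_pass += f.duration

        # Late measure: sum of fixations from first entry until the gaze moves
        # past the region to the right, including regressions to earlier regions.
        regression_path = 0
        for f in fixations[first:]:
            if f.region > target:
                break
            regression_path += f.duration

    # Late measure: sum of all fixations on the region, whenever they occur.
    total_reading_time = sum(f.duration for f in fixations if f.region == target)

    return {
        "first_fixation_duration": first_fixation,
        "first_pass_duration": first_pass,
        "regression_path_duration": regression_path,
        "total_reading_time": total_reading_time,
    }


# Hypothetical fixation sequence: the reader regresses from region 2 to region 1,
# returns to region 2, moves on to region 3, then revisits region 2.
demo = [
    Fixation(0, 200), Fixation(1, 180), Fixation(2, 250), Fixation(1, 150),
    Fixation(2, 220), Fixation(3, 190), Fixation(2, 100),
]
print(reading_measures(demo, target=2))
```

For region 2 in this sample, the early measures are both 250 ms, the regression path duration is 620 ms, and the total reading time is 570 ms. Per-region values of this kind would typically serve as dependent variables in mixed-effects models such as those the abstract mentions.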
Pages: 29
Related papers
50 in total
  • [21] The role of syntactic variation in translation and post-editing
    Bangalore, Srinivas
    Behrens, Bergljot
    Carl, Michael
    Ghankot, Maheshwar
    Heilmann, Arndt
    Nitzke, Jean
    Schaeffer, Moritz
    Sturm, Annegret
    TRANSLATION SPACES, 2015, 4 (01) : 119 - 143
  • [22] Reasons for the increasing use of Machine translation followed by post-editing
    Sanchez-Martinez, Felipe
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2012, (10) : 150 - 156
  • [23] Perspectives and use of machine translation and post-editing in audiovisual translation: The point of view of professionals
    Martin, Jose Fernando Carrero
    Oliver, Beatriz Reverter
    TRADUMATICA-TRADUCCIO I TECNOLOGIES DE LA INFORMACIO I LA COMUNICACIO, 2024, (22) : 302 - 322
  • [24] Multi-Modal Approaches for Post-Editing Machine Translation
    Herbig, Nico
    Pal, Santanu
    van Genabith, Josef
    Krueger, Antonio
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [25] Multidimensional strategy for the selection of machine translation candidates for post-editing
    de Gibert, Ona
    Aranberri, Nora
    LINGUAMATICA, 2019, 11 (02) : 3 - 16
  • [26] Translation Methods and Experience: A Comparative Analysis of Human Translation and Post-editing with Students and Professional Translators
    Daems, Joke
    Vandepitte, Sonia
    Hartsuiker, Robert J.
    Macken, Lieve
    META, 2017, 62 (02) : 245 - 270
  • [27] How does the post-editing of neural machine translation compare with from-scratch translation? A product and process study
    Jia, Yanfang
    Carl, Michael
    Wang, Xiangling
    JOURNAL OF SPECIALISED TRANSLATION, 2019, (31) : 60 - 86
  • [28] Patterns of Terminological Variation in Post-editing and of Cognate Use in Machine Translation in Contrast to Human Translation
    Culo, Oliver
    Nitzke, Jean
    BALTIC JOURNAL OF MODERN COMPUTING, 2016, 4 (02) : 106 - 114
  • [29] Second language learners' post-editing strategies for machine translation errors
    Shin, Dongkawang
    Chon, Yuah V.
    LANGUAGE LEARNING & TECHNOLOGY, 2023, 27 (01):
  • [30] The Task of Post-Editing Machine Translation for the Low-Resource Language
    Rakhimova, Diana
    Karibayeva, Aidana
    Turarbek, Assem
    APPLIED SCIENCES-BASEL, 2024, 14 (02):