Analysing terminology translation errors in statistical and neural machine translation

被引:7
作者
Haque, Rejwanul [1 ]
Hasanuzzaman, Mohammed [1 ]
Way, Andy [1 ]
机构
[1] Dublin City Univ, ADAPT Ctr, Sch Comp, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
Terminology translation; Machine translation; Phrase-based statistical machine translation; Neural machine translation; QUALITY;
D O I
10.1007/s10590-020-09251-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Terminology translation plays a critical role in domain-specific machine translation (MT). Phrase-based statistical MT (PB-SMT) has been the dominant approach to MT for the past 30 years, both in academia and industry. Neural MT (NMT), an end-to-end learning approach to MT, is steadily taking the place of PB-SMT. In this paper, we conduct comparative qualitative evaluation and comprehensive error analysis on terminology translation in PB-SMT and NMT in two translation directions: English-to-Hindi and Hindi-to-English. To the best of our knowledge, there is no gold standard available for evaluating terminology translation quality in MT. For this reason we select an evaluation test set from a legal domain corpus and create a gold standard for evaluating terminology translation in MT. We also propose an error typology taking the terminology translation errors in MT into consideration. We translate sentences of the test set with our MT systems and terminology translations are manually classified as per the error typology. We evaluate the MT system's performance on terminology translation, and demonstrate our findings, unraveling strengths, weaknesses, and similarities of PB-SMT and NMT in the area of term translation.
引用
收藏
页码:149 / 195
页数:47
相关论文
共 50 条
[41]   Terminology-Enriched Meta-curriculum Learning for Domain Neural Machine Translation [J].
Chen, Zheng ;
Wang, Yifan .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 :379-390
[42]   Natural Language to Visualization by Neural Machine Translation [J].
Luo, Yuyu ;
Tang, Nan ;
Li, Guoliang ;
Tang, Jiawei ;
Chai, Chengliang ;
Qin, Xuedi .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) :217-226
[43]   Modeling Future Cost for Neural Machine Translation [J].
Duan, Chaoqun ;
Chen, Kehai ;
Wang, Rui ;
Utiyama, Masao ;
Sumita, Eiichiro ;
Zhu, Conghui ;
Zhao, Tiejun .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 :770-781
[44]   Analysis of Rule-Based Machine Translation and Neural Machine Translation Approaches for Translating Portuguese to LIBRAS [J].
Moraes de Oliveira, Caio Cesar ;
do Rego, Thais Gaudencio ;
Cavalcanti Brandao Lima, Manuella Aschoff ;
Ugulino de Araujo, Tiago Maritan .
WEBMEDIA 2019: PROCEEDINGS OF THE 25TH BRAZILLIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, 2019, :117-124
[45]   Statistical machine translation for Indic languages [J].
Das, Sudhansu Bala ;
Panda, Divyajyoti ;
Mishra, Tapas Kumar ;
Patra, Bidyut Kr. .
NATURAL LANGUAGE PROCESSING, 2025, 31 (02) :328-345
[46]   Improving Neural Machine Translation Using Rule-Based Machine Translation [J].
Singh, Muskaan ;
Kumar, Ravinder ;
Chana, Inderveer .
2019 7TH INTERNATIONAL CONFERENCE ON SMART COMPUTING & COMMUNICATIONS (ICSCC), 2019, :8-12
[47]   Improving Neural Machine Translation by Retrieving Target Translation Template [J].
Li, Fuxue ;
Chi, Chuncheng ;
Yan, Hong ;
Zhang, Zhen .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 :658-669
[48]   Word Position Aware Translation Memory for Neural Machine Translation [J].
He, Qiuxiang ;
Huang, Guoping ;
Liu, Lemao ;
Li, Li .
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 :367-379
[49]   Neural Machine Translation with Diversity-Enabled Translation Memory [J].
Quang Chieu Nguyen ;
Xuan Dung Doan ;
Van-Vinh Nguyen ;
Khac-Hoai Nam Bui .
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT I, 2023, 13995 :322-333
[50]   Neural machine translation and human translation A political and ideological perspective [J].
Sheng, Anfeng ;
Kong, Yankun .
BABEL-REVUE INTERNATIONALE DE LA TRADUCTION-INTERNATIONAL JOURNAL OF TRANSLATION, 2023, 69 (04) :483-498