Word Closure-Based Metamorphic Testing for Machine Translation

被引：0

作者：

Xie, Xiaoyuan ^{[1
]}

Jin, Shuo ^{[1
]}

Chen, Songqiang ^{[2
]}

Cheung, Shing-chi ^{[2
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China

[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China

来源：

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY | 2024年 / 33卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Machine translation; metamorphic testing; word closure; deep learning testing;

D O I：

10.1145/3675396

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

With the wide application of machine translation, the testing of Machine Translation Systems (MTSs) has attracted much attention. Recent works apply Metamorphic Testing (MT) to address the oracle problem in MTS testing. Existing MT methods for MTS generally follow the workflow of input transformation and output relation comparison, which generates a follow-up input sentence by mutating the source input and compares the source and follow-up output translations to detect translation errors, respectively. These methods use various input transformations to generate the test case pairs and have successfully triggered numerous translation errors. However, they have limitations in performing fine-grained and rigorous output relation comparison and thus may report many false alarms and miss many true errors. In this article, we propose a word closure-based output comparison method to address the limitations of the existing MTS MT methods. We first propose word closure as a new comparison unit, where each closure includes a group of correlated input and output words in the test case pair. Word closures suggest the linkages between the appropriate fragment in the source output translation and its counterpart in the follow-up output for comparison. Next, we compare the semantics on the level of word closure to identify the translation errors. In this way, we perform a fine-grained and rigorous semantic comparison for the outputs and thus realize more effective violation identification. We evaluate our method with the test cases generated by five existing input transformations and the translation outputs from three popular MTSs. Results show that our method significantly outperforms the existing works in violation identification by improving the precision and recall and achieving an average increase of 29.9% in F1 score. It also helps to increase the F1 score of translation error localization by 35.9%.

引用

页数：46

共 50 条

[21] Automatic Testing and Improvement of Machine Translation
Sun, Zeyu
Zhang, Jie M.
Harman, Mark
Papadakis, Mike
Zhang, Lu
2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 974 - 985
[22] Identifying Implementation Bugs in Machine Learning Based Image Classifiers using Metamorphic Testing
Dwarakanath, Anurag
Ahuja, Manish
Sikand, Samarth
Rao, Raghotham M.
Bose, R. P. Jagadeesh Chandra
Dubash, Neville
Podder, Sanjay
ISSTA'18: PROCEEDINGS OF THE 27TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, 2018, : 118 - 128
[23] Measuring word alignment quality for statistical machine translation
Fraser, Alexander
Marcu, Daniel
COMPUTATIONAL LINGUISTICS, 2007, 33 (03) : 293 - 303
[24] A Smaller and Better Word Embedding for Neural Machine Translation
Chen, Qi
IEEE ACCESS, 2023, 11 : 40770 - 40778
[25] Design and Testing of Automatic Machine Translation System Based on Chinese-English Phrase Translation
Ning, Jing
Ban, Haidong
MOBILE INFORMATION SYSTEMS, 2021, 2021
[26] An Experiment of Word Sense Disambiguation in a Machine Translation System
Faili, Heshaam
IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 28 - 34
[27] Research on English-Chinese machine translation shift based on word vector similarity
Ma, Qingqing
ARTIFICIAL LIFE AND ROBOTICS, 2024, 29 (04) : 585 - 589
[28] Better Addressing Word Deletion for Statistical Machine Translation
Li, Qiang
Zhang, Dongdong
Li, Mu
Xiao, Tong
Zhu, Jingbo
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 91 - 102
[29] A Cognitive Model of Chinese Word Segmentation for Machine Translation
Wu, Zhijie
META, 2011, 56 (03) : 631 - 644
[30] Statistic-based Target Word Selection in English-Chinese Machine Translation
刘小虎
李生
张波
Journal of Harbin Institute of Technology(New series), 1999, (01) : 46 - 49

← 1 2 3 4 5 →