Word Closure-Based Metamorphic Testing for Machine Translation

被引:0
|
作者
Xie, Xiaoyuan [1 ]
Jin, Shuo [1 ]
Chen, Songqiang [2 ]
Cheung, Shing-chi [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine translation; metamorphic testing; word closure; deep learning testing;
D O I
10.1145/3675396
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the wide application of machine translation, the testing of Machine Translation Systems (MTSs) has attracted much attention. Recent works apply Metamorphic Testing (MT) to address the oracle problem in MTS testing. Existing MT methods for MTS generally follow the workflow of input transformation and output relation comparison, which generates a follow-up input sentence by mutating the source input and compares the source and follow-up output translations to detect translation errors, respectively. These methods use various input transformations to generate the test case pairs and have successfully triggered numerous translation errors. However, they have limitations in performing fine-grained and rigorous output relation comparison and thus may report many false alarms and miss many true errors. In this article, we propose a word closure-based output comparison method to address the limitations of the existing MTS MT methods. We first propose word closure as a new comparison unit, where each closure includes a group of correlated input and output words in the test case pair. Word closures suggest the linkages between the appropriate fragment in the source output translation and its counterpart in the follow-up output for comparison. Next, we compare the semantics on the level of word closure to identify the translation errors. In this way, we perform a fine-grained and rigorous semantic comparison for the outputs and thus realize more effective violation identification. We evaluate our method with the test cases generated by five existing input transformations and the translation outputs from three popular MTSs. Results show that our method significantly outperforms the existing works in violation identification by improving the precision and recall and achieving an average increase of 29.9% in F1 score. It also helps to increase the F1 score of translation error localization by 35.9%.
引用
收藏
页数:46
相关论文
共 50 条
  • [21] Automatic Testing and Improvement of Machine Translation
    Sun, Zeyu
    Zhang, Jie M.
    Harman, Mark
    Papadakis, Mike
    Zhang, Lu
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 974 - 985
  • [22] Identifying Implementation Bugs in Machine Learning Based Image Classifiers using Metamorphic Testing
    Dwarakanath, Anurag
    Ahuja, Manish
    Sikand, Samarth
    Rao, Raghotham M.
    Bose, R. P. Jagadeesh Chandra
    Dubash, Neville
    Podder, Sanjay
    ISSTA'18: PROCEEDINGS OF THE 27TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, 2018, : 118 - 128
  • [23] Measuring word alignment quality for statistical machine translation
    Fraser, Alexander
    Marcu, Daniel
    COMPUTATIONAL LINGUISTICS, 2007, 33 (03) : 293 - 303
  • [24] A Smaller and Better Word Embedding for Neural Machine Translation
    Chen, Qi
    IEEE ACCESS, 2023, 11 : 40770 - 40778
  • [25] Design and Testing of Automatic Machine Translation System Based on Chinese-English Phrase Translation
    Ning, Jing
    Ban, Haidong
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [26] An Experiment of Word Sense Disambiguation in a Machine Translation System
    Faili, Heshaam
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 28 - 34
  • [27] Research on English-Chinese machine translation shift based on word vector similarity
    Ma, Qingqing
    ARTIFICIAL LIFE AND ROBOTICS, 2024, 29 (04) : 585 - 589
  • [28] Better Addressing Word Deletion for Statistical Machine Translation
    Li, Qiang
    Zhang, Dongdong
    Li, Mu
    Xiao, Tong
    Zhu, Jingbo
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 91 - 102
  • [29] A Cognitive Model of Chinese Word Segmentation for Machine Translation
    Wu, Zhijie
    META, 2011, 56 (03) : 631 - 644
  • [30] Statistic-based Target Word Selection in English-Chinese Machine Translation
    刘小虎
    李生
    张波
    Journal of Harbin Institute of Technology(New series), 1999, (01) : 46 - 49