Word Closure-Based Metamorphic Testing for Machine Translation

被引:0
|
作者
Xie, Xiaoyuan [1 ]
Jin, Shuo [1 ]
Chen, Songqiang [2 ]
Cheung, Shing-chi [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine translation; metamorphic testing; word closure; deep learning testing;
D O I
10.1145/3675396
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the wide application of machine translation, the testing of Machine Translation Systems (MTSs) has attracted much attention. Recent works apply Metamorphic Testing (MT) to address the oracle problem in MTS testing. Existing MT methods for MTS generally follow the workflow of input transformation and output relation comparison, which generates a follow-up input sentence by mutating the source input and compares the source and follow-up output translations to detect translation errors, respectively. These methods use various input transformations to generate the test case pairs and have successfully triggered numerous translation errors. However, they have limitations in performing fine-grained and rigorous output relation comparison and thus may report many false alarms and miss many true errors. In this article, we propose a word closure-based output comparison method to address the limitations of the existing MTS MT methods. We first propose word closure as a new comparison unit, where each closure includes a group of correlated input and output words in the test case pair. Word closures suggest the linkages between the appropriate fragment in the source output translation and its counterpart in the follow-up output for comparison. Next, we compare the semantics on the level of word closure to identify the translation errors. In this way, we perform a fine-grained and rigorous semantic comparison for the outputs and thus realize more effective violation identification. We evaluate our method with the test cases generated by five existing input transformations and the translation outputs from three popular MTSs. Results show that our method significantly outperforms the existing works in violation identification by improving the precision and recall and achieving an average increase of 29.9% in F1 score. It also helps to increase the F1 score of translation error localization by 35.9%.
引用
收藏
页数:46
相关论文
共 50 条
  • [1] Metamorphic Testing of Machine Translation Models using Back Translation
    Gao, Wentao
    He, Jiayuan
    Pham, Van-Thuan
    2023 IEEE/ACM INTERNATIONAL WORKSHOP ON DEEP LEARNING FOR TESTING AND TESTING FOR DEEP LEARNING, DEEPTEST, 2023, : 1 - 8
  • [2] Semantic Structure Invariance-Based Metamorphic Testing for Machine Translation Systems
    Sun, Chang-ai
    Mu, Jian
    Xiao, Mingjun
    Liu, Huai
    He, Pinjia
    IEEE TRANSACTIONS ON RELIABILITY, 2025,
  • [3] A Monte Carlo Method for Metamorphic Testing of Machine Translation Services
    Pesu, Daniel
    Zhou, Zhi Quan
    Zhen, Jingfeng
    Towey, Dave
    2018 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2018), 2018, : 38 - 45
  • [4] WORD BASED MACHINE TRANSLATION SYSTEM
    谢金宝
    孙岗
    杨振宇
    JournalofShanghaiJiaotongUniversity, 1999, (02) : 104 - 108
  • [5] Multi-granularity Metamorphic Testing for Neural Machine Translation System
    Zhong W.-K.
    Ge J.-D.
    Chen X.
    Li C.-Y.
    Tang Z.
    Luo B.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04): : 1051 - 1066
  • [6] Back Deduction Based Testing for Word Sense Disambiguation Ability of Machine Translation Systems
    Wang, Jun
    Li, Yanhui
    Huang, Xiang
    Chen, Lin
    Zhan, Xiaofang
    Zhou, Yuming
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 601 - 613
  • [7] An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation Systems
    Raganato, Alessandro
    Scherrer, Yves
    Tiedemann, Jorg
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3668 - 3675
  • [8] Fairness Testing of Machine Translation Systems
    Sun, Zeyu
    Chen, Zhenpeng
    Zhang, Jie
    Hao, Dan
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (06)
  • [9] SemMT: A Semantic-Based Testing Approach for Machine Translation Systems
    Cao, Jialun
    Li, Meiziniu
    Li, Yeting
    Wen, Ming
    Cheung, Shing-Chi
    Chen, Haiming
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2022, 31 (02)
  • [10] Testing and validating machine learning classifiers by metamorphic testing
    Xie, Xiaoyuan
    Ho, Joshua W. K.
    Murphy, Christian
    Kaiser, Gail
    Xu, Baowen
    Chen, Tsong Yueh
    JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (04) : 544 - 558