DeepWeak: Weak Mutation Testing for Deep Learning Systems

被引:0
作者
Xue, Yinjie [1 ]
Zhang, Zhiyi [1 ,2 ]
Liu, Chen [3 ]
Chen, Shuxian [1 ]
Huang, Zhiqiu [1 ,4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Peoples R China
[3] Yangzhou Univ, Sch Marxism, Yangzhou, Jiangsu, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Minist Key Lab Safety Crit Software Dev & Verific, Nanjing, Peoples R China
来源
2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS | 2024年
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
software testing; mutation testing; weak mutation; deep learning;
D O I
10.1109/QRS62785.2024.00015
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The widespread application of deep learning (DL) makes it crucial to ensure its reliability. Mutation testing has been employed in DL testing to evaluate the quality of test suite. However, the largest problem of DL mutation testing is the high cost of executing numbers of mutants. Weak mutation technology can alleviate this problem by reducing the execution time of mutants in traditional software testing. However, the compared components in traditional software are too trivial to apply weak mutation to DL models directly for that it is impractical for testers to track and monitor massive parameters during execution process. In this paper, we propose a novel weak mutation framework for mutants generated by source-level mutation operators. DeepWeak treats all layers that make up the DL model directly as a set of components of model to replace trivial parameters. And it pays attention to the last convolutioanl layer for that they not only have impacts on prediction results but also are evident for weak analysis. By quantifying contribution of feature maps to the prediction, weight maps will be obtained on the basis of their weights. Finally, the judgements on whether mutants have been killed will be reached by comparing the maps. To evaluate the applicability and effectiveness of our approach, we conduct experiments on three widely used datasets and four deep learning models using three metrics. Experimental results show that DeepWeak is effective at alleviating costs problem, reducing runtime by 11.21% to 18.21% compared with the DL mutation testing with little accuracy loss.
引用
收藏
页码:49 / 60
页数:12
相关论文
共 50 条
  • [41] Mutation Testing in Evolving Systems: Studying the Relevance of Mutants to Code Evolution
    Ojdanic, Milos
    Soremekun, Ezekiel
    Degiovanni, Renzo
    Papadakis, Mike
    Le Traon, Yves
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (01)
  • [42] Metamorphic Testing of Deep Learning Compilers
    Xiao, Dongwei
    Liu, Zhibo
    Yuan, Yuanyuan
    Pang, Qi
    Wang, Shuai
    [J]. PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2022, 6 (01)
  • [43] TSDTest: A Efficient Coverage Guided Two-Stage Testing for Deep Learning Systems
    Li, Haoran
    Wang, Shihai
    Shi, Tengfei
    Fang, Xinyue
    Chen, Jian
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 173 - 178
  • [44] Testing Deep Learning Recommender Systems Models on Synthetic GAN-Generated Datasets
    Bobadilla, Jesus
    Gutierrez, Abraham
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023,
  • [45] Small and weak target detection based on deep learning
    Wang, Ting
    Cao, Changqing
    Zeng, Xiaodong
    Feng, Zhejun
    Liu, Yutao
    Ning, Jinna
    [J]. AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
  • [46] Applicability of weak samples to deep learning crop classification
    Xu Q.
    Zhang J.
    Zhang F.
    Ge S.
    Yang Z.
    Duan Y.
    [J]. National Remote Sensing Bulletin, 2022, 26 (07) : 1395 - 1409
  • [47] Parallel mutation testing for large scale systems
    Canizares, Pablo C.
    Nunez, Alberto
    Filgueira, Rosa
    de Lara, Juan
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (02): : 2071 - 2097
  • [48] Parallel mutation testing for large scale systems
    Pablo C. Cañizares
    Alberto Núñez
    Rosa Filgueira
    Juan de Lara
    [J]. Cluster Computing, 2024, 27 : 2071 - 2097
  • [49] DeepAtash: Focused Test Generation for Deep Learning Systems
    Zohdinasab, Tahereh
    Riccio, Vincenzo
    Tonella, Paolo
    [J]. PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 954 - 966
  • [50] DEEPMETIS: Augmenting a Deep Learning Test Set to Increase its Mutation Score
    Riccio, Vincenzo
    Humbatova, Nargiz
    Jahangirova, Gunel
    Tonella, Paolo
    [J]. 2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 355 - 367