DeepWeak: Weak Mutation Testing for Deep Learning Systems

被引：0

作者：

Xue, Yinjie ^{[1
]}

Zhang, Zhiyi ^{[1
,2
]}

Liu, Chen ^{[3
]}

Chen, Shuxian ^{[1
]}

Huang, Zhiqiu ^{[1
,4
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

[2] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing, Peoples R China

[3] Yangzhou Univ, Sch Marxism, Yangzhou, Jiangsu, Peoples R China

[4] Nanjing Univ Aeronaut & Astronaut, Minist Key Lab Safety Crit Software Dev & Verific, Nanjing, Peoples R China

来源：

2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS | 2024年

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

software testing; mutation testing; weak mutation; deep learning;

D O I：

10.1109/QRS62785.2024.00015

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The widespread application of deep learning (DL) makes it crucial to ensure its reliability. Mutation testing has been employed in DL testing to evaluate the quality of test suite. However, the largest problem of DL mutation testing is the high cost of executing numbers of mutants. Weak mutation technology can alleviate this problem by reducing the execution time of mutants in traditional software testing. However, the compared components in traditional software are too trivial to apply weak mutation to DL models directly for that it is impractical for testers to track and monitor massive parameters during execution process. In this paper, we propose a novel weak mutation framework for mutants generated by source-level mutation operators. DeepWeak treats all layers that make up the DL model directly as a set of components of model to replace trivial parameters. And it pays attention to the last convolutioanl layer for that they not only have impacts on prediction results but also are evident for weak analysis. By quantifying contribution of feature maps to the prediction, weight maps will be obtained on the basis of their weights. Finally, the judgements on whether mutants have been killed will be reached by comparing the maps. To evaluate the applicability and effectiveness of our approach, we conduct experiments on three widely used datasets and four deep learning models using three metrics. Experimental results show that DeepWeak is effective at alleviating costs problem, reducing runtime by 11.21% to 18.21% compared with the DL mutation testing with little accuracy loss.

引用

页码：49 / 60

页数：12

共 50 条

[31] Deep Learning Library Testing: Definition, Methods and Challenges
Zhang, Xiaoyu
Jiang, Weipeng
Shen, Chao
Li, Qi
Wang, Qian
Lin, Chenhao
Guan, Xiaohong
ACM COMPUTING SURVEYS, 2025, 57 (07)
[32] WeMuTe - A Weak Mutation Testing Tool for WS-BPEL
Boonyakulsrirung, Panya
Suwannasart, Taratip
INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, IMECS 2012, VOL I, 2012, : 810 - 815
[33] Mutant reduction based on dominance relation for weak mutation testing
Gong, Dunwei
Zhang, Gongjie
Yao, Xiangjuan
Meng, Fanlin
INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 81 : 82 - 96
[34] Application of Mutation testing in SafetyCritical Embedded Systems: A Case Study
Serban, Andrada A.
Micskei, Zoltan
ACTA POLYTECHNICA HUNGARICA, 2024, 21 (08)
[35] Mutation Based Testing of P Systems
Ipate, Florentin
Gheorghe, Marian
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2009, 4 (03) : 253 - 262
[36] Combinational Metamorphic Testing for Deep Learning based Target Detection CPS Systems
Ma, Siyou
Hu, Chi
Deng, Fei
Zhou, Yuyao
Shang, Xiaohu
Wu, Linbo
Zou, Wei
Li, Chunlei
Yang, Wansheng
Zhang, Hao
Ren, Fei
2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 250 - 255
[37] Mutation Testing
Reales, Pedro
Polo, Macario
Luis Fernandez-Aleman, Jose
Toval, Ambrosio
Piattini, Mario
IEEE SOFTWARE, 2014, 31 (03) : 30 - 35
[38] Taxonomy of Real Faults in Deep Learning Systems
Humbatova, Nargiz
Jahangirova, Gunel
Bavota, Gabriele
Riccio, Vincenzo
Stocco, Andrea
Tonella, Paolo
2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 1110 - 1121
[39] TEST-INT: A Testing Platform for Deep Learning Models
Ozdemir, Ovgu
Demir, Demet
2021 15TH TURKISH NATIONAL SOFTWARE ENGINEERING SYMPOSIUM (UYMS), 2021, : 155 - 157
[40] Learning from Faults: Mutation Testing in Active Automata Learning
Aichernig, Bernhard K.
Tappler, Martin
NASA FORMAL METHODS (NFM 2017), 2017, 10227 : 19 - 34

← 1 2 3 4 5 →