DeepFault: Fault Localization for Deep Neural Networks

Cited by: 66
Authors
Eniser, Hasan Ferit [1]
Gerasimou, Simos [2]
Sen, Alper [1]
Affiliations
[1] Bogazici Univ, Istanbul, Turkey
[2] Univ York, York, N Yorkshire, England
Source
FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING (FASE 2019) | 2019 / Vol. 11424
Keywords
Deep Neural Networks; Fault localization; Test input generation
DOI
10.1007/978-3-030-16722-6_10
Chinese Library Classification
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Deep Neural Networks (DNNs) are increasingly deployed in safety-critical applications including autonomous vehicles and medical diagnostics. To reduce the residual risk for unexpected DNN behaviour and provide evidence for their trustworthy operation, DNNs should be thoroughly tested. The DeepFault whitebox DNN testing approach presented in our paper addresses this challenge by employing suspiciousness measures inspired by fault localization to establish the hit spectrum of neurons and identify suspicious neurons whose weights have not been calibrated correctly and thus are considered responsible for inadequate DNN performance. DeepFault also uses a suspiciousness-guided algorithm to synthesize new inputs, from correctly classified inputs, that increase the activation values of suspicious neurons. Our empirical evaluation on several DNN instances trained on MNIST and CIFAR-10 datasets shows that DeepFault is effective in identifying suspicious neurons. Also, the inputs synthesized by DeepFault closely resemble the original inputs, exercise the identified suspicious neurons and are highly adversarial.
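The hit-spectrum idea in the abstract can be illustrated with a minimal sketch. It assumes a neuron counts as "active" when its value exceeds a threshold, and it uses Ochiai, a standard spectrum-based fault localization measure, as the suspiciousness score. The function names, the threshold, and the toy data are hypothetical and are not taken from the record; the paper's own measures and implementation may differ.

```python
import math


def hit_spectrum(activations, correct, threshold=0.0):
    """Count, per neuron, how often it was active or inactive on
    correctly ('s') and incorrectly ('f') classified inputs.

    activations: one list of neuron values per test input.
    correct: one boolean per test input (True = classified correctly).
    threshold: assumed activation cut-off (an illustrative choice).
    """
    n_neurons = len(activations[0])
    spectrum = [{"as": 0, "af": 0, "ns": 0, "nf": 0} for _ in range(n_neurons)]
    for values, ok in zip(activations, correct):
        for j, value in enumerate(values):
            key = ("a" if value > threshold else "n") + ("s" if ok else "f")
            spectrum[j][key] += 1
    return spectrum


def ochiai(counts):
    """Ochiai score: high when a neuron is active mostly on failing inputs."""
    denom = math.sqrt((counts["af"] + counts["nf"]) * (counts["af"] + counts["as"]))
    return counts["af"] / denom if denom else 0.0


def most_suspicious(activations, correct, k=1, threshold=0.0):
    """Return the indices of the k highest-scoring neurons and all scores."""
    scores = [ochiai(c) for c in hit_spectrum(activations, correct, threshold)]
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return ranked[:k], scores


# Toy data: 4 inputs, 3 neurons; the first two inputs are misclassified.
activations = [
    [0.9, 0.0, 0.3],
    [0.8, 0.0, 0.0],
    [0.0, 0.7, 0.2],
    [0.0, 0.6, 0.0],
]
correct = [False, False, True, True]

top, scores = most_suspicious(activations, correct, k=1)
print(top)     # [0]
print(scores)  # [1.0, 0.0, 0.5]
```

In this toy run, neuron 0 fires only on misclassified inputs and therefore ranks as most suspicious; a suspiciousness-guided input generator would then perturb correctly classified inputs to raise that neuron's activation.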
Pages: 171-191
Page count: 21
Related papers
69 in total
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   A practical evaluation of spectrum-based fault localization [J].
Abreu, Rui ;
Zoeteweij, Peter ;
Golsteijn, Rob ;
van Gemund, Arjan J. C. .
JOURNAL OF SYSTEMS AND SOFTWARE, 2009, 82 (11) :1780-1792
[3]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[4]  
[Anonymous], CIFAR10 MODEL KERAS
[5]  
[Anonymous], 2016, P NIPS
[6]  
Artzi Shay, 2010, P 19 INT S SOFTW TES, P49, DOI 10.1145/1831708.1831715
[7]  
Burton Simon, 2017, Computer Safety, Reliability and Security, SAFECOMP 2017: Workshops ASSURE, DECSoS, SASSUR, TELERISE and TIPS. Proceedings: LNCS 10489, P5, DOI 10.1007/978-3-319-66284-8_1
[8]   Engineering Trustworthy Self-Adaptive Software with Dynamic Assurance Cases [J].
Calinescu, Radu ;
Weyns, Danny ;
Gerasimou, Simos ;
Iftikhar, Muhammad Usman ;
Habli, Ibrahim ;
Kelly, Tim .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (11) :1039-1069
[9]   Towards Evaluating the Robustness of Neural Networks [J].
Carlini, Nicholas ;
Wagner, David .
2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57
[10]  
Chollet F., 2015, Keras