DeepHunter: A Coverage-Guided Fuzz Testing Framework for Deep Neural Networks

被引:297
作者
Xie, Xiaofei [1 ]
Ma, Lei [2 ]
Juefei-Xu, Felix [3 ]
Xue, Minhui [4 ]
Chen, Hongxu [1 ]
Liu, Yang [1 ,5 ]
Zhao, Jianjun [2 ]
Li, Bo [6 ]
Yin, Jianxiong [7 ]
See, Simon [7 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Kyushu Univ, Fukuoka, Fukuoka, Japan
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Adelaide, Adelaide, SA, Australia
[5] Zhejiang Sci Tech Univ, Hangzhou, Zhejiang, Peoples R China
[6] Univ Illinois, Urbana, IL USA
[7] NVIDIA AI Tech Ctr, Singapore, Singapore
来源
PROCEEDINGS OF THE 28TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS (ISSTA '19) | 2019年
基金
新加坡国家研究基金会;
关键词
Deep learning testing; metamorphic testing; coverage-guided fuzzing;
D O I
10.1145/3293882.3330579
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The past decade has seen the great potential of applying deep neural network (DNN) based software to safety-critical scenarios, such as autonomous driving. Similar to traditional software, DNNs could exhibit incorrect behaviors, caused by hidden defects, leading to severe accidents and losses. In this paper, we propose DeepHunter, a coverage-guided fuzz testing framework for detecting potential defects of general-purpose DNNs. To this end, we first propose a metamorphic mutation strategy to generate new semantically preserved tests, and leverage multiple extensible coverage criteria as feedback to guide the test generation. We further propose a seed selection strategy that combines both diversity-based and recency-based seed selection. We implement and incorporate 5 existing testing criteria and 4 seed selection strategies in DeepHunter. Large-scale experiments demonstrate that (1) our metamorphic mutation strategy is useful to generate new valid tests with the same semantics as the original seed, by up to a 98% validity ratio; (2) the diversity-based seed selection generally weighs more than recency-based seed selection in boosting the coverage and in detecting defects; (3) DeepHunter outperforms the state of the arts by coverage as well as the quantity and diversity of defects identified; (4) guided by corner-region based criteria, DeepHunteris useful to capture defects during the DNN quantization for platform migration.
引用
收藏
页码:146 / 157
页数:12
相关论文
共 47 条
[21]  
Biggio Battista, 2013, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2013. Proceedings: LNCS 8190, P387, DOI 10.1007/978-3-642-40994-3_25
[22]   Coverage-Based Greybox Fuzzing as Markov Chain [J].
Bohme, Marcel ;
Van-Thuan Pham ;
Roychoudhury, Abhik .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2019, 45 (05) :489-506
[23]  
Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
[24]  
Courbariaux M., 2016, BinaryNet: Training deep neural networks with weights and activa
[25]   CollAFL: Path Sensitive Fuzzing [J].
Gan, Shuitao ;
Zhang, Chao ;
Qin, Xiaojun ;
Tu, Xuwen ;
Li, Kang ;
Pei, Zhongyu ;
Chen, Zuoning .
2018 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2018, :679-696
[26]  
Gopinath Divya., 2017, CoRR
[27]   Deep Neural Networks for Acoustic Modeling in Speech Recognition [J].
Hinton, Geoffrey ;
Deng, Li ;
Yu, Dong ;
Dahl, George E. ;
Mohamed, Abdel-rahman ;
Jaitly, Navdeep ;
Senior, Andrew ;
Vanhoucke, Vincent ;
Patrick Nguyen ;
Sainath, Tara N. ;
Kingsbury, Brian .
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :82-97
[28]   Perturbative Neural Networks [J].
Juefei-Xu, Felix ;
Boddeti, Vishnu Naresh ;
Savvides, Marios .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3310-3318
[29]  
Krizhevsky N., 2014, The CIFAR-10 dataset
[30]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324