Seed Selection for Testing Deep Neural Networks

被引:7
作者
Zhi, Yuhan [1 ]
Xie, Xiaofei [2 ]
Shen, Chao [1 ]
Sun, Jun [2 ]
Zhang, Xiaoyu [1 ]
Guan, Xiaohong [1 ]
机构
[1] Xi An Jiao Tong Univ, 28 Xianning West Rd, Xian 710049, Peoples R China
[2] Singapore Management Univ, 81 Victoria St, Singapore 188065, Singapore
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Deep learning testing; seed selection; coverage; robustness; TEST DATA GENERATION;
D O I
10.1145/3607190
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep learning (DL) has been applied in many applications. Meanwhile, the quality of DL systems is becoming a big concern. To evaluate the quality of DL systems, a number of DL testing techniques have been proposed. To generate test cases, a set of initial seed inputs are required. Existing testing techniques usually construct seed corpus by randomly selecting inputs from training or test dataset. Till now, there is no study on how initial seed inputs affect the performance of DL testing and how to construct an optimal one. To fill this gap, we conduct the first systematic study to evaluate the impact of seed selection strategies on DL testing. Specifically, considering three popular goals of DL testing (i.e., coverage, failure detection, and robustness), we develop five seed selection strategies, including three based on single-objective optimization (SOO) and two based on multi-objective optimization (MOO). We evaluate these strategies on seven testing tools. Our results demonstrate that the selection of initial seed inputs greatly affects the testing performance. SOO-based selection can construct the best seed corpus that can boost DL testing with respect to the specific testing goal. MOO-based selection strategies can construct seed corpus that achieve balanced improvement on multiple objectives.
引用
收藏
页数:33
相关论文
共 96 条
[1]  
Abdelnur H., 2010, SPECTRAL FUZZING EVA
[2]  
Aizatsky Mike, 2016, Google Testing Blog
[3]  
Alshahwan N., 2011, 2011 26th IEEE/ACM International Conference on Automated Software Engineering, P3, DOI 10.1109/ASE.2011.6100082
[4]   Search-based software test data generation for string data using program-specific search operators [J].
Alshraideh, Mohammad ;
Bottaci, Leonardo .
SOFTWARE TESTING VERIFICATION & RELIABILITY, 2006, 16 (03) :175-203
[5]  
[Anonymous], 2010, P 1 INT WORKSHOP SOF
[6]   REDQUEEN: Fuzzing with Input-to-State Correspondence [J].
Aschermann, Cornelius ;
Schumilo, Sergej ;
Blazytko, Tim ;
Gawlik, Robert ;
Holz, Thorsten .
26TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2019), 2019,
[7]   A novel non-dominated sorting algorithm for evolutionary multi-objective optimization [J].
Bao, Chunteng ;
Xu, Lihong ;
Goodman, Erik D. ;
Cao, Leilei .
JOURNAL OF COMPUTATIONAL SCIENCE, 2017, 23 :31-43
[8]   Testing Autonomous Cars for Feature Interaction Failures using Many-Objective Search [J].
Ben Abdessalem, Raja ;
Panichella, Annibale ;
Nejati, Shiva ;
Briand, Lionel C. ;
Stifter, Thomas .
PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18), 2018, :143-154
[9]   Fuzzing: On the Exponential Cost of Vulnerability Discovery [J].
Bohme, Marcel ;
Falk, Brandon .
PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20), 2020, :713-724
[10]   Boosting Fuzzer Efficiency: An Information Theoretic Perspective [J].
Bohme, Marcel ;
Manes, Valentin J. M. ;
Cha, Sang Kil .
PROCEEDINGS OF THE 28TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '20), 2020, :678-689