Structural Coverage Criteria for Neural Networks Could Be Misleading

被引:93
作者
Li, Zenan [1 ]
Ma, Xiaoxing [1 ]
Xu, Chang [1 ]
Cao, Chun [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING RESULTS (ICSE-NIER 2019) | 2019年
基金
中国国家自然科学基金;
关键词
Software Testing; Neural Networks; Coverage;
D O I
10.1109/ICSE-NIER.2019.00031
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
There is a dramatically increasing interest in the quality assurance for DNN-based systems in the software engineering community. An emerging hot topic in this direction is structural coverage criteria for testing neural networks, which are inspired by coverage metrics used in conventional software testing. In this short paper, we argue that these criteria could be misleading because of the fundamental differences between neural networks and human written programs. Our preliminary exploration shows that (1) adversarial examples are pervasively distributed in the finely divided space defined by such coverage criteria, while available natural samples are very sparse, and as a consequence, (2) previously reported fault-detection "capabilities" conjectured from high coverage testing are more likely due to the adversary-oriented search but not the real "high" coverage.
引用
收藏
页码:89 / 92
页数:4
相关论文
共 20 条
[1]  
[Anonymous], 2014, AUTONOMOMOUS VEHICLE
[2]  
[Anonymous], 2015, 3 INT C LEARN REPR I
[3]  
[Anonymous], 2001, NASATM2001210876
[4]  
[Anonymous], 2016, TECHNICAL REPORT CLE
[5]  
[Anonymous], 2018, ARXIV180208686
[6]   Towards Evaluating the Robustness of Neural Networks [J].
Carlini, Nicholas ;
Wagner, David .
2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57
[7]  
Ciresan D., 2012, ADV NEURAL INFORM PR, V25, P2843
[8]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[9]  
Ma L., 2018, Combinatorial Testing for Deep Learning Systems
[10]   DeepGauge: Multi-Granularity Testing Criteria for Deep Learning Systems [J].
Ma, Lei ;
Juefei-Xu, Felix ;
Zhang, Fuyuan ;
Sun, Jiyuan ;
Xue, Minhui ;
Li, Bo ;
Chen, Chunyang ;
Su, Ting ;
Li, Li ;
Liu, Yang ;
Zhao, Jianjun ;
Wang, Yadong .
PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18), 2018, :120-131