Importance-Driven Deep Learning System Testing

被引:68
作者
Gerasimou, Simos [1 ]
Eniser, Hasan Ferit [2 ,3 ]
Sen, Alper [3 ]
Cakan, Alper [3 ]
机构
[1] Univ York, York, N Yorkshire, England
[2] MPI SWS, Kaiserslautern, Germany
[3] Bogazici Univ, Istanbul, Turkey
来源
2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020) | 2020年
关键词
Deep Learning Systems; Test Adequacy; Safety-Critical Systems;
D O I
10.1145/3377811.3380391
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep Learning (DL) systems are key enablers for engineering intelligent applications due to their ability to solve complex tasks such as image recognition and machine translation. Nevertheless, using DL systems in safety- and security-critical applications requires to provide testing evidence for their dependable operation. Recent research in this direction focuses on adapting testing criteria from traditional software engineering as a means of increasing confidence for their correct behaviour. However, they are inadequate in capturing the intrinsic properties exhibited by these systems. We bridge this gap by introducing DeepImportance, a systematic testing methodology accompanied by an Importance-Driven (IDC) test adequacy criterion for DL systems. Applying IDC enables to establish a layer-wise functional understanding of the importance of DL system components and use this information to assess the semantic diversity of a test set. Our empirical evaluation on several DL systems, across multiple DL datasets and with state-of-the-art adversarial generation techniques demonstrates the usefulness and effectiveness of DeepImportance and its ability to support the engineering of more robust DL systems.
引用
收藏
页码:702 / 713
页数:12
相关论文
共 74 条
[71]  
Varshney KR, 2016, 2016 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA)
[72]  
Wang SQ, 2018, PROCEEDINGS OF THE 27TH USENIX SECURITY SYMPOSIUM, P1599
[73]   Feature-Guided Black-Box Safety Testing of Deep Neural Networks [J].
Wicker, Matthew ;
Huang, Xiaowei ;
Kwiatkowska, Marta .
TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, TACAS 2018, PT I, 2018, 10805 :408-426
[74]  
Zhao Jianjun, 2018, ARXIV PREPRINT ARXIV