Adaptive Test Selection for Deep Neural Networks

被引：28

作者：

Gao, Xinyu ^{[1
]}

Feng, Yang ^{[1
]}

Yin, Yining ^{[1
]}

Liu, Zixi ^{[1
]}

Chen, Zhenyu ^{[1
]}

Xu, Baowen ^{[1
]}

机构：

[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China

来源：

2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022) | 2022年

基金：

中国国家自然科学基金;

关键词：

deep learning testing; deep neural networks; adaptive random testing; test selection; STRATEGY;

D O I：

10.1145/3510003.3510232

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Deep neural networks (DNN) have achieved tremendous development in the past decade. While many DNN-driven software applications have been deployed to solve various tasks, they could also produce incorrect behaviors and result in massive losses. To reveal the incorrect behaviors and improve the quality of DNN-driven applications, developers often need rich labeled data for the testing and optimization of DNN models. However, in practice, collecting diverse data from application scenarios and labeling them properly is often a highly expensive and time-consuming task. In this paper, we proposed an adaptive test selection method, namely ATS, for deep neural networks to alleviate this problem. ATS leverages the difference between the model outputs to measure the behavior diversity of DNN test data. And it aims at selecting a subset with diverse tests from a massive unlabelled dataset. We experiment ATS with four well-designed DNN models and four widely-used datasets in comparison with various kinds of neuron coverage (NC). The results demonstrate that ATS can significantly outperform all test selection methods in assessing both fault detection and model improvement capability of test suites. It is promising to save the data labeling and model retraining costs for deep neural networks.

引用

页码：73 / 85

页数：13

共 67 条

[1] DATA DIVERSITY - AN APPROACH TO SOFTWARE FAULT TOLERANCE
AMMANN, PE
KNIGHT, JC
[J]. IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (04) : 418 - 425
[2] An orchestrated survey of methodologies for automated software test case generation
Anand, Saswat
Burke, Edmund K.
Chen, Tsong Yueh
Clark, John
Cohen, Myra B.
Grieskamp, Wolfgang
Harman, Mark
Harrold, Mary Jean
McMinn, Phil
Bertolino, Antonia
Li, J. Jenny
Zhu, Hong
[J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (08) : 1978 - 2001
[3] [Anonymous], BBC NEWS
[4] [Anonymous], TESLAS LATEST AUTOPI
[5] [Anonymous], GOOGLE SELF DRIVING
[6] Bahdanau D, 2014, ARXIV
[7] A Cost-Effective Random Testing Method for Programs with Non-Numeric Inputs
Barus, Arlinta C.
Chen, Tsong Yueh
Kuo, Fei-Ching
Liu, Huai
Merkel, Robert
Rothermel, Gregg
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (12) : 3509 - 3523
[8] BISHOP PG, 1993, FTCS-23 - TWENTY-THIRD INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING : DIGEST OF PAPERS, P98, DOI 10.1109/FTCS.1993.627312
[9] Bojarski M, 2016, Arxiv, DOI arXiv:1604.07316
[10] Bretscher O., 1997, Linear Algebra with Applications

← 1 2 3 4 5 6 7 →