Adaptive Test Selection for Deep Neural Networks

被引:28
作者
Gao, Xinyu [1 ]
Feng, Yang [1 ]
Yin, Yining [1 ]
Liu, Zixi [1 ]
Chen, Zhenyu [1 ]
Xu, Baowen [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
来源
2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022) | 2022年
基金
中国国家自然科学基金;
关键词
deep learning testing; deep neural networks; adaptive random testing; test selection; STRATEGY;
D O I
10.1145/3510003.3510232
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep neural networks (DNN) have achieved tremendous development in the past decade. While many DNN-driven software applications have been deployed to solve various tasks, they could also produce incorrect behaviors and result in massive losses. To reveal the incorrect behaviors and improve the quality of DNN-driven applications, developers often need rich labeled data for the testing and optimization of DNN models. However, in practice, collecting diverse data from application scenarios and labeling them properly is often a highly expensive and time-consuming task. In this paper, we proposed an adaptive test selection method, namely ATS, for deep neural networks to alleviate this problem. ATS leverages the difference between the model outputs to measure the behavior diversity of DNN test data. And it aims at selecting a subset with diverse tests from a massive unlabelled dataset. We experiment ATS with four well-designed DNN models and four widely-used datasets in comparison with various kinds of neuron coverage (NC). The results demonstrate that ATS can significantly outperform all test selection methods in assessing both fault detection and model improvement capability of test suites. It is promising to save the data labeling and model retraining costs for deep neural networks.
引用
收藏
页码:73 / 85
页数:13
相关论文
共 67 条
  • [1] DATA DIVERSITY - AN APPROACH TO SOFTWARE FAULT TOLERANCE
    AMMANN, PE
    KNIGHT, JC
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (04) : 418 - 425
  • [2] An orchestrated survey of methodologies for automated software test case generation
    Anand, Saswat
    Burke, Edmund K.
    Chen, Tsong Yueh
    Clark, John
    Cohen, Myra B.
    Grieskamp, Wolfgang
    Harman, Mark
    Harrold, Mary Jean
    McMinn, Phil
    Bertolino, Antonia
    Li, J. Jenny
    Zhu, Hong
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (08) : 1978 - 2001
  • [3] [Anonymous], BBC NEWS
  • [4] [Anonymous], TESLAS LATEST AUTOPI
  • [5] [Anonymous], GOOGLE SELF DRIVING
  • [6] Bahdanau D, 2014, ARXIV
  • [7] A Cost-Effective Random Testing Method for Programs with Non-Numeric Inputs
    Barus, Arlinta C.
    Chen, Tsong Yueh
    Kuo, Fei-Ching
    Liu, Huai
    Merkel, Robert
    Rothermel, Gregg
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (12) : 3509 - 3523
  • [8] BISHOP PG, 1993, FTCS-23 - TWENTY-THIRD INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING : DIGEST OF PAPERS, P98, DOI 10.1109/FTCS.1993.627312
  • [9] Bojarski M, 2016, Arxiv, DOI arXiv:1604.07316
  • [10] Bretscher O., 1997, Linear Algebra with Applications