QuoTe: Quality-oriented Testing for Deep Learning Systems

被引:2
|
作者
Chen, Jialuo [1 ]
Wang, Jingyi [1 ]
Ma, Xingjun [2 ]
Sun, Youcheng [3 ]
Sun, Jun [4 ]
Zhang, Peixin [1 ]
Cheng, Peng [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Fudan Univ, Shanghai 200433, Peoples R China
[3] Univ Manchester, Manchester M13 9PL, Lancs, England
[4] Singapore Management Univ, Singapore 188065, Singapore
基金
国家重点研发计划;
关键词
Deep learning; testing; robustness; fairness; ROBUSTNESS;
D O I
10.1145/3582573
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, there has been significant growth of interest in applying software engineering techniques for the quality assurance of deep learning (DL) systems. One popular direction is DL testing-that is, given a property of test, defects of DL systems are found either by fuzzing or guided search with the help of certain testing metrics. However, recent studies have revealed that the neuron coverage metrics, which are commonly used by most existing DL testing approaches, are not necessarily correlated with model quality (e.g., robustness, the most studied model property), and are also not an effective measurement on the confidence of the model quality after testing. In this work, we address this gap by proposing a novel testing framework calledQuoTe (i.e., Quality-oriented Testing). A key part of QuoTe is a quantitative measurement on (1) the value of each test case in enhancing the model property of interest (often via retraining) and (2) the convergence quality of the model property improvement. QuoTe utilizes the proposed metric to automatically select or generate valuable test cases for improving model quality. The proposedmetric is also a lightweight yet strong indicator of how well the improvement converged. Extensive experiments on both image and tabular datasets with a variety of model architectures confirm the effectiveness and efficiency of QuoTe in improving DL model quality-that is, robustness and fairness. As a generic quality-oriented testing framework, future adaptations can be made to other domains (e.g., text) as well as other model properties.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] DistXplore: Distribution-Guided Testing for Evaluating and Enhancing Deep Learning Systems
    Wang, Longtian
    Xie, Xiaofei
    Du, Xiaoning
    Tian, Meng
    Guo, Qing
    Yang, Zheng
    Shen, Chao
    PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 68 - 80
  • [22] A Framework for Continuous Regression and Integration Testing in IoT Systems Based on Deep Learning and Search-Based Techniques
    Medhat, Noha
    Moussa, Sherin M.
    Badr, Nagwa Lotfy
    Tolba, Mohamed F.
    IEEE ACCESS, 2020, 8 : 215716 - 215726
  • [23] Deep Learning-Based Autonomous Driving Systems: A Survey of Attacks and Defenses
    Deng, Yao
    Zhang, Tiehua
    Lou, Guannan
    Zheng, Xi
    Jin, Jiong
    Han, Qing-Long
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) : 7897 - 7912
  • [24] A Feature Space-Restricted Attention Attack on Medical Deep Learning Systems
    Wang, Zizhou
    Shu, Xin
    Wang, Yan
    Feng, Yangqin
    Zhang, Lei
    Yi, Zhang
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 5323 - 5335
  • [25] Understanding adversarial attacks on deep learning based medical image analysis systems
    Ma, Xingjun
    Niu, Yuhao
    Gu, Lin
    Yisen, Wang
    Zhao, Yitian
    Bailey, James
    Lu, Feng
    PATTERN RECOGNITION, 2021, 110
  • [26] Novel approach for quality control testing of medical displays using deep learning technology
    Maruyama, Sho
    Mizutani, Fumiya
    Watanabe, Haruyuki
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (02):
  • [27] SOLAR: Services-Oriented Deep Learning Architectures-Deep Learning as a Service
    Wang, Chao
    Gong, Lei
    Li, Xi
    Yu, Qi
    Wang, Aili
    Hung, Patrick
    Zhou, Xuehai
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2021, 14 (01) : 262 - 273
  • [28] Deep-Learning-Based Detection of Segregations for Ultrasonic Testing
    Elischberger, Frederik
    Bamberg, Joachim
    Jiang, Xiaoyi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [29] Deep Reinforcement Learning Based Task-Oriented Communication in Multi-Agent Systems
    He, Guojun
    Feng, Mingjie
    Zhang, Yu
    Liu, Guanghua
    Dai, Yueyue
    Jiang, Tao
    IEEE WIRELESS COMMUNICATIONS, 2023, 30 (03) : 112 - 119
  • [30] Learning Deep Blind Quality Assessment for Cartoon Images
    Chen, Yuan
    Zhao, Yang
    Cao, Li
    Jia, Wei
    Liu, Xiaoping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6650 - 6655