QuoTe: Quality-oriented Testing for Deep Learning Systems

被引：2

作者：

Chen, Jialuo ^{[1
]}

Wang, Jingyi ^{[1
]}

Ma, Xingjun ^{[2
]}

Sun, Youcheng ^{[3
]}

Sun, Jun ^{[4
]}

Zhang, Peixin ^{[1
]}

Cheng, Peng ^{[1
]}

机构：

[1] Zhejiang Univ, Hangzhou 310027, Peoples R China

[2] Fudan Univ, Shanghai 200433, Peoples R China

[3] Univ Manchester, Manchester M13 9PL, Lancs, England

[4] Singapore Management Univ, Singapore 188065, Singapore

来源：

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY | 2023年 / 32卷 / 05期

基金：

国家重点研发计划;

关键词：

Deep learning; testing; robustness; fairness; ROBUSTNESS;

D O I：

10.1145/3582573

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Recently, there has been significant growth of interest in applying software engineering techniques for the quality assurance of deep learning (DL) systems. One popular direction is DL testing-that is, given a property of test, defects of DL systems are found either by fuzzing or guided search with the help of certain testing metrics. However, recent studies have revealed that the neuron coverage metrics, which are commonly used by most existing DL testing approaches, are not necessarily correlated with model quality (e.g., robustness, the most studied model property), and are also not an effective measurement on the confidence of the model quality after testing. In this work, we address this gap by proposing a novel testing framework calledQuoTe (i.e., Quality-oriented Testing). A key part of QuoTe is a quantitative measurement on (1) the value of each test case in enhancing the model property of interest (often via retraining) and (2) the convergence quality of the model property improvement. QuoTe utilizes the proposed metric to automatically select or generate valuable test cases for improving model quality. The proposedmetric is also a lightweight yet strong indicator of how well the improvement converged. Extensive experiments on both image and tabular datasets with a variety of model architectures confirm the effectiveness and efficiency of QuoTe in improving DL model quality-that is, robustness and fairness. As a generic quality-oriented testing framework, future adaptations can be made to other domains (e.g., text) as well as other model properties.

引用

页数：33

共 50 条

[21] DistXplore: Distribution-Guided Testing for Evaluating and Enhancing Deep Learning Systems
Wang, Longtian
Xie, Xiaofei
Du, Xiaoning
Tian, Meng
Guo, Qing
Yang, Zheng
Shen, Chao
PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 68 - 80
[22] A Framework for Continuous Regression and Integration Testing in IoT Systems Based on Deep Learning and Search-Based Techniques
Medhat, Noha
Moussa, Sherin M.
Badr, Nagwa Lotfy
Tolba, Mohamed F.
IEEE ACCESS, 2020, 8 : 215716 - 215726
[23] Deep Learning-Based Autonomous Driving Systems: A Survey of Attacks and Defenses
Deng, Yao
Zhang, Tiehua
Lou, Guannan
Zheng, Xi
Jin, Jiong
Han, Qing-Long
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) : 7897 - 7912
[24] A Feature Space-Restricted Attention Attack on Medical Deep Learning Systems
Wang, Zizhou
Shu, Xin
Wang, Yan
Feng, Yangqin
Zhang, Lei
Yi, Zhang
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (08) : 5323 - 5335
[25] Understanding adversarial attacks on deep learning based medical image analysis systems
Ma, Xingjun
Niu, Yuhao
Gu, Lin
Yisen, Wang
Zhao, Yitian
Bailey, James
Lu, Feng
PATTERN RECOGNITION, 2021, 110
[26] Novel approach for quality control testing of medical displays using deep learning technology
Maruyama, Sho
Mizutani, Fumiya
Watanabe, Haruyuki
BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (02):
[27] SOLAR: Services-Oriented Deep Learning Architectures-Deep Learning as a Service
Wang, Chao
Gong, Lei
Li, Xi
Yu, Qi
Wang, Aili
Hung, Patrick
Zhou, Xuehai
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2021, 14 (01) : 262 - 273
[28] Deep-Learning-Based Detection of Segregations for Ultrasonic Testing
Elischberger, Frederik
Bamberg, Joachim
Jiang, Xiaoyi
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[29] Deep Reinforcement Learning Based Task-Oriented Communication in Multi-Agent Systems
He, Guojun
Feng, Mingjie
Zhang, Yu
Liu, Guanghua
Dai, Yueyue
Jiang, Tao
IEEE WIRELESS COMMUNICATIONS, 2023, 30 (03) : 112 - 119
[30] Learning Deep Blind Quality Assessment for Cartoon Images
Chen, Yuan
Zhao, Yang
Cao, Li
Jia, Wei
Liu, Xiaoping
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6650 - 6655

← 1 2 3 4 5 →