QuoTe: Quality-oriented Testing for Deep Learning Systems

被引:2
|
作者
Chen, Jialuo [1 ]
Wang, Jingyi [1 ]
Ma, Xingjun [2 ]
Sun, Youcheng [3 ]
Sun, Jun [4 ]
Zhang, Peixin [1 ]
Cheng, Peng [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Fudan Univ, Shanghai 200433, Peoples R China
[3] Univ Manchester, Manchester M13 9PL, Lancs, England
[4] Singapore Management Univ, Singapore 188065, Singapore
基金
国家重点研发计划;
关键词
Deep learning; testing; robustness; fairness; ROBUSTNESS;
D O I
10.1145/3582573
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, there has been significant growth of interest in applying software engineering techniques for the quality assurance of deep learning (DL) systems. One popular direction is DL testing-that is, given a property of test, defects of DL systems are found either by fuzzing or guided search with the help of certain testing metrics. However, recent studies have revealed that the neuron coverage metrics, which are commonly used by most existing DL testing approaches, are not necessarily correlated with model quality (e.g., robustness, the most studied model property), and are also not an effective measurement on the confidence of the model quality after testing. In this work, we address this gap by proposing a novel testing framework calledQuoTe (i.e., Quality-oriented Testing). A key part of QuoTe is a quantitative measurement on (1) the value of each test case in enhancing the model property of interest (often via retraining) and (2) the convergence quality of the model property improvement. QuoTe utilizes the proposed metric to automatically select or generate valuable test cases for improving model quality. The proposedmetric is also a lightweight yet strong indicator of how well the improvement converged. Extensive experiments on both image and tabular datasets with a variety of model architectures confirm the effectiveness and efficiency of QuoTe in improving DL model quality-that is, robustness and fairness. As a generic quality-oriented testing framework, future adaptations can be made to other domains (e.g., text) as well as other model properties.
引用
收藏
页数:33
相关论文
共 50 条
  • [41] Copy, Right? A Testing Framework for Copyright Protection of Deep Learning Models
    Chen, Jialuo
    Wang, Jingyi
    Peng, Tinglan
    Sun, Youcheng
    Cheng, Peng
    Ji, Shouling
    Ma, Xingjun
    Li, Bo
    Song, Dawn
    43RD IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP 2022), 2022, : 824 - 841
  • [42] ADVRET: An Adversarial Robustness Evaluating and Testing Platform for Deep Learning Models
    Ren, Fei
    Yang, Yonghui
    Hu, Chi
    Zhou, Yuyao
    Ma, Siyou
    2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 9 - 14
  • [43] Deep learning for recommendation systems
    Dellal-Hedjazi, Badiaa
    Alimazighi, Zaiai
    2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 90 - 97
  • [44] Deep Learning for Recommender Systems
    Karatzoglou, Alexandros
    Hidasi, Balazs
    PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 396 - 397
  • [45] Deep Learning on Mobile Systems
    Curukoglu, Nur
    Ozyildirim, Buse Melis
    2018 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2018, : 179 - 182
  • [46] Deep Learning-Based Performance Testing for Analog Integrated Circuits
    Cao, Jiawei
    Guo, Chongtao
    Wang, Houjun
    Wang, Zhigang
    Li, Hao
    Li, Geoffrey Ye
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024,
  • [47] A fine-grained evaluation of mutation operators to boost mutation testing for deep learning systems
    Zhang, Zhiyi
    Wang, Yichun
    Yao, Yongming
    Wang, Ziyuan
    Huang, Zhiqiu
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
  • [48] Validating a Deep Learning Framework by Metamorphic Testing
    Ding, Junhua
    Kang, Xiaojun
    Hu, Xin-Hua
    2017 IEEE/ACM 2ND INTERNATIONAL WORKSHOP ON METAMORPHIC TESTING (MET 2017), 2017, : 28 - 34
  • [49] Deep Learning and Video Quality Analysis
    Topiwala, P.
    Krishnan, M.
    Dai, W.
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
  • [50] Deep Learning for FAST Quality Assessment
    Taye, Mesfin
    Morrow, Dustin
    Cull, John
    Smith, Dane Hudson
    Hagan, Martin
    JOURNAL OF ULTRASOUND IN MEDICINE, 2023, 42 (01) : 71 - 79