Is Robustness the Cost of Accuracy? - A Comprehensive Study on the Robustness of 18 Deep Image Classification Models

被引:183
|
作者
Su, Dong [1 ]
Zhang, Huan [2 ]
Chen, Hongge [3 ]
Yi, Jinfeng [4 ]
Chen, Pin-Yu [1 ]
Gao, Yupeng [1 ]
机构
[1] IBM Res, New York, NY 10598 USA
[2] Univ Calif Davis, Davis, CA 95616 USA
[3] MIT, Cambridge, MA 02139 USA
[4] JD AI Res, Beijing, Peoples R China
来源
COMPUTER VISION - ECCV 2018, PT XII | 2018年 / 11216卷
关键词
Deep neural networks; Adversarial attacks; Robustness;
D O I
10.1007/978-3-030-01258-8_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prediction accuracy has been the long-lasting and sole standard for comparing the performance of different image classification models, including the ImageNet competition. However, recent studies have highlighted the lack of robustness in well-trained deep neural networks to adversarial examples. Visually imperceptible perturbations to natural images can easily be crafted and mislead the image classifiers towards misclassification. To demystify the trade-offs between robustness and accuracy, in this paper we thoroughly benchmark 18 ImageNet models using multiple robustness metrics, including the distortion, success rate and transferability of adversarial examples between 306 pairs of models. Our extensive experimental results reveal several new insights: (1) linear scaling law - the empirical l(2) and l(infinity) distortion metrics scale linearly with the logarithm of classification error; (2) model architecture is a more critical factor to robustness than model size, and the disclosed accuracy-robustness Pareto frontier can be used as an evaluation criterion for ImageNet model designers; (3) for a similar network architecture, increasing network depth slightly improves robustness in l(infinity) distortion; (4) there exist models (in VGG family) that exhibit high adversarial transferability, while most adversarial examples crafted from one model can only be transferred within the same family. Experiment code is publicly available at https://github.com/huanzhang12/Adversarial_Survey.
引用
收藏
页码:644 / 661
页数:18
相关论文
共 50 条
  • [41] Robustness evaluation of deep neural networks for endoscopic image analysis: Insights and strategies
    Jaspers, Tim J. M.
    Boers, Tim G. W.
    Kusters, Carolus H. J.
    Jong, Martijn R.
    Jukema, Jelmer B.
    Groof, Albert J. de
    Bergman, Jacques J.
    With, Peter H. N. de
    Sommen, Fons van der
    MEDICAL IMAGE ANALYSIS, 2024, 94
  • [42] A Monte Carlo study of the accuracy and robustness of ten bivariate location estimators
    Massé, JC
    Plante, JF
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 42 (1-2) : 1 - 26
  • [43] Evaluating the robustness of deep learning models trained to diagnose idiopathic pulmonary fibrosis using a retrospective study
    Yu, Wenxi
    McNitt-Gray, Michael F.
    Goldin, Jonathan G.
    Song, Jin Woo
    Kim, Grace Hyun J.
    MEDICAL PHYSICS, 2025,
  • [44] Fast Hyperspectral Image Classification with Strong Noise Robustness Based on Minimum Noise Fraction
    Wang, Hongqiao
    Yu, Guoqing
    Cheng, Jinyu
    Zhang, Zhaoxiang
    Wang, Xuan
    Xu, Yuelei
    REMOTE SENSING, 2024, 16 (20)
  • [45] Computerized classification of benign and malignant masses on digitized mammograms: A study of robustness
    Huo, ZM
    Giger, ML
    Vyborny, CJ
    Wolverton, DE
    Metz, CE
    ACADEMIC RADIOLOGY, 2000, 7 (12) : 1077 - 1084
  • [46] Study on the effects of tool design and process parameters on the robustness of deep drawing
    Heinzel, Christine
    Thiery, Sebastian
    Ben Khalifa, Noomane
    MATERIAL FORMING, ESAFORM 2024, 2024, 41 : 1488 - 1497
  • [47] Assessment of robustness and transferability of classification models built for cancer diagnostics using Raman spectroscopy
    Sattlecker, Martina
    Stone, Nick
    Smith, Jennifer
    Bessant, Conrad
    JOURNAL OF RAMAN SPECTROSCOPY, 2011, 42 (05) : 897 - 903
  • [48] The error sample feature compensation method for improving the robustness of underwater classification and recognition models
    He, Ming
    Wang, Jinman
    Wang, Hongbin
    Jiang, Tianshu
    Shi, Leibo
    APPLIED INTELLIGENCE, 2024, 54 (13-14) : 7201 - 7212
  • [49] Application of SVM in the Estimation of GCV of Coal and a Comparison Study of the Accuracy and Robustness of SVM
    Fu Jin-hui
    2016 23RD ANNUAL INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS. I AND II, 2016, : 553 - 560
  • [50] Assessing the Robustness of Image Registration Models Under Domain Shifts with Learnable Input Images
    Kolenbrander, Iris D.
    Prasad, Vidya
    Zikken, Leanne
    van Eijnatten, Maureen A. J. M.
    Maspero, Matteo
    Pluim, Josien P. W.
    BIOMEDICAL IMAGE REGISTRATION, WBIR 2024, 2025, 15249 : 101 - 111