Is Robustness the Cost of Accuracy? - A Comprehensive Study on the Robustness of 18 Deep Image Classification Models

被引：183

作者：

Su, Dong ^{[1
]}

Zhang, Huan ^{[2
]}

Chen, Hongge ^{[3
]}

Yi, Jinfeng ^{[4
]}

Chen, Pin-Yu ^{[1
]}

Gao, Yupeng ^{[1
]}

机构：

[1] IBM Res, New York, NY 10598 USA

[2] Univ Calif Davis, Davis, CA 95616 USA

[3] MIT, Cambridge, MA 02139 USA

[4] JD AI Res, Beijing, Peoples R China

来源：

COMPUTER VISION - ECCV 2018, PT XII | 2018年 / 11216卷

关键词：

Deep neural networks; Adversarial attacks; Robustness;

D O I：

10.1007/978-3-030-01258-8_39

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The prediction accuracy has been the long-lasting and sole standard for comparing the performance of different image classification models, including the ImageNet competition. However, recent studies have highlighted the lack of robustness in well-trained deep neural networks to adversarial examples. Visually imperceptible perturbations to natural images can easily be crafted and mislead the image classifiers towards misclassification. To demystify the trade-offs between robustness and accuracy, in this paper we thoroughly benchmark 18 ImageNet models using multiple robustness metrics, including the distortion, success rate and transferability of adversarial examples between 306 pairs of models. Our extensive experimental results reveal several new insights: (1) linear scaling law - the empirical l(2) and l(infinity) distortion metrics scale linearly with the logarithm of classification error; (2) model architecture is a more critical factor to robustness than model size, and the disclosed accuracy-robustness Pareto frontier can be used as an evaluation criterion for ImageNet model designers; (3) for a similar network architecture, increasing network depth slightly improves robustness in l(infinity) distortion; (4) there exist models (in VGG family) that exhibit high adversarial transferability, while most adversarial examples crafted from one model can only be transferred within the same family. Experiment code is publicly available at https://github.com/huanzhang12/Adversarial_Survey.

引用

页码：644 / 661

页数：18

共 50 条

[41] Robustness evaluation of deep neural networks for endoscopic image analysis: Insights and strategies
Jaspers, Tim J. M.
Boers, Tim G. W.
Kusters, Carolus H. J.
Jong, Martijn R.
Jukema, Jelmer B.
Groof, Albert J. de
Bergman, Jacques J.
With, Peter H. N. de
Sommen, Fons van der
MEDICAL IMAGE ANALYSIS, 2024, 94
[42] A Monte Carlo study of the accuracy and robustness of ten bivariate location estimators
Massé, JC
Plante, JF
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 42 (1-2) : 1 - 26
[43] Evaluating the robustness of deep learning models trained to diagnose idiopathic pulmonary fibrosis using a retrospective study
Yu, Wenxi
McNitt-Gray, Michael F.
Goldin, Jonathan G.
Song, Jin Woo
Kim, Grace Hyun J.
MEDICAL PHYSICS, 2025,
[44] Fast Hyperspectral Image Classification with Strong Noise Robustness Based on Minimum Noise Fraction
Wang, Hongqiao
Yu, Guoqing
Cheng, Jinyu
Zhang, Zhaoxiang
Wang, Xuan
Xu, Yuelei
REMOTE SENSING, 2024, 16 (20)
[45] Computerized classification of benign and malignant masses on digitized mammograms: A study of robustness
Huo, ZM
Giger, ML
Vyborny, CJ
Wolverton, DE
Metz, CE
ACADEMIC RADIOLOGY, 2000, 7 (12) : 1077 - 1084
[46] Study on the effects of tool design and process parameters on the robustness of deep drawing
Heinzel, Christine
Thiery, Sebastian
Ben Khalifa, Noomane
MATERIAL FORMING, ESAFORM 2024, 2024, 41 : 1488 - 1497
[47] Assessment of robustness and transferability of classification models built for cancer diagnostics using Raman spectroscopy
Sattlecker, Martina
Stone, Nick
Smith, Jennifer
Bessant, Conrad
JOURNAL OF RAMAN SPECTROSCOPY, 2011, 42 (05) : 897 - 903
[48] The error sample feature compensation method for improving the robustness of underwater classification and recognition models
He, Ming
Wang, Jinman
Wang, Hongbin
Jiang, Tianshu
Shi, Leibo
APPLIED INTELLIGENCE, 2024, 54 (13-14) : 7201 - 7212
[49] Application of SVM in the Estimation of GCV of Coal and a Comparison Study of the Accuracy and Robustness of SVM
Fu Jin-hui
2016 23RD ANNUAL INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS. I AND II, 2016, : 553 - 560
[50] Assessing the Robustness of Image Registration Models Under Domain Shifts with Learnable Input Images
Kolenbrander, Iris D.
Prasad, Vidya
Zikken, Leanne
van Eijnatten, Maureen A. J. M.
Maspero, Matteo
Pluim, Josien P. W.
BIOMEDICAL IMAGE REGISTRATION, WBIR 2024, 2025, 15249 : 101 - 111

← 1 2 3 4 5 →