On the Relationship between Generalization and Robustness to Adversarial Examples

Cited by: 8

Authors:

Pedraza, Anibal [1]

Deniz, Oscar [1]

Bueno, Gloria [1]

Affiliations:

[1] Univ Castilla La Mancha, VISILAB, ETSII, Ciudad Real 13071, Spain

Source:

SYMMETRY-BASEL | 2021 / Vol. 13 / Issue 05

Keywords:

machine learning; computer vision; deep learning; adversarial examples; adversarial robustness; overfitting;

DOI:

10.3390/sym13050817

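Chinese Library Classification (CLC):

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [Natural Sciences, General];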

Subject classification codes:

07 ; 0710 ; 09 ;

Abstract:

One of the most intriguing phenomena related to deep learning is that of so-called adversarial examples. These samples are visually equivalent to normal inputs and undetectable by humans, yet they cause networks to output wrong results. The phenomenon can be framed as a symmetry/asymmetry problem, whereby inputs to a neural network that have a similar/symmetric appearance to regular images produce an opposite/asymmetric output. Some researchers focus on developing methods for generating adversarial examples, while others propose defense methods. In parallel, there is growing interest in characterizing the phenomenon, which is also the focus of this paper. Using well-known datasets of common images, such as CIFAR-10 and STL-10, a neural network architecture is first trained in a normal regime, where training and validation performance both increase, reaching generalization. Additionally, the same architectures and datasets are trained in an overfitting regime, where there is a growing disparity between training and validation performance. The behaviour of the two regimes against adversarial examples is then compared. From the results, we observe greater robustness to adversarial examples in the overfitting regime. We explain this simultaneous loss of generalization and gain in robustness to adversarial examples as another manifestation of the well-known fitting-generalization trade-off.


Pages: 13

