On the Relationship between Generalization and Robustness to Adversarial Examples

Cited by: 8
Authors
Pedraza, Anibal [1]
Deniz, Oscar [1]
Bueno, Gloria [1]
Affiliations
[1] Univ Castilla La Mancha, VISILAB, ETSII, Ciudad Real 13071, Spain
Source
SYMMETRY-BASEL | 2021 / Vol. 13 / Issue 05
Keywords
machine learning; computer vision; deep learning; adversarial examples; adversarial robustness; overfitting
DOI
10.3390/sym13050817
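Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];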
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
One of the most intriguing phenomena in deep learning is that of so-called adversarial examples. These samples are visually equivalent to normal inputs, with differences undetectable to humans, yet they cause networks to output wrong results. The phenomenon can be framed as a symmetry/asymmetry problem, whereby inputs to a neural network that have a similar/symmetric appearance to regular images produce an opposite/asymmetric output. Some researchers focus on developing methods for generating adversarial examples, while others propose defense methods. In parallel, there is growing interest in characterizing the phenomenon, which is also the focus of this paper. Using well-known datasets of common images, such as CIFAR-10 and STL-10, a neural network architecture is first trained in a normal regime, where training and validation performances both increase, reaching generalization. Additionally, the same architectures are trained on the same datasets in an overfitting regime, where there is a growing disparity between training and validation performances. The behaviour of these two regimes against adversarial examples is then compared. From the results, we observe greater robustness to adversarial examples in the overfitting regime. We explain this simultaneous loss of generalization and gain in robustness to adversarial examples as another manifestation of the well-known fitting-generalization trade-off.
Pages: 13