Adversarial Robustness with Partial Isometry

被引:2
作者
Shi-Garrier, Loic [1 ]
Bouaynaya, Nidhal Carla [2 ]
Delahaye, Daniel [1 ]
机构
[1] Univ Toulouse, ENAC, F-31400 Toulouse, France
[2] Rowan Univ, Dept Elect & Comp Engn, Glassboro, NJ 08028 USA
关键词
adversarial robustness; information geometry; fisher information metric; multi-class classification;
D O I
10.3390/e26020103
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Despite their remarkable performance, deep learning models still lack robustness guarantees, particularly in the presence of adversarial examples. This significant vulnerability raises concerns about their trustworthiness and hinders their deployment in critical domains that require certified levels of robustness. In this paper, we introduce an information geometric framework to establish precise robustness criteria for l2 white-box attacks in a multi-class classification setting. We endow the output space with the Fisher information metric and derive criteria on the input-output Jacobian to ensure robustness. We show that model robustness can be achieved by constraining the model to be partially isometric around the training points. We evaluate our approach using MNIST and CIFAR-10 datasets against adversarial attacks, revealing its substantial improvements over defensive distillation and Jacobian regularization for medium-sized perturbations and its superior robustness performance to adversarial training for large perturbations, all while maintaining the desired accuracy.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Improving Adversarial Robustness With Adversarial Augmentations
    Chen, Chuanxi
    Ye, Dengpan
    He, Yiheng
    Tang, Long
    Xu, Yue
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (03) : 5105 - 5117
  • [2] Robustness Tokens: Towards Adversarial Robustness of Transformers
    Pulfer, Brian
    Belousov, Yury
    Voloshynovskiy, Slava
    COMPUTER VISION - ECCV 2024, PT LIX, 2025, 15117 : 110 - 127
  • [3] On Saliency Maps and Adversarial Robustness
    Mangla, Puneet
    Singh, Vedant
    Balasubramanian, Vineeth N.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 272 - 288
  • [4] On the Adversarial Robustness of Hypothesis Testing
    Jin, Yulu
    Lai, Lifeng
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 515 - 530
  • [5] On the adversarial robustness of aerial detection
    Chen, Yuwei
    Chu, Shiyong
    FRONTIERS IN COMPUTER SCIENCE, 2024, 6
  • [6] On the Adversarial Robustness of Subspace Learning
    Li, Fuwei
    Lai, Lifeng
    Cui, Shuguang
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 1470 - 1483
  • [7] ON THE ADVERSARIAL ROBUSTNESS OF LINEAR REGRESSION
    Li, Fuwei
    Lai, Lifeng
    Cui, Shuguang
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [8] ON THE ADVERSARIAL ROBUSTNESS OF SUBSPACE LEARNING
    Li, Fuwei
    Lai, Lifeng
    Cui, Shuguang
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2477 - 2481
  • [9] On the Adversarial Robustness of Robust Estimators
    Lai, Lifeng
    Bayraktar, Erhan
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (08) : 5097 - 5109
  • [10] Pareto adversarial robustness: balancing spatial robustness and sensitivity-based robustness
    Ke Sun
    Mingjie Li
    Zhouchen Lin
    Science China Information Sciences, 2025, 68 (6)