Adversarial Training is a Form of Data-dependent Operator Norm Regularization

被引:0
|
作者
Roth, Kevin [1 ]
Kilcher, Yannic [1 ]
Hofmann, Thomas [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020) | 2020年 / 33卷
关键词
ROBUSTNESS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We establish a theoretical link between adversarial training and operator norm regularization for deep neural networks. Specifically, we prove that l(p) -norm constrained projected gradient ascent based adversarial training with an lq-norm loss on the logits of clean and perturbed inputs is equivalent to data-dependent (p, q) operator norm regularization. This fundamental connection confirms the long-standing argument that a network's sensitivity to adversarial examples is tied to its spectral properties and hints at novel ways to robustify and defend against adversarial attacks. We provide extensive empirical evidence on state-of-the-art network architectures to support our theoretical results.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Data-dependent stability analysis of adversarial training
    Wang, Yihan
    Liu, Shuang
    Gao, Xiao-Shan
    NEURAL NETWORKS, 2025, 183
  • [2] Dropout Training, Data-dependent Regularization, and Generalization Bounds
    Mou, Wenlong
    Zhou, Yuchen
    Gao, Jun
    Wang, Liwei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [3] ADVERSARIAL DEFENSE VIA THE DATA-DEPENDENT ACTIVATION, TOTAL VARIATION MINIMIZATION, AND ADVERSARIAL TRAINING
    Wang, Bao
    Lin, Alex
    Yin, Penghang
    Zhu, Wei
    Bertozzi, Andrea L.
    Osher, Stanley J.
    INVERSE PROBLEMS AND IMAGING, 2021, 15 (01) : 129 - 145
  • [4] Operator Precedence for Data-Dependent Grammars
    Afroozeh, Ali
    Izmaylova, Anastasia
    PEPM'16: PROCEEDINGS OF THE 2016 ACM SIGPLAN WORKSHOP ON PARTIAL EVALUATION AND PROGRAM MANIPULATION, 2016, : 13 - 24
  • [5] Maximum relative margin and data-dependent regularization
    Shivaswamy, Pannagadatta K.
    Jebara, Tony
    Journal of Machine Learning Research, 2010, 11 : 747 - 788
  • [6] Maximum Relative Margin and Data-Dependent Regularization
    Shivaswamy, Pannagadatta K.
    Jebara, Tony
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 747 - 788
  • [7] A DATA-DEPENDENT REGULARIZATION METHOD BASED ON THE GRAPH LAPLACIAN
    Bianchi, Davide
    Evangelista, Davide
    Aleotti, Stefano
    Donatelli, Marco
    Piccolomini, Elena Loli
    Li, Wenbin
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2025, 47 (02): : C369 - C398
  • [8] Data identifiability for Data-Dependent Superimposed Training
    Whitworth, T.
    Ghogho, M.
    McLernon, D. C.
    2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 2545 - 2550
  • [9] Data detection and coding for data-dependent superimposed training
    Wang, Ping
    Fan, Pingzhi
    Yuan, Weina
    Darnell, Michael
    IET SIGNAL PROCESSING, 2014, 8 (02) : 138 - 145
  • [10] Data-dependent superimposed training for noncoherent channels
    Yang, Jingnong
    Williams, Douglas B.
    Cioffi, John M.
    CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1010 - +