Adversarial Training is a Form of Data-dependent Operator Norm Regularization

被引：0

作者：

Roth, Kevin ^{[1
]}

Kilcher, Yannic ^{[1
]}

Hofmann, Thomas ^{[1
]}

机构：

[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020) | 2020年 / 33卷

关键词：

ROBUSTNESS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We establish a theoretical link between adversarial training and operator norm regularization for deep neural networks. Specifically, we prove that l(p) -norm constrained projected gradient ascent based adversarial training with an lq-norm loss on the logits of clean and perturbed inputs is equivalent to data-dependent (p, q) operator norm regularization. This fundamental connection confirms the long-standing argument that a network's sensitivity to adversarial examples is tied to its spectral properties and hints at novel ways to robustify and defend against adversarial attacks. We provide extensive empirical evidence on state-of-the-art network architectures to support our theoretical results.

引用

页数：13

共 50 条

[1] Data-dependent stability analysis of adversarial training
Wang, Yihan
Liu, Shuang
Gao, Xiao-Shan
NEURAL NETWORKS, 2025, 183
[2] Dropout Training, Data-dependent Regularization, and Generalization Bounds
Mou, Wenlong
Zhou, Yuchen
Gao, Jun
Wang, Liwei
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[3] ADVERSARIAL DEFENSE VIA THE DATA-DEPENDENT ACTIVATION, TOTAL VARIATION MINIMIZATION, AND ADVERSARIAL TRAINING
Wang, Bao
Lin, Alex
Yin, Penghang
Zhu, Wei
Bertozzi, Andrea L.
Osher, Stanley J.
INVERSE PROBLEMS AND IMAGING, 2021, 15 (01) : 129 - 145
[4] Operator Precedence for Data-Dependent Grammars
Afroozeh, Ali
Izmaylova, Anastasia
PEPM'16: PROCEEDINGS OF THE 2016 ACM SIGPLAN WORKSHOP ON PARTIAL EVALUATION AND PROGRAM MANIPULATION, 2016, : 13 - 24
[5] Maximum relative margin and data-dependent regularization
Shivaswamy, Pannagadatta K.
Jebara, Tony
Journal of Machine Learning Research, 2010, 11 : 747 - 788
[6] Maximum Relative Margin and Data-Dependent Regularization
Shivaswamy, Pannagadatta K.
Jebara, Tony
JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 747 - 788
[7] A DATA-DEPENDENT REGULARIZATION METHOD BASED ON THE GRAPH LAPLACIAN
Bianchi, Davide
Evangelista, Davide
Aleotti, Stefano
Donatelli, Marco
Piccolomini, Elena Loli
Li, Wenbin
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2025, 47 (02): : C369 - C398
[8] Data identifiability for Data-Dependent Superimposed Training
Whitworth, T.
Ghogho, M.
McLernon, D. C.
2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 2545 - 2550
[9] Data detection and coding for data-dependent superimposed training
Wang, Ping
Fan, Pingzhi
Yuan, Weina
Darnell, Michael
IET SIGNAL PROCESSING, 2014, 8 (02) : 138 - 145
[10] Data-dependent superimposed training for noncoherent channels
Yang, Jingnong
Williams, Douglas B.
Cioffi, John M.
CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1010 - +

← 1 2 3 4 5 →