Overparameterized Linear Regression Under Adversarial Attacks

被引：3

作者：

Ribeiro, Antonio H. ^{[1
]}

Schon, Thomas B. ^{[1
]}

机构：

[1] Uppsala Univ, Dept Informat Technol, S-75236 Uppsala, Sweden

来源：

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2023年 / 71卷

基金：

瑞典研究理事会;

关键词：

Training; Robustness; Linear regression; Predictive models; Analytical models; Behavioral sciences; Perturbation methods; Adversarial machine learning; parameter estimation; regression analysis; DOUBLE-DESCENT;

D O I：

10.1109/TSP.2023.3246228

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in the presence of an adversary as a function of the parameter norm and the error in the absence of such an adversary. We show how these bounds make it possible to study the adversarial error using analysis from non-adversarial setups. The obtained results shed light on the robustness of overparameterized linear models to adversarial attacks. Adding features might be either a source of additional robustness or brittleness. On the one hand, we use asymptotic results to illustrate how double-descent curves can be obtained for the adversarial error. On the other hand, we derive conditions under which the adversarial error can grow to infinity as more features are added, while at the same time, the test error goes to zero. We show this behavior is caused by the fact that the norm of the parameter vector grows with the number of features. It is also established that l(infinity) and l(2)-adversarial attacks might behave fundamentally differently due to how the l(1) and l(2)-norms of random projections concentrate. We also show how our reformulation allows for solving adversarial training as a convex optimization problem. This fact is then exploited to establish similarities between adversarial training and parameter-shrinking methods and to study how the training might affect the robustness of the estimated models.

引用

页码：601 / 614

页数：14

共 57 条

[1] Adlam Ben, 2020, INT C MACHINE LEARNI, V119
[2] High-dimensional dynamics of generalization error in neural networks
Advani, Madhu S.
Saxe, Andrew M.
Sompolinsky, Haim
[J]. NEURAL NETWORKS, 2020, 132 : 428 - 446
[3] Allen-Zhu Z, 2019, PR MACH LEARN RES, V97
[4] [Anonymous], 2000, Probability and measure theory
[5] Bartlett P. L., 2021, P INT C NEUR INF PRO
[6] Benign overfitting in linear regression
Bartlett, Peter L.
Long, Philip M.
Lugosi, Gabor
Tsigler, Alexander
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (48) : 30063 - 30070
[7] Two Models of Double Descent for Weak Features
Belkin, Mikhail
Hsu, Daniel
Xu, Ji
[J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2020, 2 (04): : 1167 - 1180
[8] Reconciling modern machine-learning practice and the classical bias-variance trade-off
Belkin, Mikhail
Hsu, Daniel
Ma, Siyuan
Mandal, Soumik
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (32) : 15849 - 15854
[9] Bhagoji A. N., 2019, Advances in Neural Information Processing Systems, P7498
[10] Wild patterns: Ten years after the rise of adversarial machine learning
Biggio, Battista
Roli, Fabio
[J]. PATTERN RECOGNITION, 2018, 84 : 317 - 331

← 1 2 3 4 5 6 →