Overparameterized Linear Regression Under Adversarial Attacks

被引:3
作者
Ribeiro, Antonio H. [1 ]
Schon, Thomas B. [1 ]
机构
[1] Uppsala Univ, Dept Informat Technol, S-75236 Uppsala, Sweden
基金
瑞典研究理事会;
关键词
Training; Robustness; Linear regression; Predictive models; Analytical models; Behavioral sciences; Perturbation methods; Adversarial machine learning; parameter estimation; regression analysis; DOUBLE-DESCENT;
D O I
10.1109/TSP.2023.3246228
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in the presence of an adversary as a function of the parameter norm and the error in the absence of such an adversary. We show how these bounds make it possible to study the adversarial error using analysis from non-adversarial setups. The obtained results shed light on the robustness of overparameterized linear models to adversarial attacks. Adding features might be either a source of additional robustness or brittleness. On the one hand, we use asymptotic results to illustrate how double-descent curves can be obtained for the adversarial error. On the other hand, we derive conditions under which the adversarial error can grow to infinity as more features are added, while at the same time, the test error goes to zero. We show this behavior is caused by the fact that the norm of the parameter vector grows with the number of features. It is also established that l(infinity) and l(2)-adversarial attacks might behave fundamentally differently due to how the l(1) and l(2)-norms of random projections concentrate. We also show how our reformulation allows for solving adversarial training as a convex optimization problem. This fact is then exploited to establish similarities between adversarial training and parameter-shrinking methods and to study how the training might affect the robustness of the estimated models.
引用
收藏
页码:601 / 614
页数:14
相关论文
共 57 条
  • [1] Adlam Ben, 2020, INT C MACHINE LEARNI, V119
  • [2] High-dimensional dynamics of generalization error in neural networks
    Advani, Madhu S.
    Saxe, Andrew M.
    Sompolinsky, Haim
    [J]. NEURAL NETWORKS, 2020, 132 : 428 - 446
  • [3] Allen-Zhu Z, 2019, PR MACH LEARN RES, V97
  • [4] [Anonymous], 2000, Probability and measure theory
  • [5] Bartlett P. L., 2021, P INT C NEUR INF PRO
  • [6] Benign overfitting in linear regression
    Bartlett, Peter L.
    Long, Philip M.
    Lugosi, Gabor
    Tsigler, Alexander
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (48) : 30063 - 30070
  • [7] Two Models of Double Descent for Weak Features
    Belkin, Mikhail
    Hsu, Daniel
    Xu, Ji
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2020, 2 (04): : 1167 - 1180
  • [8] Reconciling modern machine-learning practice and the classical bias-variance trade-off
    Belkin, Mikhail
    Hsu, Daniel
    Ma, Siyuan
    Mandal, Soumik
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (32) : 15849 - 15854
  • [9] Bhagoji A. N., 2019, Advances in Neural Information Processing Systems, P7498
  • [10] Wild patterns: Ten years after the rise of adversarial machine learning
    Biggio, Battista
    Roli, Fabio
    [J]. PATTERN RECOGNITION, 2018, 84 : 317 - 331