Efficient Untargeted White-Box Adversarial Attacks Based on Simple Initialization

被引:0
作者
Yunyi ZHOU
Haichang GAO
Jianping HE
Shudong ZHANG
Zihui WU
机构
[1] SchoolofComputerScienceandTechnology,XidianUniversity
关键词
D O I
暂无
中图分类号
TP391.41 []; TP18 [人工智能理论];
学科分类号
080203 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adversarial examples(AEs) are an additive amalgamation of clean examples and artificially malicious perturbations. Attackers often leverage random noise and multiple random restarts to initialize perturbation starting points, thereby increasing the diversity of AEs. Given the non-convex nature of the loss function, employing randomness to augment the attack's success rate may lead to considerable computational overhead. To overcome this challenge,we introduce the one-hot mean square error loss to guide the initialization. This loss is combined with the strongest first-order attack, the projected gradient descent, alongside a dynamic attack step size adjustment strategy to form a comprehensive attack process. Through experimental validation, we demonstrate that our method outperforms baseline attacks in constrained attack budget scenarios and regular experimental settings. This establishes it as a reliable measure for assessing the robustness of deep learning models. We explore the broader application of this initialization strategy in enhancing the defense impact of few-shot classification models. We aspire to provide valuable insights for the community in designing attack and defense mechanisms.
引用
收藏
页码:979 / 988
页数:10
相关论文
共 5 条
[1]  
Imbalanced gradients:A subtle cause of overestimated adversarial robustness..X.J.Ma;L.X.Jiang;H.X.Huang;et al;.arXiv preprint;arXiv:2006.13726.2020,
[2]  
Adversarial interpolation training:A simple approach for improving model robustness..H.C.Zhang;W.Xu;.https://openreview.net/forum?id=Syejj0NYvr.2020,
[3]  
Defense against adversarial attacks by reconstructing images..[J].Zhang Shudong;Gao Haichang;Rao Qingxun.IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.2021,
[4]  
Adversarial examples in the physical world..[J].Alexey Kurakin;Ian J. Goodfellow;Samy Bengio.CoRR.2016,
[5]  
Explaining and Harnessing Adversarial Examples..[J].Ian J. Goodfellow;Jonathon Shlens;Christian Szegedy.CoRR.2014,