Improving the transferability of adversarial attacks via self-ensemble

被引：1

作者：

Cheng, Shuyan ^{[1
]}

Li, Peng ^{[1
]}

Liu, Jianguo ^{[1
]}

Xu, He ^{[1
]}

Yao, Yudong ^{[2
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China

[2] Stevens Inst Technol, Dept Elect & Comp Engn, Hoboken, NJ 07030 USA

来源：

APPLIED INTELLIGENCE | 2024年 / 54卷 / 21期

基金：

中国国家自然科学基金;

关键词：

Black-box attacks; Transferability; Adversarial examples; Self-ensemble; Feature importance;

D O I：

10.1007/s10489-024-05728-z

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks have been used extensively for diverse visual tasks, including object detection, face recognition, and image classification. However, they face several security threats, such as adversarial attacks. To improve the resistance of neural networks to adversarial attacks, researchers have investigated the security issues of models from the perspectives of both attacks and defenses. Recently, the transferability of adversarial attacks has received extensive attention, which promotes the application of adversarial attacks in practical scenarios. However, existing transferable attacks tend to trap into a poor local optimum and significantly degrade the transferability because the production of adversarial samples lacks randomness. Therefore, we propose a self-ensemble-based feature-level adversarial attack (SEFA) to boost transferability by randomly disrupting salient features. We provide theoretical analysis to demonstrate the superiority of the proposed method. In particular, perturbing the refined feature importance weighted intermediate features suppresses positive features and encourages negative features to realize adversarial attacks. Subsequently, self-ensemble is introduced to solve the optimization problem, thus enhancing the diversity from an optimization perspective. The diverse orthogonal initial perturbations disrupt these features stochastically, searching the space of transferable perturbations exhaustively to avoid poor local optima and improve transferability effectively. Extensive experiments show the effectiveness and superiority of the proposed SEFA, i.e., the success rates against undefended models and defense models are improved by 7.7% and 13.4%, respectively, compared with existing transferable attacks. Our code is available at https://github.com/chengshuyan/SEFA.

引用

页码：10608 / 10626

页数：19

共 50 条

[1] Improving Adversarial Transferability via Neuron Attribution-based Attacks
Zhang, Jianping
Wu, Weibin
Huang, Jen-tse
Huang, Yizhan
Wang, Wenxuan
Su, Yuxin
Lyu, Michael R.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14973 - 14982
[2] Improving the adversarial transferability with relational graphs ensemble adversarial attack
Pi, Jiatian
Luo, Chaoyang
Xia, Fen
Jiang, Ning
Wu, Haiying
Wu, Zhiyou
FRONTIERS IN NEUROSCIENCE, 2023, 16
[3] Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation
Yi-Ge Xu
Xi-Peng Qiu
Li-Gao Zhou
Xuan-Jing Huang
Journal of Computer Science and Technology, 2023, 38 : 853 - 866
[4] Improving transferability of adversarial examples via statistical attribution-based attacks
Zhu, Hegui
Jia, Yanmeng
Yan, Yue
Yang, Ze
NEURAL NETWORKS, 2025, 187
[5] Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation
Xu, Yi-Ge
Qiu, Xi-Peng
Zhou, Li-Gao
Huang, Xuan-Jing
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (04) : 853 - 866
[6] GM-Attack: Improving the Transferability of Adversarial Attacks
Hong, Jinbang
Tang, Keke
Gao, Chao
Wang, Songxin
Guo, Sensen
Zhu, Peican
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 489 - 500
[7] IMPROVING ADVERSARIAL TRANSFERABILITY VIA FEATURE TRANSLATION
Kim, Yoonji
Cho, Seungju
Byun, Junyoung
Kwon, Myung-Joon
Kim, Changick
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3359 - 3363
[8] Improving Adversarial Transferability via Model Alignment
Ma, Avery
Farahmand, Amir-Massoud
Pan, Yangchen
Torr, Philip
Gu, Jindong
COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 74 - 92
[9] Improving Transferability of Adversarial Attacks with Gaussian Gradient Enhance Momentum
Wang, Jinwei
Wang, Maoyuan
Wu, Hao
Ma, Bin
Luo, Xiangyang
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 421 - 432
[10] IMPROVING VISUAL QUALITY AND TRANSFERABILITY OF ADVERSARIAL ATTACKS ON FACE RECOGNITION SIMULTANEOUSLY WITH ADVERSARIAL RESTORATION
Zhou, Fengfan
Ling, Hefei
Shi, Yuxuan
Chen, Jiazhong
Li, Ping
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4540 - 4544

← 1 2 3 4 5 →