Improving the transferability of adversarial attacks via self-ensemble

被引:1
|
作者
Cheng, Shuyan [1 ]
Li, Peng [1 ]
Liu, Jianguo [1 ]
Xu, He [1 ]
Yao, Yudong [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Stevens Inst Technol, Dept Elect & Comp Engn, Hoboken, NJ 07030 USA
基金
中国国家自然科学基金;
关键词
Black-box attacks; Transferability; Adversarial examples; Self-ensemble; Feature importance;
D O I
10.1007/s10489-024-05728-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have been used extensively for diverse visual tasks, including object detection, face recognition, and image classification. However, they face several security threats, such as adversarial attacks. To improve the resistance of neural networks to adversarial attacks, researchers have investigated the security issues of models from the perspectives of both attacks and defenses. Recently, the transferability of adversarial attacks has received extensive attention, which promotes the application of adversarial attacks in practical scenarios. However, existing transferable attacks tend to trap into a poor local optimum and significantly degrade the transferability because the production of adversarial samples lacks randomness. Therefore, we propose a self-ensemble-based feature-level adversarial attack (SEFA) to boost transferability by randomly disrupting salient features. We provide theoretical analysis to demonstrate the superiority of the proposed method. In particular, perturbing the refined feature importance weighted intermediate features suppresses positive features and encourages negative features to realize adversarial attacks. Subsequently, self-ensemble is introduced to solve the optimization problem, thus enhancing the diversity from an optimization perspective. The diverse orthogonal initial perturbations disrupt these features stochastically, searching the space of transferable perturbations exhaustively to avoid poor local optima and improve transferability effectively. Extensive experiments show the effectiveness and superiority of the proposed SEFA, i.e., the success rates against undefended models and defense models are improved by 7.7% and 13.4%, respectively, compared with existing transferable attacks. Our code is available at https://github.com/chengshuyan/SEFA.
引用
收藏
页码:10608 / 10626
页数:19
相关论文
共 50 条
  • [1] Improving Adversarial Transferability via Neuron Attribution-based Attacks
    Zhang, Jianping
    Wu, Weibin
    Huang, Jen-tse
    Huang, Yizhan
    Wang, Wenxuan
    Su, Yuxin
    Lyu, Michael R.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14973 - 14982
  • [2] Improving the adversarial transferability with relational graphs ensemble adversarial attack
    Pi, Jiatian
    Luo, Chaoyang
    Xia, Fen
    Jiang, Ning
    Wu, Haiying
    Wu, Zhiyou
    FRONTIERS IN NEUROSCIENCE, 2023, 16
  • [3] Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation
    Yi-Ge Xu
    Xi-Peng Qiu
    Li-Gao Zhou
    Xuan-Jing Huang
    Journal of Computer Science and Technology, 2023, 38 : 853 - 866
  • [4] Improving transferability of adversarial examples via statistical attribution-based attacks
    Zhu, Hegui
    Jia, Yanmeng
    Yan, Yue
    Yang, Ze
    NEURAL NETWORKS, 2025, 187
  • [5] Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation
    Xu, Yi-Ge
    Qiu, Xi-Peng
    Zhou, Li-Gao
    Huang, Xuan-Jing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (04) : 853 - 866
  • [6] GM-Attack: Improving the Transferability of Adversarial Attacks
    Hong, Jinbang
    Tang, Keke
    Gao, Chao
    Wang, Songxin
    Guo, Sensen
    Zhu, Peican
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 489 - 500
  • [7] IMPROVING ADVERSARIAL TRANSFERABILITY VIA FEATURE TRANSLATION
    Kim, Yoonji
    Cho, Seungju
    Byun, Junyoung
    Kwon, Myung-Joon
    Kim, Changick
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3359 - 3363
  • [8] Improving Adversarial Transferability via Model Alignment
    Ma, Avery
    Farahmand, Amir-Massoud
    Pan, Yangchen
    Torr, Philip
    Gu, Jindong
    COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 74 - 92
  • [9] Improving Transferability of Adversarial Attacks with Gaussian Gradient Enhance Momentum
    Wang, Jinwei
    Wang, Maoyuan
    Wu, Hao
    Ma, Bin
    Luo, Xiangyang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 421 - 432
  • [10] IMPROVING VISUAL QUALITY AND TRANSFERABILITY OF ADVERSARIAL ATTACKS ON FACE RECOGNITION SIMULTANEOUSLY WITH ADVERSARIAL RESTORATION
    Zhou, Fengfan
    Ling, Hefei
    Shi, Yuxuan
    Chen, Jiazhong
    Li, Ping
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4540 - 4544