Towards Transferable Adversarial Attacks with Centralized Perturbation

Cited by: 0
Authors
Wu, Shangbo [1 ]
Tan, Yu-an [1 ]
Wang, Yajie [1 ]
Ma, Ruinan [1 ]
Ma, Wencong [2 ]
Li, Yuanzhang [2 ]
Affiliations
[1] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
Source
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6 | 2024
Funding
National Natural Science Foundation of China;
Keywords
EXAMPLES;
DOI
None available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Adversarial transferability enables black-box attacks on unknown victim deep neural networks (DNNs), rendering attacks viable in real-world scenarios. Current transferable attacks create adversarial perturbation over the entire image, resulting in excessive noise that overfits the source model. Concentrating perturbation on dominant, model-agnostic image regions is crucial to improving adversarial efficacy. However, limiting perturbation to local regions in the spatial domain proves inadequate for augmenting transferability. To this end, we propose a transferable adversarial attack with fine-grained perturbation optimization in the frequency domain, creating centralized perturbation. We devise a systematic pipeline that dynamically constrains perturbation optimization to dominant frequency coefficients. The constraint is optimized in parallel at each iteration, ensuring that perturbation optimization remains directionally aligned with model prediction. Our approach centralizes perturbation on sample-specific important frequency features, which are shared across DNNs, effectively mitigating source-model overfitting. Experiments demonstrate that by dynamically centralizing perturbation on dominant frequency coefficients, crafted adversarial examples exhibit stronger transferability and bypass various defenses.
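The record does not include the authors' implementation. As a rough illustration of the general idea the abstract describes (not the paper's exact pipeline), the sketch below projects a perturbation onto its dominant frequency coefficients via a 2-D DCT: coefficients below a magnitude threshold are zeroed, so the perturbation's energy is concentrated on the strongest frequency components. The function name, the `keep_ratio` parameter, and the use of the DCT specifically are all assumptions for illustration.

```python
import numpy as np
from scipy.fft import dctn, idctn


def centralize_perturbation(delta, keep_ratio=0.1):
    """Keep only the dominant DCT coefficients of a perturbation.

    Illustrative sketch (assumed, not the paper's method): transform
    `delta` to the DCT domain, retain the top `keep_ratio` fraction of
    coefficients by magnitude, zero the rest, and transform back.
    """
    coeffs = dctn(delta, norm="ortho")
    flat = np.abs(coeffs).ravel()
    k = max(1, int(keep_ratio * flat.size))
    # Magnitude of the k-th largest coefficient; everything smaller is dropped.
    thresh = np.partition(flat, -k)[-k]
    mask = np.abs(coeffs) >= thresh
    return idctn(coeffs * mask, norm="ortho")


# Example: centralize a random perturbation on a 32x32 single-channel image.
rng = np.random.default_rng(0)
delta = rng.normal(scale=0.03, size=(32, 32))
centralized = centralize_perturbation(delta, keep_ratio=0.1)
```

Because the orthonormal DCT preserves the L2 norm, zeroing coefficients can only shrink the perturbation's energy; in an attack loop such a projection would be applied to the gradient or perturbation at each iteration, with the retained-coefficient set updated dynamically.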
Pages: 6109-6116
Page count: 8