Transferable Interpolated Adversarial Attack with Random-Layer Mixup

Times Cited: 0
Authors
Ma, Size [1]
Han, Keji [1]
Long, Xianzhong [1]
Li, Yun [1]
Affiliations
[1] Nanjing University of Posts and Telecommunications, Jiangsu Key Laboratory of Big Data Security and Intelligent Processing, Nanjing, People's Republic of China
Source
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II | 2022, Vol. 13281
Funding
National Natural Science Foundation of China;
Keywords
Black-box adversarial attack; Mixup; Manifold mixup;
DOI
10.1007/978-3-031-05936-0_18
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104; 0812; 0835; 1405;
Abstract
Deep neural networks are vulnerable to adversarial examples crafted by adding imperceptible perturbations to original inputs. Despite their strength in the white-box setting, most existing adversarial attack methods mislead black-box models only at low success rates. In response, a class of image transformation-based attacks has been proposed: transformations are applied to adversarial examples during the attack iterations to improve their transferability to black-box models. A major limitation of these attacks, however, is that they apply transformations only to input images, ignoring their use on hidden representations. Based on our observation that mixup in hidden space yields higher transferability than mixup in input space, we propose the Random-Layer Mixup Attack Method (RLMAM). Our method interpolates adversarial examples with clean examples in both input space and hidden space. The interpolated adversarial representations induced by this random-layer mixup increase representation diversity in both spaces and alleviate the overfitting of adversarial examples to the white-box model. We further combine RLMAM with an enhanced momentum method. Experimental results on the ImageNet and CIFAR-10 datasets demonstrate that RLMAM outperforms state-of-the-art black-box attacks.
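As a rough illustration of the mechanism the abstract describes, the following PyTorch sketch mixes the hidden representations of an adversarial and a clean example at a randomly chosen layer each iteration, then updates the perturbation with an MI-FGSM-style momentum step. The function name random_layer_mixup_attack, the stage-splitting interface, the fixed mixing coefficient lam, and all hyperparameter defaults are assumptions for illustration, not the authors' exact formulation.

```python
# Minimal sketch of a random-layer mixup attack, assuming the model can be
# split into sequential stages. Mixing at stage 0 is input-space mixup;
# mixing at a later stage interpolates hidden representations (manifold
# mixup). All hyperparameters here are illustrative defaults.
import random
import torch
import torch.nn as nn

def random_layer_mixup_attack(stages, x, x_clean, y, *,
                              eps=8/255, alpha=2/255, steps=10,
                              lam=0.5, mu=1.0):
    """stages: list of nn.Module blocks whose composition is the classifier.
    x: original inputs (N, C, H, W); x_clean: clean examples to mix with
    (could be x itself or other clean images); y: true labels (untargeted).
    """
    loss_fn = nn.CrossEntropyLoss()
    x_adv = x.clone().detach()
    g = torch.zeros_like(x)  # accumulated momentum

    for _ in range(steps):
        x_adv.requires_grad_(True)
        k = random.randrange(len(stages))  # layer chosen at random

        # Forward both examples up to stage k, then interpolate.
        h_adv, h_clean = x_adv, x_clean
        for stage in stages[:k]:
            h_adv = stage(h_adv)
            with torch.no_grad():          # no gradient through the clean path
                h_clean = stage(h_clean)
        h = lam * h_adv + (1 - lam) * h_clean

        # Finish the forward pass from the mixed representation.
        for stage in stages[k:]:
            h = stage(h)

        loss = loss_fn(h, y)  # untargeted: maximize loss on the true label
        grad = torch.autograd.grad(loss, x_adv)[0]

        # MI-FGSM-style momentum with per-sample L1-normalized gradient.
        g = mu * g + grad / grad.abs().mean(dim=(1, 2, 3), keepdim=True)
        x_adv = x_adv.detach() + alpha * g.sign()
        # Project back into the eps-ball around x and the valid pixel range.
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)

    return x_adv
```

For a torchvision ResNet, stages might be built by wrapping the stem, layer1 through layer4, and the pooling/classifier head as separate nn.Sequential blocks; how the paper actually partitions the network is not specified in this record.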
Pages: 224-235
Page Count: 12