On the Effectiveness of Adversarial Training Against Backdoor Attacks

被引：5

作者：

Gao, Yinghua ^{[1
]}

Wu, Dongxian ^{[2
]}

Zhang, Jingfeng ^{[3
]}

Gan, Guanhao ^{[1
]}

Xia, Shu-Tao ^{[1
]}

Niu, Gang ^{[3
]}

Sugiyama, Masashi ^{[2
,3
]}

机构：

[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518071, Peoples R China

[2] Univ Tokyo, Dept Complex Sci & Engn, Chiba 2778561, Japan

[3] RIKEN, Ctr Adv Intelligence Project AIP, Tokyo 1030027, Japan

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Adversarial training (AT); backdoor attack; deep learning; robustness; HEART-DISEASE; LIFE;

D O I：

10.1109/TNNLS.2023.3281872

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although adversarial training (AT) is regarded as a potential defense against backdoor attacks, AT and its variants have only yielded unsatisfactory results or have even inversely strengthened backdoor attacks. The large discrepancy between expectations and reality motivates us to thoroughly evaluate the effectiveness of AT against backdoor attacks across various settings for AT and backdoor attacks. We find that the type and budget of perturbations used in AT are important, and AT with common perturbations is only effective for certain backdoor trigger patterns. Based on these empirical findings, we present some practical suggestions for backdoor defense, including relaxed adversarial perturbation and composite AT. This work not only boosts our confidence in AT's ability to defend against backdoor attacks but also provides some important insights for future research.

引用

页码：14878 / 14888

页数：11

共 39 条

[1] A smart healthcare monitoring system for heart disease prediction based on ensemble deep learning and feature fusion [J].

Ali, Farman ;

El-Sappagh, Shaker ;

Islam, S. M. Riazul ;

Kwak, Daehan ;

Ali, Amjad ;

Imran, Muhammad ;

Kwak, Kyung-Sup .

INFORMATION FUSION, 2020, 63 :208-222

[2]

Bai Y., 2021, PROC INT C PRINCIPLE, P1

[3]

Barni M, 2019, IEEE IMAGE PROC, P101, DOI [10.1109/ICIP.2019.8802997, 10.1109/icip.2019.8802997]

[4]

Borgnia E., 2021, ARXIV

[5] STRONG DATA AUGMENTATION SANITIZES POISONING AND BACKDOOR ATTACKS WITHOUT AN ACCURACY TRADEOFF [J].

Borgnia, Eitan ;

Cherepanova, Valeriia ;

Fowl, Liam ;

Ghiasi, Amin ;

Geiping, Jonas ;

Goldblum, Micah ;

Goldstein, Tom ;

Gupta, Arjun .

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :3855-3859

[6]

Carlini N., 2021, arXiv

[7] Spatial Memory for Context Reasoning in Object Detection [J].

Chen, Xinlei ;

Gupta, Abhinav .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4106-4116

[8] Trojan Attack on Deep Generative Models in Autonomous Driving [J].

Ding, Shaohua ;

Tian, Yulong ;

Xu, Fengyuan ;

Li, Qun ;

Zhong, Sheng .

SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM, PT I, 2019, 304 :299-318

[9]

Goldblum Micah, 2020, arXiv

[10]

Gu Tianyu, 2017, ARXIV

← 1 2 3 4 →