Improving Adversarial Robustness With Adversarial Augmentations

被引:2
作者
Chen, Chuanxi [1 ,2 ]
Ye, Dengpan [1 ,2 ]
He, Yiheng [1 ,2 ]
Tang, Long [1 ,2 ]
Xu, Yue [1 ,2 ]
机构
[1] Wuhan Univ, Key Lab Aerosp Informat Secur & Trusted Comp, Minist Educ, Wuhan 430072, Peoples R China
[2] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Robustness; Internet of Things; Security; Perturbation methods; Feature extraction; Data augmentation; Adversarial robustness; augmentations; contrastive learning (CL); deep neural networks (DNNs); Internet of Things (IoT) security;
D O I
10.1109/JIOT.2023.3301608
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural network (DNN)-based applications are extensively being researched and applied in the Internet of Things (IoT) devices in daily lives due to impressive performance. Recently, adversarial attacks pose a significant threat to the security of deep neural networks (DNNs), adversarial training has emerged as a promising and effective defense approach for defending against such attacks. However, existing adversarial training methods have shown limited success in defending against attacks unseen during training, thereby undermining their effectiveness. Besides, generating adversarial perturbations for adversarial training requires massive expensive labeled data, which is a critical obstacle in the robust DNNs-based IoT applications. In this article, we first explore the effective data augmentations by implementing adversarial attacks with self-supervised in latent space. Then, we propose new loss metric functions that can avoid collapse phenomenon of contrastive learning (CL) by measuring the distances between adversarial augmented pairs. Based on the extracted adversarial features in self-supervised CL, we propose a novel adversarial robust learning (ARL) method, which implements adversarial training without any labels and obtains more general robust encoder network. Our approach is validated on commonly used benchmark data sets and models, where it achieves comparable adversarial robustness against different adversarial attacks when compared to supervised adversarial training methods. Additionally, ARL outperforms state-of-the-art self-supervised adversarial learning techniques in terms of achieving higher robustness and clean prediction accuracy for the downstream classification task.
引用
收藏
页码:5105 / 5117
页数:13
相关论文
共 61 条
[1]   Square Attack: A Query-Efficient Black-Box Adversarial Attack via Random Search [J].
Andriushchenko, Maksym ;
Croce, Francesco ;
Flammarion, Nicolas ;
Hein, Matthias .
COMPUTER VISION - ECCV 2020, PT XXIII, 2020, 12368 :484-501
[2]  
[Anonymous], 2009, LEARNING MULTIPLE LA
[3]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[4]  
Bachman P, 2019, ADV NEUR IN, V32
[5]   PDAE: Efficient network intrusion detection in IoT using parallel deep auto-encoders [J].
Basati, Amir ;
Faghih, Mohammad Mehdi .
INFORMATION SCIENCES, 2022, 598 :57-74
[6]  
Bashivan P, 2021, ADV NEUR IN, V34
[7]   Towards Evaluating the Robustness of Neural Networks [J].
Carlini, Nicholas ;
Wagner, David .
2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57
[8]  
Carmon Y., 2019, ARXIV
[9]  
Caron M, 2020, ADV NEUR IN, V33
[10]  
Chen KJ, 2020, INT CONF ACOUST SPEE, P2218, DOI [10.1109/icassp40776.2020.9054475, 10.1109/ICASSP40776.2020.9054475]