DLR: Adversarial examples detection and label recovery for deep neural networks

被引：0

作者：

Han, Keji ^{[1
,2
]}

Ge, Yao ^{[1
,2
]}

Wang, Ruchuan ^{[1
,3
]}

Li, Yun ^{[1
,2
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China

[2] Jiangsu Key Lab Big Data Secur & Intelligent Proc, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China

[3] Jiangsu High Technol Res Key Lab Wireless Sensor N, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China

来源：

PATTERN RECOGNITION LETTERS | 2025年 / 188卷

基金：

中国国家自然科学基金;

关键词：

Deep neural network; Generative classifier; Adversarial example; Anomaly detection;

D O I：

10.1016/j.patrec.2024.12.009

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks (DNNs) have been shown to be vulnerable to adversarial examples crafted by adversaries to deceive the target model. Two popular approaches to mitigate this issue are adversarial training and adversarial example detection. Adversarial training aims to enable the target model to accurately recognize adversarial examples in image classification tasks; however, it often lacks generalizability. Conversely, adversarial detection demonstrates good generalization but does not assist the target model in recognizing adversarial examples. In this paper, we first define the label recovery task to address the adversarial challenges faced by DNNs. We then propose a novel generative classifier specifically for the adversarial example label recovery task. This method is termed Detection and Label Recovery (DLR), which comprises two components: Detector and Recover. The Detector processes both legitimate and adversarial examples, while the Recover component seeks to ascertain the ground-truth label of the detected adversarial example. DLR effectively combines the strengths of adversarial training and adversarial example detection. Experimental results demonstrate that our method outperforms several state-of-the-art approaches.

引用

页码：133 / 139

页数：7

共 21 条

[1] Towards Evaluating the Robustness of Neural Networks
Carlini, Nicholas
Wagner, David
[J]. 2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, : 39 - 57
[2] Croce F, 2020, PR MACH LEARN RES, V119
[3] Ghosh P, 2019, AAAI CONF ARTIF INTE, P541
[4] Goodfellow IJ, 2014, PREPRINT, DOI DOI 10.48550/ARXIV.1412.6572
[5] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[6] Densely Connected Convolutional Networks
Huang, Gao
Liu, Zhuang
van der Maaten, Laurens
Weinberger, Kilian Q.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
[7] Krizhevsky Alex, 2009, LEARNING MULTIPLE LA
[8] Kurakin A., 2016, ARXIV
[9] Kurakin A, 2017, Arxiv, DOI arXiv:1607.02533
[10] Gradient-based learning applied to document recognition
Lecun, Y
Bottou, L
Bengio, Y
Haffner, P
[J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324

← 1 2 3 →