DLR: Adversarial examples detection and label recovery for deep neural networks

被引:0
作者
Han, Keji [1 ,2 ]
Ge, Yao [1 ,2 ]
Wang, Ruchuan [1 ,3 ]
Li, Yun [1 ,2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China
[2] Jiangsu Key Lab Big Data Secur & Intelligent Proc, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China
[3] Jiangsu High Technol Res Key Lab Wireless Sensor N, Wenyuan Rd 9, Nanjing 210046, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural network; Generative classifier; Adversarial example; Anomaly detection;
D O I
10.1016/j.patrec.2024.12.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks (DNNs) have been shown to be vulnerable to adversarial examples crafted by adversaries to deceive the target model. Two popular approaches to mitigate this issue are adversarial training and adversarial example detection. Adversarial training aims to enable the target model to accurately recognize adversarial examples in image classification tasks; however, it often lacks generalizability. Conversely, adversarial detection demonstrates good generalization but does not assist the target model in recognizing adversarial examples. In this paper, we first define the label recovery task to address the adversarial challenges faced by DNNs. We then propose a novel generative classifier specifically for the adversarial example label recovery task. This method is termed Detection and Label Recovery (DLR), which comprises two components: Detector and Recover. The Detector processes both legitimate and adversarial examples, while the Recover component seeks to ascertain the ground-truth label of the detected adversarial example. DLR effectively combines the strengths of adversarial training and adversarial example detection. Experimental results demonstrate that our method outperforms several state-of-the-art approaches.
引用
收藏
页码:133 / 139
页数:7
相关论文
共 21 条
  • [1] Towards Evaluating the Robustness of Neural Networks
    Carlini, Nicholas
    Wagner, David
    [J]. 2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, : 39 - 57
  • [2] Croce F, 2020, PR MACH LEARN RES, V119
  • [3] Ghosh P, 2019, AAAI CONF ARTIF INTE, P541
  • [4] Goodfellow IJ, 2014, PREPRINT, DOI DOI 10.48550/ARXIV.1412.6572
  • [5] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [6] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [7] Krizhevsky Alex, 2009, LEARNING MULTIPLE LA
  • [8] Kurakin A., 2016, ARXIV
  • [9] Kurakin A, 2017, Arxiv, DOI arXiv:1607.02533
  • [10] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324