Adversarial example detection based on saliency map features

Cited by: 11
Authors
Wang, Shen [1]
Gong, Yuxin [1]
Affiliation
[1] Harbin Inst Technol, Harbin, Peoples R China
Keywords
Machine learning; Adversarial example detection; Interpretability; Saliency map
DOI
10.1007/s10489-021-02759-8
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, machine learning has greatly improved image recognition capability. However, studies have shown that neural network models are vulnerable to adversarial examples, which make models output wrong answers with high confidence. To understand these vulnerabilities, we use interpretability methods to reveal the models' internal decision-making behavior. The interpretation results show that the nonnormalized saliency maps of clean and adversarial examples evolve differently and become increasingly differentiated along the model's hidden layers. Taking advantage of this phenomenon, we propose an adversarial example detection method based on multilayer saliency features, which comprehensively captures the abnormal characteristics of adversarial example interpretations. Experimental results show that the proposed method effectively detects adversarial examples generated by gradient-based, optimization-based, and black-box attacks, and that its performance is comparable to that of state-of-the-art methods.
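The abstract does not say how the saliency maps are computed or which features are extracted from them, so the following is only a minimal sketch of the general idea under stated assumptions: a toy two-layer ReLU network with made-up weights, plain gradients of the top logit as the nonnormalized saliency at each layer, and two hypothetical summary statistics (mean and maximum absolute value) per layer. None of these names or choices come from the paper.

```python
# Toy two-layer ReLU network. The weights are illustrative, not from the paper.
W1 = [[1.0, -1.0], [0.5, 2.0], [-1.5, 0.5]]   # hidden (3) x input (2)
W2 = [[1.0, 0.0, -1.0], [-0.5, 1.0, 0.5]]     # logits (2) x hidden (3)


def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def forward(x):
    """Return pre-activations, hidden activations, and logits for input x."""
    pre = matvec(W1, x)
    hidden = [max(0.0, p) for p in pre]
    logits = matvec(W2, hidden)
    return pre, hidden, logits


def layer_saliency(x):
    """Gradient of the top logit w.r.t. the hidden layer and the input.

    The gradients are left unnormalized (assumption: plain gradient saliency).
    """
    pre, hidden, logits = forward(x)
    top = max(range(len(logits)), key=lambda k: logits[k])
    grad_hidden = list(W2[top])                       # d logit / d hidden
    # Backpropagate through ReLU: gradient passes only where pre-activation > 0.
    grad_pre = [g if p > 0 else 0.0 for g, p in zip(grad_hidden, pre)]
    grad_input = [
        sum(W1[j][i] * grad_pre[j] for j in range(len(W1)))
        for i in range(len(x))
    ]
    return {"input": grad_input, "hidden": grad_hidden}


def saliency_features(x):
    """Concatenate per-layer statistics (mean |g|, max |g|) into one vector."""
    saliency = layer_saliency(x)
    feats = []
    for name in ("input", "hidden"):
        mags = [abs(v) for v in saliency[name]]
        feats += [sum(mags) / len(mags), max(mags)]
    return feats


features = saliency_features([1.0, 0.5])
print(features)
```

In a detector built along these lines, feature vectors like this one would be computed for both clean and adversarial inputs and passed to a binary classifier; that training step is omitted here.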
Pages: 6262-6275
Number of pages: 14