Adversarial example detection based on saliency map features

Cited by: 11

Authors:

Wang, Shen [1]

Gong, Yuxin [1]

Affiliations:

[1] Harbin Inst Technol, Harbin, Peoples R China

Source:

APPLIED INTELLIGENCE | 2022 / Vol. 52 / Issue 06

Keywords:

Machine learning; Adversarial example detection; Interpretability; Saliency map;

DOI:

10.1007/s10489-021-02759-8

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, machine learning has greatly improved image recognition capability. However, studies have shown that neural network models are vulnerable to adversarial examples that make models output wrong answers with high confidence. To understand the vulnerabilities of models, we use interpretability methods to reveal the internal decision-making behaviors of models. The interpretation results show that the nonnormalized saliency maps of clean and adversarial examples become increasingly differentiated along the model's hidden layers. By taking advantage of this phenomenon, we propose an adversarial example detection method based on multilayer saliency features, which can comprehensively capture the abnormal characteristics of adversarial example interpretations. Experimental results show that the proposed method can effectively detect adversarial examples generated by gradient-based, optimization-based and black-box attacks, and that its performance is comparable to that of state-of-the-art methods.
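
Illustrative sketch (not taken from the paper): the abstract does not specify how the multilayer saliency features are computed, so the code below is only one possible reading. It assumes a small bias-free ReLU network, uses gradient-times-activation as the nonnormalized saliency at each layer, and summarizes each layer with three statistics. The names forward, layer_saliency and saliency_features, the network shape and the choice of statistics are all hypothetical.

```python
import numpy as np

def forward(x, weights):
    """Run a bias-free ReLU MLP; return the logits and each layer's input."""
    acts = [x]
    h = x
    for W in weights[:-1]:
        h = np.maximum(h @ W, 0.0)
        acts.append(h)
    return h @ weights[-1], acts

def layer_saliency(x, weights):
    """Nonnormalized gradient*activation saliency of the predicted logit,
    evaluated at the input and at every hidden layer (an assumed definition)."""
    logits, acts = forward(x, weights)
    c = int(np.argmax(logits))
    # Gradient of logit c with respect to the last hidden activation.
    grad = weights[-1][:, c].copy()
    saliency = []
    for i in range(len(weights) - 1, -1, -1):
        saliency.append(grad * acts[i])
        if i > 0:
            # Backpropagate through acts[i] = relu(acts[i-1] @ weights[i-1]).
            grad = ((acts[i] > 0) * grad) @ weights[i - 1].T
    return saliency[::-1]  # ordered from input layer to last hidden layer

def saliency_features(x, weights):
    """Concatenate simple per-layer statistics into one feature vector."""
    feats = []
    for s in layer_saliency(x, weights):
        feats.extend([s.mean(), s.std(), np.abs(s).max()])
    return np.array(feats)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical toy network: 4 inputs, two hidden layers, 3 classes.
    toy_weights = [rng.normal(size=(4, 5)),
                   rng.normal(size=(5, 6)),
                   rng.normal(size=(6, 3))]
    sample = rng.normal(size=4)
    print(saliency_features(sample, toy_weights))
```

In such a setup, feature vectors computed for known clean and known adversarial inputs would be used to train a separate binary classifier that acts as the detector; that step is omitted here because the abstract gives no details about it.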

Pages: 6262 - 6275

Page count: 14
