A Generalized Framework for Adversarial Attack Detection and Prevention Using Grad-CAM and Clustering Techniques

被引：1

作者：

Sim, Jeong-Hyun ^{[1
]}

Song, Hyun-Min ^{[1
]}

机构：

[1] Dankook Univ, Dept Ind Secur, Jukjeon Ro 152, Yongin 16890, South Korea

来源：

SYSTEMS | 2025年 / 13卷 / 02期

关键词：

adversarial attack; AI security; computer vision; explainable AI; ROBUSTNESS;

D O I：

10.3390/systems13020088

中图分类号：

C [社会科学总论];

学科分类号：

03 ; 0303 ;

摘要：

Through advances in AI-based computer vision technology, the performance of modern image classification models has surpassed human perception, making them valuable in various fields. However, adversarial attacks, which involve small changes to images that are hard for humans to perceive, can cause classification models to misclassify images. Considering the availability of classification models that use neural networks, it is crucial to prevent adversarial attacks. Recent detection methods are only effective for specific attacks or cannot be applied to various models. Therefore, in this paper, we proposed an attention mechanism-based method for detecting adversarial attacks. We utilized a framework using an ensemble model, Grad-CAM and calculated the silhouette coefficient for detection. We applied this method to Resnet18, Mobilenetv2, and VGG16 classification models that were fine-tuned on the CIFAR-10 dataset. The average performance demonstrated that Mobilenetv2 achieved an F1-Score of 0.9022 and an accuracy of 0.9103, Resnet18 achieved an F1-Score of 0.9124 and an accuracy of 0.9302, and VGG16 achieved an F1-Score of 0.9185 and an accuracy of 0.9252. The results demonstrated that our method not only detects but also prevents adversarial attacks by mitigating their effects and effectively restoring labels.

引用

页数：23

共 27 条

[1] Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey [J].

Akhtar, Naveed ;

Mian, Ajmal .

IEEE ACCESS, 2018, 6 :14410-14430

[2]

Alparslan Yigit., arXiv

[3]

Baluja S, 2017, Arxiv, DOI arXiv:1703.09387

[4] Towards Evaluating the Robustness of Neural Networks [J].

Carlini, Nicholas ;

Wagner, David .

2017 IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP), 2017, :39-57

[5] Adversarial image detection in deep neural networks [J].

Carrara, Fabio ;

Falchi, Fabrizio ;

Caldelli, Roberto ;

Amato, Giuseppe ;

Becarelli, Rudy .

MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (03) :2815-2835

[6]

Chernikova A, 2019, IEEE SEC PRIV WORKS, P132, DOI 10.1109/SPW.2019.00033

[7]

Croce F, 2020, PR MACH LEARN RES, V119

[8] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[9]

Goodfellow IJ, 2015, Arxiv, DOI [arXiv:1412.6572, 10.48550/arXiv.1412.6572]

[10]

Kurakin Alexey., 2018, ARTIF INTELL, P99, DOI DOI 10.48550/ARXIV.1607.02533

← 1 2 3 →