SPATIAL SENSITIVE GRAD-CAM: VISUAL EXPLANATIONS FOR OBJECT DETECTION BY INCORPORATING SPATIAL SENSITIVITY

被引:10
作者
Yamauchi, Toshinori [1 ]
Ishikawa, Masayoshi [1 ]
机构
[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan
来源
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2022年
关键词
XAI; Visual Explanation; Object Detection; Grad-CAM;
D O I
10.1109/ICIP46576.2022.9897350
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual explanations are important to increase models' transparency. Grad-CAM [1] is an effective method because of its high class discrimination, no requirement of architectural changes, and so on. However, in detection tasks, because Grad-CAM only focuses on the importance of features but does not have spatial sensitivity, it generates heatmaps in which not related regions to detected objects are also highlighted. In this study, we propose Spatial Sensitive Grad-CAM (SSGrad-CAM), which can generate appropriate heatmaps for object detectors. SSGrad-CAM modifies the heatmap generated from Grad-CAM with space maps computed by normalizing the magnitude of gradients. In this manner, SSGrad-CAM can incorporate spatial sensitivity and focus on the importance of both features and space. Through experiments, we confirm SSGrad-CAM can generate appropriate heatmaps for detection results, and also confirm it can generate when models detect objects by paying high attention to their peripheral regions, as well.
引用
收藏
页码:256 / 260
页数:5
相关论文
共 23 条
[1]   Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].
Chattopadhay, Aditya ;
Sarkar, Anirban ;
Howlader, Prantik ;
Balasubramanian, Vineeth N. .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847
[2]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[3]   Interpretable Explanations of Black Boxes by Meaningful Perturbation [J].
Fong, Ruth C. ;
Vedaldi, Andrea .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3449-3457
[4]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[5]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[6]  
HaofanWang ZifanWang, 2020, CVPR WORKSHOPS
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944
[9]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[10]  
Petsiuk V., 2018, P BRIT MACHINE VISIO