Iterative and Adaptive Sampling with Spatial Attention for Black-Box Model Explanations

被引:0
作者
Vasu, Bhavan [1 ]
Long, Chengjiang [1 ]
机构
[1] Kitware Inc, Clifton Pk, NY 12065 USA
来源
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2020年
关键词
ACTIVE VISUAL RECOGNITION; ENSEMBLE;
D O I
10.1109/wacv45572.2020.9093576
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have achieved great success in many real-world applications, yet it remains unclear and difficult to explain their decision-making process to an end-user. In this paper, we address the explainable AI problem for deep neural networks with our proposed framework, named IASSA, which generates an importance map indicating how salient each pixel is for the models prediction with an iterative and adaptive sampling module. We employ an affinity matrix calculated on multi-level deep learning features to explore long-range pixel-to-pixel correlation, which can shift the saliency values guided by our long-range and parameter-free spatial attention module. Extensive experiments on the MS-COCO dataset show that the proposed approach matches or exceeds the performance of state-of-the-art black-box explanation methods.
引用
收藏
页码:2949 / 2958
页数:10
相关论文
共 51 条
[11]   Collaborative Active Visual Recognition from Crowds: A Distributed Ensemble Approach [J].
Hua, Gang ;
Long, Chengjiang ;
Yang, Ming ;
Gao, Yan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (03) :582-594
[12]   Collaborative Active Learning of a Kernel Machine Ensemble for Recognition [J].
Hua, Gang ;
Long, Chengjiang ;
Yang, Ming ;
Gao, Yan .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1209-1216
[13]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[14]   The structure and function of explanations [J].
Lombrozo, Tania .
TRENDS IN COGNITIVE SCIENCES, 2006, 10 (10) :464-470
[15]   The Instrumental Value of Explanations [J].
Lombrozo, Tania .
PHILOSOPHY COMPASS, 2011, 6 (08) :539-551
[16]  
Long C., 2017, IEEE INT C COMP VIS
[17]   Deep Neural Networks in Fully Connected CRF for Image Labeling with Social Network Metadata [J].
Long, Chengjiang ;
Collins, Roddy ;
Swears, Eran ;
Hoogs, Anthony .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1607-1615
[18]   Correlational Gaussian Processes for Cross-domain Visual Recognition [J].
Long, Chengjiang ;
Hua, Gang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4932-4940
[19]   Multi-class Multi-annotator Active Learning with Robust Gaussian Process for Visual Recognition [J].
Long, Chengjiang ;
Hua, Gang .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2839-2847
[20]   A Joint Gaussian Process Model for Active Visual Recognition with Expertise Estimation in Crowdsourcing [J].
Long, Chengjiang ;
Hua, Gang ;
Kapoor, Ashish .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (02) :136-160