Fast multi-resolution occlusion: a method for explaining and understanding deep neural networks

被引:6
作者
Behzadi-Khormouji, Hamed [1 ]
Rostami, Habib [1 ]
机构
[1] Persian Gulf Univ, Fac Intelligent Syst Engn & Data Sci, Comp Engn Dept, Bushehr 75168, Iran
基金
英国科研创新办公室;
关键词
Deep learning; Neural networks; Heatmap; Explaining neural networks; Interpretation of learning models; Perturbation-based visualization; VISUALIZATION; MOTION;
D O I
10.1007/s10489-020-01946-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Convolutional Neural Networks (DCNNs) contain a high level of complexity and nonlinearity, so it is not clear based on what features DCNN models make decisions and how they can reach such promising results. There are two types of visualization techniques to interpret and explain the deep models: Backpropagation-based and Perturbation-based algorithms. The most notable drawback of the backpropagation-based visualization is that they cannot be applied for all architectures, whereas Perturbation-based visualizations are totally independent of the architectures. These methods, however, take a lot of computation and memory resources which make them slow and expensive, thereby unsuitable for many real-world applications. To cope with these problems, in this paper, a perturbation-based visualization method called Fast Multi-resolution Occlusion (FMO) are presented which is efficient in terms of time and resource consumption and can be considered in real-world applications. In order to compare the FMO with five well-known Perturbation-based visualizations methods such as Occlusion Test, Super-pixel perturbation (LIME), Randomized Input Sampling (RISE), Meaningful Perturbation and Extremal Perturbation, different experiments are designed in terms of time-consumption, visualization quality and localization accuracy. All methods are applied on 5 well-known DCNNs DenseNet121, InceptionV3, InceptionResnetV2, MobileNet and ResNet50 using common benchmark datasets ImageNet, PASCAL VOC07 and COCO14. According to the experimental results, FMO is averagely 2.32 times faster than LIME on five models DenseNet121, InceptionResnetV2, InceptionV3, MobileNet and ResNet50 with images of ILSVRC2012 dataset as well as 24.84 times faster than Occlusion Test, 11.87 times faster than RISE, 8.72 times faster than Meaningful Perturbation and 10.03 times faster than Extremal Perturbation on all of the five used models with images of common dataset ImageNet without scarifying visualization quality. Moreover, the methods are evaluated in terms of localization accuracy on two hard common datasets of PASCAL VOC07 and COCO14. The results show that FMO outperforms the compared relevant methods in terms of localization accuracy. Also, FMO extends the superimposing process of the Occlusion Test method, which yields a heatmap with more visualization quality than the Occlusion Test on many colorful images.
引用
收藏
页码:2431 / 2455
页数:25
相关论文
共 70 条
[1]   Deep convolutional neural network for the automated diagnosis of congestive heart failure using ECG signals [J].
Acharya, U. Rajendra ;
Fujita, Hamido ;
Oh, Shu Lih ;
Hagiwara, Yuki ;
Tan, Jen Hong ;
Adam, Muhammad ;
Tan, Ru San .
APPLIED INTELLIGENCE, 2019, 49 (01) :16-27
[2]   Improving the Reliability of Deep Neural Networks in NLP: A Review [J].
Alshemali, Basemah ;
Kalita, Jugal .
KNOWLEDGE-BASED SYSTEMS, 2020, 191
[3]  
Ancona M., 2018, 6 INT C LEARN REPR I, DOI DOI 10.1109/TNSE.2020.2996738
[4]  
[Anonymous], 2012, Adv. Neural Inf. Process. Syst.
[5]   Identifying Individual Facial Expressions by Deconstructing a Neural Network [J].
Arbabzadah, Farhad ;
Montavon, Gregoire ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PATTERN RECOGNITION, GCPR 2016, 2016, 9796 :344-354
[6]   "What is relevant in a text document?": An interpretable machine learning approach [J].
Arras, Leila ;
Horn, Franziska ;
Montavon, Gregoire ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2017, 12 (08)
[7]   On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].
Bach, Sebastian ;
Binder, Alexander ;
Montavon, Gregoire ;
Klauschen, Frederick ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2015, 10 (07)
[8]  
Baehrens D, 2010, J MACH LEARN RES, V11, P1803
[9]  
Becker S., 2018, INTERPRETING EXPLAIN
[10]   Deep learning, reusable and problem-based architectures for detection of consolidation on chest X-ray images [J].
Behzadi-khormouji, Hamed ;
Rostami, Habib ;
Salehi, Sana ;
Derakhshande-Rishehri, Touba ;
Masoumi, Marzieh ;
Salemi, Siavash ;
Keshavarz, Ahmad ;
Gholamrezanezhad, Ali ;
Assadi, Majid ;
Batouli, Ali .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 185