Fast multi-resolution occlusion: a method for explaining and understanding deep neural networks

被引:5
作者
Behzadi-Khormouji, Hamed [1 ]
Rostami, Habib [1 ]
机构
[1] Persian Gulf Univ, Fac Intelligent Syst Engn & Data Sci, Comp Engn Dept, Bushehr 75168, Iran
关键词
Deep learning; Neural networks; Heatmap; Explaining neural networks; Interpretation of learning models; Perturbation-based visualization; VISUALIZATION; MOTION;
D O I
10.1007/s10489-020-01946-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Convolutional Neural Networks (DCNNs) contain a high level of complexity and nonlinearity, so it is not clear based on what features DCNN models make decisions and how they can reach such promising results. There are two types of visualization techniques to interpret and explain the deep models: Backpropagation-based and Perturbation-based algorithms. The most notable drawback of the backpropagation-based visualization is that they cannot be applied for all architectures, whereas Perturbation-based visualizations are totally independent of the architectures. These methods, however, take a lot of computation and memory resources which make them slow and expensive, thereby unsuitable for many real-world applications. To cope with these problems, in this paper, a perturbation-based visualization method called Fast Multi-resolution Occlusion (FMO) are presented which is efficient in terms of time and resource consumption and can be considered in real-world applications. In order to compare the FMO with five well-known Perturbation-based visualizations methods such as Occlusion Test, Super-pixel perturbation (LIME), Randomized Input Sampling (RISE), Meaningful Perturbation and Extremal Perturbation, different experiments are designed in terms of time-consumption, visualization quality and localization accuracy. All methods are applied on 5 well-known DCNNs DenseNet121, InceptionV3, InceptionResnetV2, MobileNet and ResNet50 using common benchmark datasets ImageNet, PASCAL VOC07 and COCO14. According to the experimental results, FMO is averagely 2.32 times faster than LIME on five models DenseNet121, InceptionResnetV2, InceptionV3, MobileNet and ResNet50 with images of ILSVRC2012 dataset as well as 24.84 times faster than Occlusion Test, 11.87 times faster than RISE, 8.72 times faster than Meaningful Perturbation and 10.03 times faster than Extremal Perturbation on all of the five used models with images of common dataset ImageNet without scarifying visualization quality. Moreover, the methods are evaluated in terms of localization accuracy on two hard common datasets of PASCAL VOC07 and COCO14. The results show that FMO outperforms the compared relevant methods in terms of localization accuracy. Also, FMO extends the superimposing process of the Occlusion Test method, which yields a heatmap with more visualization quality than the Occlusion Test on many colorful images.
引用
收藏
页码:2431 / 2455
页数:25
相关论文
共 70 条
  • [1] Deep convolutional neural network for the automated diagnosis of congestive heart failure using ECG signals
    Acharya, U. Rajendra
    Fujita, Hamido
    Oh, Shu Lih
    Hagiwara, Yuki
    Tan, Jen Hong
    Adam, Muhammad
    Tan, Ru San
    [J]. APPLIED INTELLIGENCE, 2019, 49 (01) : 16 - 27
  • [2] Improving the Reliability of Deep Neural Networks in NLP: A Review
    Alshemali, Basemah
    Kalita, Jugal
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 191
  • [3] Ancona M, 2018, Towards better understanding of gradient-based attribution methods for deep neural networks
  • [4] Identifying Individual Facial Expressions by Deconstructing a Neural Network
    Arbabzadah, Farhad
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PATTERN RECOGNITION, GCPR 2016, 2016, 9796 : 344 - 354
  • [5] "What is relevant in a text document?": An interpretable machine learning approach
    Arras, Leila
    Horn, Franziska
    Montavon, Gregoire
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2017, 12 (08):
  • [6] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    Bach, Sebastian
    Binder, Alexander
    Montavon, Gregoire
    Klauschen, Frederick
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2015, 10 (07):
  • [7] Baehrens D, 2010, J MACH LEARN RES, V11, P1803
  • [8] Becker Soren, 2018, INTERPRETING EXPLAIN
  • [9] Deep learning, reusable and problem-based architectures for detection of consolidation on chest X-ray images
    Behzadi-khormouji, Hamed
    Rostami, Habib
    Salehi, Sana
    Derakhshande-Rishehri, Touba
    Masoumi, Marzieh
    Salemi, Siavash
    Keshavarz, Ahmad
    Gholamrezanezhad, Ali
    Assadi, Majid
    Batouli, Ali
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 185
  • [10] Bernard N, 2019, HIGH PERFORMANCE COM, V979, P18