Neural Prototype Trees for Interpretable Fine-grained Image Recognition

被引:132
作者
Nauta, Meike [1 ]
van Bree, Ron [1 ]
Seifert, Christin [1 ,2 ]
机构
[1] Univ Twente, Enschede, Netherlands
[2] Univ Duisburg Essen, Duisburg, Germany
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
BLACK-BOX;
D O I
10.1109/CVPR46437.2021.01469
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prototype-based methods use interpretable representations to address the black-box nature of deep learning models, in contrast to post-hoc explanation methods that only approximate such models. We propose the Neural Prototype Tree (ProtoTree), an intrinsically interpretable deep learning method for fine-grained image recognition. ProtoTree combines prototype learning with decision trees, and thus results in a globally interpretable model by design. Additionally, ProtoTree can locally explain a single prediction by outlining a decision path through the tree. Each node in our binary tree contains a trainable prototypical part. The presence or absence of this learned prototype in an image determines the routing through a node. Decision making is therefore similar to human reasoning: Does the bird have a red throat? And an elongated beak? Then it's a hummingbird! We tune the accuracy-interpretability trade-off using ensemble methods, pruning and binarizing. We apply pruning without sacrificing accuracy, resulting in a small tree with only 8 learned prototypes along a path to classify a bird from 200 species. An ensemble of 5 ProtoTrees achieves competitive accuracy on the CUB-200-2011 and Stanford Cars data sets. Code is available at github.com/M-Nauta/ProtoTree.
引用
收藏
页码:14928 / 14938
页数:11
相关论文
共 62 条
  • [1] Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
    Adadi, Amina
    Berrada, Mohammed
    [J]. IEEE ACCESS, 2018, 6 : 52138 - 52160
  • [2] Alaniz Stephan, 2019, ARXIV190201780
  • [3] Alvarez-Melis D., 2018, On the Robustness of Interpretability Methods
  • [4] Angelov P, 2020, IEEE SYS MAN CYBERN, P2092, DOI [10.1109/SMC42975.2020.9282812, 10.1109/smc42975.2020.9282812]
  • [5] [Anonymous], 2019, ADV NEUR IN
  • [6] Arik Sercan O., 2019, ARXIV190206292
  • [7] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    Bach, Sebastian
    Binder, Alexander
    Montavon, Gregoire
    Klauschen, Frederick
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2015, 10 (07):
  • [8] Belongie, 2011, CNS T 2011 001
  • [9] RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING
    BIEDERMAN, I
    [J]. PSYCHOLOGICAL REVIEW, 1987, 94 (02) : 115 - 147
  • [10] Boyan Zhou, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P9716, DOI 10.1109/CVPR42600.2020.00974