Neural Prototype Trees for Interpretable Fine-grained Image Recognition

Cited by: 132

Authors:

Nauta, Meike [1]

van Bree, Ron [1]

Seifert, Christin [1,2]

Affiliations:

[1] Univ Twente, Enschede, Netherlands

[2] Univ Duisburg Essen, Duisburg, Germany

Source:

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021) | 2021

Keywords:

BLACK-BOX;

DOI:

10.1109/CVPR46437.2021.01469

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Prototype-based methods use interpretable representations to address the black-box nature of deep learning models, in contrast to post-hoc explanation methods that only approximate such models. We propose the Neural Prototype Tree (ProtoTree), an intrinsically interpretable deep learning method for fine-grained image recognition. ProtoTree combines prototype learning with decision trees, and thus results in a globally interpretable model by design. Additionally, ProtoTree can locally explain a single prediction by outlining a decision path through the tree. Each node in our binary tree contains a trainable prototypical part. The presence or absence of this learned prototype in an image determines the routing through a node. Decision making is therefore similar to human reasoning: Does the bird have a red throat? And an elongated beak? Then it's a hummingbird! We tune the accuracy-interpretability trade-off using ensemble methods, pruning and binarizing. We apply pruning without sacrificing accuracy, resulting in a small tree with only 8 learned prototypes along a path to classify a bird from 200 species. An ensemble of 5 ProtoTrees achieves competitive accuracy on the CUB-200-2011 and Stanford Cars data sets. Code is available at github.com/M-Nauta/ProtoTree.
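The routing idea in the abstract (a prototype is either present or absent in the image, and that decides which child node is visited) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the names `Node`, `presence_score`, and `classify`, the 2-dimensional toy patch vectors, the similarity `exp(-min squared distance)`, and the fixed threshold of 0.5 are all hypothetical, and training of the prototypes is omitted entirely.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Patch = Sequence[float]


@dataclass
class Node:
    # Internal nodes carry a prototype and two children; leaves carry a label.
    prototype: Optional[Patch] = None
    absent: Optional["Node"] = None   # child visited when the prototype is absent
    present: Optional["Node"] = None  # child visited when the prototype is present
    label: Optional[str] = None


def presence_score(patches: Sequence[Patch], prototype: Patch) -> float:
    """Similarity in (0, 1]: exp(-d), d = smallest squared distance to any patch."""
    best = min(
        sum((a - b) ** 2 for a, b in zip(patch, prototype)) for patch in patches
    )
    return math.exp(-best)


def classify(
    root: Node, patches: Sequence[Patch], threshold: float = 0.5
) -> Tuple[str, List[bool]]:
    """Route one image (given as a list of patch vectors) from the root to a leaf.

    Returns the leaf label and the sequence of present/absent decisions,
    which serves as the local explanation of the prediction.
    """
    node = root
    path: List[bool] = []
    while node.label is None:
        found = presence_score(patches, node.prototype) >= threshold
        path.append(found)
        node = node.present if found else node.absent
    return node.label, path


# Toy tree mirroring the abstract's example (hypothetical prototype vectors).
tree = Node(
    prototype=[1.0, 0.0],  # stands in for "red throat"
    absent=Node(label="other bird"),
    present=Node(
        prototype=[0.0, 1.0],  # stands in for "elongated beak"
        absent=Node(label="other red-throated bird"),
        present=Node(label="hummingbird"),
    ),
)

image_patches = [[0.9, 0.1], [0.1, 1.0], [0.0, 0.0]]
label, decisions = classify(tree, image_patches)
```

The list of decisions returned by `classify` is the decision path through the tree, which corresponds to what the abstract describes as a local explanation of a single prediction.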


Pages: 14928-14938

Number of pages: 11

