Convolutional Dynamic Alignment Networks for Interpretable Classifications

被引:35
作者
Boehle, Moritz [1 ]
Fritz, Mario [2 ]
Schiele, Bernt [1 ]
机构
[1] MPI Informat, Saarland Informat Campus, Saarbrucken, Germany
[2] CISPA Helmholtz Ctr Informat Secur, Saarbrucken, Germany
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.00990
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a new family of neural network models called Convolutional Dynamic Alignment Networks(1) (CoDA-Nets), which are performant classifiers with a high degree of inherent interpretability. Their core building blocks are Dynamic Alignment Units (DAUs), which linearly transform their input with weight vectors that dynamically align with task-relevant patterns. As a result, CoDA-Nets model the classification prediction through a series of input-dependent linear transformations, allowing for linear decomposition of the output into individual input contributions. Given the alignment of the DAUs, the resulting contribution maps align with discriminative input patterns. These model-inherent decompositions are of high visual quality and outperform existing attribution methods under quantitative metrics. Further, CoDA-Nets constitute performant classifiers, achieving on par results to ResNet and VGG models on e.g. CIFAR-10 and TinyImagenet.
引用
收藏
页码:10024 / 10033
页数:10
相关论文
共 39 条
[1]  
Alom Md Zahangir, 2020, NEURAL COMPUTING APP
[2]  
AlvarezMelis D, 2018, ADV NEURAL INFORM PR
[3]  
[Anonymous], 2018, Advances in Neural Information Processing Systems (NeurIPS
[4]   On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].
Bach, Sebastian ;
Binder, Alexander ;
Montavon, Gregoire ;
Klauschen, Frederick ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2015, 10 (07)
[5]  
Baehrens D, 2010, J MACH LEARN RES, V11, P1803
[6]  
Brendel W, 2019, ARXIV190400760
[7]  
Chen CF, 2019, ADV NEUR IN, V32
[8]  
Cubuk Ekin D., 2019, arXiv:1909.13719
[9]   THE APPROXIMATION OF ONE MATRIX BY ANOTHER OF LOWER RANK [J].
Eckart, Carl ;
Young, Gale .
PSYCHOMETRIKA, 1936, 1 (03) :211-218
[10]  
Gouk Henry, 2018, MACH LEARN KNOWL DIS