Visual interpretability for deep learning: a survey

Cited by: 556
Authors
Zhang, Quan-shi [1]
Zhu, Song-chun [1]
Affiliations
[1] Univ Calif Los Angeles, Los Angeles, CA 90095 USA
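Funding
National Science Foundation (USA)
Keywords
Artificial intelligence; Deep learning; Interpretable model
DOI
10.1631/FITEE.1700808
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract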
This paper reviews recent studies on understanding neural-network representations and on learning neural networks with interpretable/disentangled middle-layer representations. Although deep neural networks have exhibited superior performance in various tasks, interpretability has always been their Achilles' heel. At present, deep neural networks obtain high discrimination power at the cost of low interpretability of their black-box representations. We believe that high model interpretability may help people break several bottlenecks of deep learning, e.g., learning from a few annotations, learning via human-computer communication at the semantic level, and semantically debugging network representations. We focus on convolutional neural networks (CNNs), and revisit the visualization of CNN representations, methods of diagnosing representations of pre-trained CNNs, approaches for disentangling pre-trained CNN representations, learning of CNNs with disentangled representations, and middle-to-end learning based on model interpretability. Finally, we discuss prospective trends in explainable artificial intelligence.
Pages: 27-39
Number of pages: 13