Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications

被引:692
作者
Samek, Wojciech [1 ,2 ]
Montavon, Gregoire [2 ,3 ]
Lapuschkin, Sebastian [1 ]
Anders, Christopher J. [2 ,3 ]
Mueller, Klaus-Robert [2 ,3 ,4 ,5 ]
机构
[1] Fraunhofer Heinrich Hertz Inst, Dept Artificial Intelligence, D-10587 Berlin, Germany
[2] BIFOLD Berlin Inst Fdn Learning & Data, D-10587 Berlin, Germany
[3] Tech Univ Berlin, Machine Learning Grp, D-10587 Berlin, Germany
[4] Korea Univ, Dept Artificial Intelligence, Seoul 136713, South Korea
[5] Max Planck Inst Informat, D-66123 Saarbrocken, Germany
关键词
Black-box models; deep learning; explainable artificial intelligence (XAI); Interpretability; model transparency; neural networks; CLASSIFICATION; MODELS; EXPLANATION; PREDICTION; DECISIONS;
D O I
10.1109/JPROC.2021.3060483
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the broader and highly successful usage of machine learning (ML) in industry and the sciences, there has been a growing demand for explainable artificial intelligence (XAI). Interpretability and explanation methods for gaining a better understanding of the problem-solving abilities and strategies of nonlinear ML, in particular, deep neural networks, are, therefore, receiving increased attention. In this work, we aim to: 1) provide a timely overview of this active emerging field, with a focus on "post hoc" explanations, and explain its theoretical foundations; 2) put interpretability algorithms to a test both from a theory and comparative evaluation perspective using extensive simulations; 3) outline best practice aspects, i.e., how to best include interpretation methods into the standard usage of ML; and 4) demonstrate successful usage of XAI in a representative selection of application scenarios. Finally, we discuss challenges and possible future directions of this exciting foundational field of ML.
引用
收藏
页码:247 / 278
页数:32
相关论文
共 185 条
[1]  
Adebayo J, 2018, P INT C LEARN REPR
[2]  
Agarwal C., 2020, P AS C COMP VIS
[3]  
Ancona M., 2018, INT C LEARNING REPRE
[4]  
Ancona Marco, 2019, INT C MACHINE LEARNI, P272
[5]  
Anders C. J., 2019, Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, P297, DOI DOI 10.1007/978-3-030-28954-6_16
[6]  
Nguyen A, 2016, ADV NEUR IN, V29
[7]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[8]  
[Anonymous], 2016, Adv. Neural. Inf. Process. Syst
[9]  
[Anonymous], 2004, P 10 ACM SIGKDD INT, DOI [DOI 10.1145/1014052.101411, DOI 10.1145/1014052.1014118]
[10]  
Arjona-Medina JA, 2019, ADV NEUR IN, V32