Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications

被引：692

作者：

Samek, Wojciech ^{[1
,2
]}

Montavon, Gregoire ^{[2
,3
]}

Lapuschkin, Sebastian ^{[1
]}

Anders, Christopher J. ^{[2
,3
]}

Mueller, Klaus-Robert ^{[2
,3
,4
,5
]}

机构：

[1] Fraunhofer Heinrich Hertz Inst, Dept Artificial Intelligence, D-10587 Berlin, Germany

[2] BIFOLD Berlin Inst Fdn Learning & Data, D-10587 Berlin, Germany

[3] Tech Univ Berlin, Machine Learning Grp, D-10587 Berlin, Germany

[4] Korea Univ, Dept Artificial Intelligence, Seoul 136713, South Korea

[5] Max Planck Inst Informat, D-66123 Saarbrocken, Germany

来源：

PROCEEDINGS OF THE IEEE | 2021年 / 109卷 / 03期

关键词：

Black-box models; deep learning; explainable artificial intelligence (XAI); Interpretability; model transparency; neural networks; CLASSIFICATION; MODELS; EXPLANATION; PREDICTION; DECISIONS;

D O I：

10.1109/JPROC.2021.3060483

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the broader and highly successful usage of machine learning (ML) in industry and the sciences, there has been a growing demand for explainable artificial intelligence (XAI). Interpretability and explanation methods for gaining a better understanding of the problem-solving abilities and strategies of nonlinear ML, in particular, deep neural networks, are, therefore, receiving increased attention. In this work, we aim to: 1) provide a timely overview of this active emerging field, with a focus on "post hoc" explanations, and explain its theoretical foundations; 2) put interpretability algorithms to a test both from a theory and comparative evaluation perspective using extensive simulations; 3) outline best practice aspects, i.e., how to best include interpretation methods into the standard usage of ML; and 4) demonstrate successful usage of XAI in a representative selection of application scenarios. Finally, we discuss challenges and possible future directions of this exciting foundational field of ML.

引用

页码：247 / 278

页数：32

共 185 条

[1]

Adebayo J, 2018, P INT C LEARN REPR

[2]

Agarwal C., 2020, P AS C COMP VIS

[3]

Ancona M., 2018, INT C LEARNING REPRE

[4]

Ancona Marco, 2019, INT C MACHINE LEARNI, P272

[5]

Anders C. J., 2019, Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, P297, DOI DOI 10.1007/978-3-030-28954-6_16

[6]

Nguyen A, 2016, ADV NEUR IN, V29

[7]

Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640

[8]

[Anonymous], 2016, Adv. Neural. Inf. Process. Syst

[9]

[Anonymous], 2004, P 10 ACM SIGKDD INT, DOI [DOI 10.1145/1014052.101411, DOI 10.1145/1014052.1014118]

[10]

Arjona-Medina JA, 2019, ADV NEUR IN, V32

← 1 2 3 4 5 6 7 8 9 10 →