Interpretable and explainable machine learning: A methods-centric overview with concrete examples

被引：48

作者：

Marcinkevics, Ricards ^{[1
]}

Vogt, Julia E. ^{[1
]}

机构：

[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland

来源：

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY | 2023年 / 13卷 / 03期

关键词：

explainability; interpretability; machine learning; neural networks; FALSE DISCOVERY RATE; BLACK-BOX; CLASSIFICATION; EXPLANATIONS; REGRESSION;

D O I：

10.1002/widm.1493

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Interpretability and explainability are crucial for machine learning (ML) and statistical applications in medicine, economics, law, and natural sciences and form an essential principle for ML model design and development. Although interpretability and explainability have escaped a precise and universal definition, many models and techniques motivated by these properties have been developed over the last 30 years, with the focus currently shifting toward deep learning. We will consider concrete examples of state-of-the-art, including specially tailored rule-based, sparse, and additive classification models, interpretable representation learning, and methods for explaining black-box models post hoc. The discussion will emphasize the need for and relevance of interpretability and explainability, the divide between them, and the inductive biases behind the presented "zoo" of interpretable models and explanation methods.This article is categorized under:Fundamental Concepts of Data and Knowledge > Explainable AITechnologies > Machine LearningCommercial, Legal, and Ethical Issues > Social Considerations

引用

页数：32

共 50 条

[1] A spectrum of explainable and interpretable machine learning approaches for genomic studies
Conard, Ashley Mae
DenAdel, Alan
Crawford, Lorin
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (05):
[2] Explainable and interpretable machine learning and data mining
Atzmueller, Martin
Fuernkranz, Johannes
Kliegr, Tomas
Schmid, Ute
DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (05) : 2571 - 2595
[3] Explainable AI: A Review of Machine Learning Interpretability Methods
Linardatos, Pantelis
Papastefanopoulos, Vasilis
Kotsiantis, Sotiris
ENTROPY, 2021, 23 (01) : 1 - 45
[4] The coming of age of interpretable and explainable machine learning models
Lisboa, P. J. G.
Saralajew, S.
Vellido, A.
Fernandez-Domenech, R.
Villmann, T.
NEUROCOMPUTING, 2023, 535 : 25 - 39
[5] Interpretable and Explainable Machine Learning for Ultrasonic Defect Sizing
Pyle, Richard J.
Hughes, Robert R.
Wilcox, Paul D.
IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2023, 70 (04) : 277 - 290
[6] Harnessing Prior Knowledge for Explainable Machine Learning: An Overview
Beckh, Katharina
Mueller, Sebastian
Jakobs, Matthias
Toborek, Vanessa
Tan, Hanxiao
Fischer, Raphael
Welke, Pascal
Houben, Sebastian
von Rueden, Laura
2023 IEEE CONFERENCE ON SECURE AND TRUSTWORTHY MACHINE LEARNING, SATML, 2023, : 450 - 463
[7] Definitions, methods, and applications in interpretable machine learning
Murdoch, W. James
Singh, Chandan
Kumbier, Karl
Abbasi-Asl, Reza
Yu, Bin
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (44) : 22071 - 22080
[8] A Survey of Interpretable Machine Learning Methods
Wang, Yan
Tuerhong, Gulanbaier
2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 232 - 237
[9] Exploring Evaluation Methods for Interpretable Machine Learning: A Survey
Alangari, Nourah
Menai, Mohamed El Bachir
Mathkour, Hassan
Almosallam, Ibrahim
INFORMATION, 2023, 14 (08)
[10] An interpretable schizophrenia diagnosis framework using machine learning and explainable artificial intelligence
Shivaprasad, Samhita
Chadaga, Krishnaraj
Dias, Cifha Crecil
Sampathila, Niranjana
Prabhu, Srikanth
SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)

← 1 2 3 4 5 →