Interpretable and explainable machine learning: A methods-centric overview with concrete examples

被引:48
|
作者
Marcinkevics, Ricards [1 ]
Vogt, Julia E. [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
关键词
explainability; interpretability; machine learning; neural networks; FALSE DISCOVERY RATE; BLACK-BOX; CLASSIFICATION; EXPLANATIONS; REGRESSION;
D O I
10.1002/widm.1493
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interpretability and explainability are crucial for machine learning (ML) and statistical applications in medicine, economics, law, and natural sciences and form an essential principle for ML model design and development. Although interpretability and explainability have escaped a precise and universal definition, many models and techniques motivated by these properties have been developed over the last 30 years, with the focus currently shifting toward deep learning. We will consider concrete examples of state-of-the-art, including specially tailored rule-based, sparse, and additive classification models, interpretable representation learning, and methods for explaining black-box models post hoc. The discussion will emphasize the need for and relevance of interpretability and explainability, the divide between them, and the inductive biases behind the presented "zoo" of interpretable models and explanation methods.This article is categorized under:Fundamental Concepts of Data and Knowledge > Explainable AITechnologies > Machine LearningCommercial, Legal, and Ethical Issues > Social Considerations
引用
收藏
页数:32
相关论文
共 50 条
  • [1] A spectrum of explainable and interpretable machine learning approaches for genomic studies
    Conard, Ashley Mae
    DenAdel, Alan
    Crawford, Lorin
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (05):
  • [2] Explainable and interpretable machine learning and data mining
    Atzmueller, Martin
    Fuernkranz, Johannes
    Kliegr, Tomas
    Schmid, Ute
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (05) : 2571 - 2595
  • [3] Explainable AI: A Review of Machine Learning Interpretability Methods
    Linardatos, Pantelis
    Papastefanopoulos, Vasilis
    Kotsiantis, Sotiris
    ENTROPY, 2021, 23 (01) : 1 - 45
  • [4] The coming of age of interpretable and explainable machine learning models
    Lisboa, P. J. G.
    Saralajew, S.
    Vellido, A.
    Fernandez-Domenech, R.
    Villmann, T.
    NEUROCOMPUTING, 2023, 535 : 25 - 39
  • [5] Interpretable and Explainable Machine Learning for Ultrasonic Defect Sizing
    Pyle, Richard J.
    Hughes, Robert R.
    Wilcox, Paul D.
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2023, 70 (04) : 277 - 290
  • [6] Harnessing Prior Knowledge for Explainable Machine Learning: An Overview
    Beckh, Katharina
    Mueller, Sebastian
    Jakobs, Matthias
    Toborek, Vanessa
    Tan, Hanxiao
    Fischer, Raphael
    Welke, Pascal
    Houben, Sebastian
    von Rueden, Laura
    2023 IEEE CONFERENCE ON SECURE AND TRUSTWORTHY MACHINE LEARNING, SATML, 2023, : 450 - 463
  • [7] Definitions, methods, and applications in interpretable machine learning
    Murdoch, W. James
    Singh, Chandan
    Kumbier, Karl
    Abbasi-Asl, Reza
    Yu, Bin
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (44) : 22071 - 22080
  • [8] A Survey of Interpretable Machine Learning Methods
    Wang, Yan
    Tuerhong, Gulanbaier
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 232 - 237
  • [9] Exploring Evaluation Methods for Interpretable Machine Learning: A Survey
    Alangari, Nourah
    Menai, Mohamed El Bachir
    Mathkour, Hassan
    Almosallam, Ibrahim
    INFORMATION, 2023, 14 (08)
  • [10] An interpretable schizophrenia diagnosis framework using machine learning and explainable artificial intelligence
    Shivaprasad, Samhita
    Chadaga, Krishnaraj
    Dias, Cifha Crecil
    Sampathila, Niranjana
    Prabhu, Srikanth
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)