Interpretable and explainable machine learning: A methods-centric overview with concrete examples

被引:48
|
作者
Marcinkevics, Ricards [1 ]
Vogt, Julia E. [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
关键词
explainability; interpretability; machine learning; neural networks; FALSE DISCOVERY RATE; BLACK-BOX; CLASSIFICATION; EXPLANATIONS; REGRESSION;
D O I
10.1002/widm.1493
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interpretability and explainability are crucial for machine learning (ML) and statistical applications in medicine, economics, law, and natural sciences and form an essential principle for ML model design and development. Although interpretability and explainability have escaped a precise and universal definition, many models and techniques motivated by these properties have been developed over the last 30 years, with the focus currently shifting toward deep learning. We will consider concrete examples of state-of-the-art, including specially tailored rule-based, sparse, and additive classification models, interpretable representation learning, and methods for explaining black-box models post hoc. The discussion will emphasize the need for and relevance of interpretability and explainability, the divide between them, and the inductive biases behind the presented "zoo" of interpretable models and explanation methods.This article is categorized under:Fundamental Concepts of Data and Knowledge > Explainable AITechnologies > Machine LearningCommercial, Legal, and Ethical Issues > Social Considerations
引用
收藏
页数:32
相关论文
共 50 条
  • [21] Interpretable Prediction of a Decentralized Smart Grid Based on Machine Learning and Explainable Artificial Intelligence
    Cifci, Ahmet
    IEEE ACCESS, 2025, 13 : 36285 - 36305
  • [22] An interpretable machine learning approach for predicting the capacity and failure mode of reinforced concrete columns
    Haggag, May
    Ismail, Mohamed K.
    El-Dakhakhni, Wael
    ADVANCES IN STRUCTURAL ENGINEERING, 2025, 28 (02) : 246 - 263
  • [23] Explainable machine learning methods to predict postpartum depression risk
    Shivaprasad, Susmita
    Chadaga, Krishnaraj
    Sampathila, Niranjana
    Prabhu, Srikanth
    Chadaga, P. Rajagopala
    Swathi, K. S.
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)
  • [24] Review Study of Interpretation Methods for Future Interpretable Machine Learning
    Mi, Jian-Xun
    Li, An-Di
    Zhou, Li-Fang
    IEEE ACCESS, 2020, 8 (08): : 191969 - 191985
  • [25] Prediction of Compressive Strength of Concrete Specimens Based on Interpretable Machine Learning
    Wang, Wenhu
    Zhong, Yihui
    Liao, Gang
    Ding, Qing
    Zhang, Tuan
    Li, Xiangyang
    MATERIALS, 2024, 17 (15)
  • [26] Machine Learning from examples: Inductive and Lazy methods
    de Mantaras, RL
    Armengol, E
    DATA & KNOWLEDGE ENGINEERING, 1998, 25 (1-2) : 99 - 123
  • [27] Interpretable Machine Learning Models for Malicious Domains Detection Using Explainable Artificial Intelligence (XAI)
    Aslam, Nida
    Khan, Irfan Ullah
    Mirza, Samiha
    AlOwayed, Alanoud
    Anis, Fatima M.
    Aljuaid, Reef M.
    Baageel, Reham
    SUSTAINABILITY, 2022, 14 (12)
  • [28] Interpretable Machine Learning for Assessing the Cumulative Damage of a Reinforced Concrete Frame Induced by Seismic Sequences
    Lazaridis, Petros C.
    Kavvadias, Ioannis E.
    Demertzis, Konstantinos
    Iliadis, Lazaros
    Vasiliadis, Lazaros K.
    SUSTAINABILITY, 2023, 15 (17)
  • [29] An explainable machine learning model to predict and elucidate the compressive behavior of high-performance concrete
    Chakraborty, Debaditya
    Awolusi, Ibukun
    Gutierrez, Lilianna
    RESULTS IN ENGINEERING, 2021, 11
  • [30] A Comparison between Explainable Machine Learning Methods for Classification and Regression Problems in the Actuarial Context
    Lozano-Murcia, Catalina
    Romero, Francisco P.
    Serrano-Guerrero, Jesus
    Olivas, Jose A.
    MATHEMATICS, 2023, 11 (14)