An Empirical Comparison of Interpretable Models to Post-Hoc Explanations

被引:2
|
作者
Mahya, Parisa [1 ]
Fuernkranz, Johannes [1 ]
机构
[1] Johannes Kepler Univ Linz, Inst Applicat Oriented Knowledge Proc FAW, A-4040 Linz, Austria
关键词
explainable AI; interpretable machine learning; interpretable models; black-box explanation; white-box models;
D O I
10.3390/ai4020023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, some effort went into explaining intransparent and black-box models, such as deep neural networks or random forests. So-called model-agnostic methods typically approximate the prediction of the intransparent black-box model with an interpretable model, without considering any specifics of the black-box model itself. It is a valid question whether direct learning of interpretable white-box models should not be preferred over post-hoc approximations of intransparent and black-box models. In this paper, we report the results of an empirical study, which compares post-hoc explanations and interpretable models on several datasets for rule-based and feature-based interpretable models. The results seem to underline that often directly learned interpretable models approximate the black-box models at least as well as their post-hoc surrogates, even though the former do not have direct access to the black-box model.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 50 条
  • [41] Ixekizumab and ustekinumab in psoriasis: post-hoc comparison of onset and duration of treatment response
    Radtke, Marc
    Conrad, Curdin
    Schuster, Christopher
    Saure, Daniel
    Mert, Can
    Riedl, Elisabeth
    Costanzo, Antonio
    JOURNAL OF DERMATOLOGICAL TREATMENT, 2022, 33 (02) : 1168 - 1170
  • [42] Empower Post-hoc Graph Explanations with Information Bottleneck: A Pre-training and Fine-tuning Perspective
    Wang, Jihong
    Luo, Minnan
    Li, Jundong
    Lin, Yun
    Dong, Yushun
    Dong, Jin Song
    Zheng, Qinghua
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2349 - 2360
  • [43] Heterogeneous graph neural networks with post-hoc explanations for multi-modal and explainable land use inference
    Zhai, Xuehao
    Jiang, Junqi
    Dejl, Adam
    Rago, Antonio
    Guo, Fangce
    Toni, Francesca
    Sivakumar, Aruna
    INFORMATION FUSION, 2025, 120
  • [44] Why Don't XAI Techniques Agree? Characterizing the Disagreements Between Post-hoc Explanations of Defect Predictions
    Roy, Saumendu
    Laberge, Gabriel
    Roy, Banani
    Khomh, Foutse
    Nikanjam, Amin
    Mondal, Saikat
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 444 - 448
  • [45] Generating post-hoc explanations for Skip-gram-based node embeddings by identifying important nodes with bridgeness
    Park, Hogun
    Neville, Jennifer
    NEURAL NETWORKS, 2023, 164 : 546 - 561
  • [46] Augmenting post-hoc explanations for predictive process monitoring with uncertainty quantification via conformalized Monte Carlo dropout
    Mehdiyev, Nijat
    Majlatow, Maxim
    Fettke, Peter
    DATA & KNOWLEDGE ENGINEERING, 2025, 156
  • [47] Explaining black-box classifiers using post-hoc explanations-by-example: The effect of explanations and error-rates in XAI user studies
    Kenny, Eoin M.
    Ford, Courtney
    Quinn, Molly
    Keane, Mark T.
    ARTIFICIAL INTELLIGENCE, 2021, 294
  • [48] Post-hoc data analysis: benefits and limitations
    Curran-Everett, Douglas
    Milgrom, Henry
    CURRENT OPINION IN ALLERGY AND CLINICAL IMMUNOLOGY, 2013, 13 (03) : 223 - 224
  • [49] Limitations of Post-Hoc Feature Alignment for Robustness
    Burns, Collin
    Steinhardt, Jacob
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2525 - 2533
  • [50] Post-hoc Counterfactual Generation with Supervised Autoencoder
    Guyomard, Victor
    Fessant, Francoise
    Bouadi, Tassadit
    Guyet, Thomas
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021, PT I, 2021, 1524 : 105 - 114