An Empirical Comparison of Interpretable Models to Post-Hoc Explanations

被引：2

作者：

Mahya, Parisa ^{[1
]}

Fuernkranz, Johannes ^{[1
]}

机构：

[1] Johannes Kepler Univ Linz, Inst Applicat Oriented Knowledge Proc FAW, A-4040 Linz, Austria

来源：

AI | 2023年 / 4卷 / 02期

关键词：

explainable AI; interpretable machine learning; interpretable models; black-box explanation; white-box models;

D O I：

10.3390/ai4020023

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, some effort went into explaining intransparent and black-box models, such as deep neural networks or random forests. So-called model-agnostic methods typically approximate the prediction of the intransparent black-box model with an interpretable model, without considering any specifics of the black-box model itself. It is a valid question whether direct learning of interpretable white-box models should not be preferred over post-hoc approximations of intransparent and black-box models. In this paper, we report the results of an empirical study, which compares post-hoc explanations and interpretable models on several datasets for rule-based and feature-based interpretable models. The results seem to underline that often directly learned interpretable models approximate the black-box models at least as well as their post-hoc surrogates, even though the former do not have direct access to the black-box model.

引用

页码：426 / 436

页数：11

共 50 条

[41] Ixekizumab and ustekinumab in psoriasis: post-hoc comparison of onset and duration of treatment response
Radtke, Marc
Conrad, Curdin
Schuster, Christopher
Saure, Daniel
Mert, Can
Riedl, Elisabeth
Costanzo, Antonio
JOURNAL OF DERMATOLOGICAL TREATMENT, 2022, 33 (02) : 1168 - 1170
[42] Empower Post-hoc Graph Explanations with Information Bottleneck: A Pre-training and Fine-tuning Perspective
Wang, Jihong
Luo, Minnan
Li, Jundong
Lin, Yun
Dong, Yushun
Dong, Jin Song
Zheng, Qinghua
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2349 - 2360
[43] Heterogeneous graph neural networks with post-hoc explanations for multi-modal and explainable land use inference
Zhai, Xuehao
Jiang, Junqi
Dejl, Adam
Rago, Antonio
Guo, Fangce
Toni, Francesca
Sivakumar, Aruna
INFORMATION FUSION, 2025, 120
[44] Why Don't XAI Techniques Agree? Characterizing the Disagreements Between Post-hoc Explanations of Defect Predictions
Roy, Saumendu
Laberge, Gabriel
Roy, Banani
Khomh, Foutse
Nikanjam, Amin
Mondal, Saikat
2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 444 - 448
[45] Generating post-hoc explanations for Skip-gram-based node embeddings by identifying important nodes with bridgeness
Park, Hogun
Neville, Jennifer
NEURAL NETWORKS, 2023, 164 : 546 - 561
[46] Augmenting post-hoc explanations for predictive process monitoring with uncertainty quantification via conformalized Monte Carlo dropout
Mehdiyev, Nijat
Majlatow, Maxim
Fettke, Peter
DATA & KNOWLEDGE ENGINEERING, 2025, 156
[47] Explaining black-box classifiers using post-hoc explanations-by-example: The effect of explanations and error-rates in XAI user studies
Kenny, Eoin M.
Ford, Courtney
Quinn, Molly
Keane, Mark T.
ARTIFICIAL INTELLIGENCE, 2021, 294
[48] Post-hoc data analysis: benefits and limitations
Curran-Everett, Douglas
Milgrom, Henry
CURRENT OPINION IN ALLERGY AND CLINICAL IMMUNOLOGY, 2013, 13 (03) : 223 - 224
[49] Limitations of Post-Hoc Feature Alignment for Robustness
Burns, Collin
Steinhardt, Jacob
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2525 - 2533
[50] Post-hoc Counterfactual Generation with Supervised Autoencoder
Guyomard, Victor
Fessant, Francoise
Bouadi, Tassadit
Guyet, Thomas
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021, PT I, 2021, 1524 : 105 - 114

← 1 2 3 4 5 →