An Empirical Comparison of Interpretable Models to Post-Hoc Explanations

被引：2

作者：

Mahya, Parisa ^{[1
]}

Fuernkranz, Johannes ^{[1
]}

机构：

[1] Johannes Kepler Univ Linz, Inst Applicat Oriented Knowledge Proc FAW, A-4040 Linz, Austria

来源：

AI | 2023年 / 4卷 / 02期

关键词：

explainable AI; interpretable machine learning; interpretable models; black-box explanation; white-box models;

D O I：

10.3390/ai4020023

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recently, some effort went into explaining intransparent and black-box models, such as deep neural networks or random forests. So-called model-agnostic methods typically approximate the prediction of the intransparent black-box model with an interpretable model, without considering any specifics of the black-box model itself. It is a valid question whether direct learning of interpretable white-box models should not be preferred over post-hoc approximations of intransparent and black-box models. In this paper, we report the results of an empirical study, which compares post-hoc explanations and interpretable models on several datasets for rule-based and feature-based interpretable models. The results seem to underline that often directly learned interpretable models approximate the black-box models at least as well as their post-hoc surrogates, even though the former do not have direct access to the black-box model.

引用

页码：426 / 436

页数：11

共 50 条

[31] Ontology-Based Post-Hoc Neural Network Explanations Via Simultaneous Concept Extraction
Ponomarev, Andrew
Agafonov, Anton
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 433 - 446
[32] How can I choose an explainer? An Application-grounded Evaluation of Post-hoc Explanations
Jesus, Sergio
Belem, Catarina
Balayan, Vladimir
Bento, Joao
Saleiro, Pedro
Bizarro, Pedro
Gama, Joao
PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 805 - 815
[33] Normalizing trust: Participants' immediately post-hoc explanations of behaviour in Milgram's "obedience' experiments
Hollander, Matthew M.
Turowetz, Jason
BRITISH JOURNAL OF SOCIAL PSYCHOLOGY, 2017, 56 (04) : 655 - 674
[34] Generation of empirical offender profiles - Post-hoc classification in the murder of an intimate partner
Busch, TP
Scholz, OB
KRIMINALISTIK, 2001, 55 (8-9): : 549 - 556
[35] Exploring post-hoc agnostic models for explainable cooking recipe recommendations
Yera, Raciel
Alzahrani, Ahmad A.
Martinez, Luis
KNOWLEDGE-BASED SYSTEMS, 2022, 251
[36] Is PET solely a post-hoc tool to validate psychological models of memory?
Decety, J
COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE III-SCIENCES DE LA VIE-LIFE SCIENCES, 1998, 321 (2-3): : 207 - 208
[37] A Post-Hoc Interpretable Ensemble Model to Feature Effect Analysis in Warfarin Dose Prediction for Chinese Patients
Zhang, Yuzhen
Xie, Cheng
Xue, Ling
Tao, Yanyun
Yue, Guoqi
Jiang, Bin
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (02) : 840 - 851
[38] POST-HOC, NON-ERGO-PROPTER-HOC
PEIRICK, J
IEEE SPECTRUM, 1994, 31 (03) : 6 - 6
[39] Through the looking glass: evaluating post hoc explanations using transparent models
Velmurugan, Mythreyi
Ouyang, Chun
Sindhgatta, Renuka
Moreira, Catarina
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023,
[40] LLMs for the post-hoc creation of provenance
Almuntashiri, Abdullah Hamed
Ibanez, Luis-Daniel
Chapman, Adriane
9TH IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS, EUROS&PW 2024, 2024, : 562 - 566

← 1 2 3 4 5 →