An Empirical Comparison of Interpretable Models to Post-Hoc Explanations

被引:2
|
作者
Mahya, Parisa [1 ]
Fuernkranz, Johannes [1 ]
机构
[1] Johannes Kepler Univ Linz, Inst Applicat Oriented Knowledge Proc FAW, A-4040 Linz, Austria
关键词
explainable AI; interpretable machine learning; interpretable models; black-box explanation; white-box models;
D O I
10.3390/ai4020023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, some effort went into explaining intransparent and black-box models, such as deep neural networks or random forests. So-called model-agnostic methods typically approximate the prediction of the intransparent black-box model with an interpretable model, without considering any specifics of the black-box model itself. It is a valid question whether direct learning of interpretable white-box models should not be preferred over post-hoc approximations of intransparent and black-box models. In this paper, we report the results of an empirical study, which compares post-hoc explanations and interpretable models on several datasets for rule-based and feature-based interpretable models. The results seem to underline that often directly learned interpretable models approximate the black-box models at least as well as their post-hoc surrogates, even though the former do not have direct access to the black-box model.
引用
收藏
页码:426 / 436
页数:11
相关论文
共 50 条
  • [31] Ontology-Based Post-Hoc Neural Network Explanations Via Simultaneous Concept Extraction
    Ponomarev, Andrew
    Agafonov, Anton
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 433 - 446
  • [32] How can I choose an explainer? An Application-grounded Evaluation of Post-hoc Explanations
    Jesus, Sergio
    Belem, Catarina
    Balayan, Vladimir
    Bento, Joao
    Saleiro, Pedro
    Bizarro, Pedro
    Gama, Joao
    PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 805 - 815
  • [33] Normalizing trust: Participants' immediately post-hoc explanations of behaviour in Milgram's "obedience' experiments
    Hollander, Matthew M.
    Turowetz, Jason
    BRITISH JOURNAL OF SOCIAL PSYCHOLOGY, 2017, 56 (04) : 655 - 674
  • [34] Generation of empirical offender profiles - Post-hoc classification in the murder of an intimate partner
    Busch, TP
    Scholz, OB
    KRIMINALISTIK, 2001, 55 (8-9): : 549 - 556
  • [35] Exploring post-hoc agnostic models for explainable cooking recipe recommendations
    Yera, Raciel
    Alzahrani, Ahmad A.
    Martinez, Luis
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [36] Is PET solely a post-hoc tool to validate psychological models of memory?
    Decety, J
    COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE III-SCIENCES DE LA VIE-LIFE SCIENCES, 1998, 321 (2-3): : 207 - 208
  • [37] A Post-Hoc Interpretable Ensemble Model to Feature Effect Analysis in Warfarin Dose Prediction for Chinese Patients
    Zhang, Yuzhen
    Xie, Cheng
    Xue, Ling
    Tao, Yanyun
    Yue, Guoqi
    Jiang, Bin
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (02) : 840 - 851
  • [38] POST-HOC, NON-ERGO-PROPTER-HOC
    PEIRICK, J
    IEEE SPECTRUM, 1994, 31 (03) : 6 - 6
  • [39] Through the looking glass: evaluating post hoc explanations using transparent models
    Velmurugan, Mythreyi
    Ouyang, Chun
    Sindhgatta, Renuka
    Moreira, Catarina
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023,
  • [40] LLMs for the post-hoc creation of provenance
    Almuntashiri, Abdullah Hamed
    Ibanez, Luis-Daniel
    Chapman, Adriane
    9TH IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS, EUROS&PW 2024, 2024, : 562 - 566