Learning to Explain: A Model -Agnostic Framework for Explaining Black Box Models

被引:2
|
作者
Barkan, Oren [1 ]
Asher, Yuval [2 ]
Eshel, Amit [2 ]
Elisha, Yelionatan [1 ]
Koenigstein, Noam [2 ]
机构
[1] Open Univ, Milton Keynes, England
[2] Tel Aviv Univ, Tel Aviv, Israel
来源
23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023 | 2023年
基金
以色列科学基金会;
关键词
Explainable AI; computer vision; transformers;
D O I
10.1109/ICDM58522.2023.00105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Learning to Explain (LTX), a model-agnostic framework designed for providing post -hoc explanations for vision models. The LTX framework introduces an "explainer" model that generates explanation maps, highlighting the crucial regions that justify the predictions made by the model being explained. To train the explainer, we employ a two -stage process consisting of initial pretraining followed by per-instance finetuning. During both stages of training, we utilize a unique configuration where we compare the explained model's prediction for a masked input with its original prediction for the unmasked input. This approach enables the use of a novel counterfactual objective, which aims to anticipate the model's output using masked versions of the input image. Importantly, the LTX framework is not restricted to a specific model architecture and can provide explanations for both Transformer-based and convolutional models. Through our evaluations, we demonstrate that LTX significantly outperforms the current state-of-the-art in explainability across various metrics. Our code is available at: https://githab.cian/LTX-CodelLTX
引用
收藏
页码:944 / 949
页数:6
相关论文
共 50 条
  • [41] Decision-making framework with double-loop learning through interpretable black-box machine learning models
    Bohanec, Marko
    Robnik-Sikonja, Marko
    Borstnar, Mirjana Kljajic
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2017, 117 (07) : 1389 - 1406
  • [42] Inside the Black Box: Exploring Mental Models in the Learning Environment
    Smith, Linda J.
    IMSCI 10: 4TH INTERNATIONAL MULTI-CONFERENCE ON SOCIETY, CYBERNETICS AND INFORMATICS, VOL I, 2010, : 189 - 194
  • [43] Personalized Federated Semi -Supervised Learning with Black -Box Models
    Huang, Siyin
    Li, Shao-Yuan
    Chen, Songcan
    23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023, 2023, : 1061 - 1066
  • [44] Drift Detection for Black-Box Deep Learning Models
    Piano, Luca
    Garcea, Fabio
    Cavallone, Andrea
    Vazquez, Ignacio Aparicio
    Morra, Lia
    Lamberti, Fabrizio
    IT PROFESSIONAL, 2024, 26 (02) : 24 - 31
  • [45] Explainable Debugger for Black-box Machine Learning Models
    Rasouli, Peyman
    Yu, Ingrid Chieh
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [46] Learning outside the Black-Box: The pursuit of interpretable models
    Crabbe, Jonathan
    Zhang, Yao
    Zame, William R.
    van der Schaar, Mihaela
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [47] Polishing the black box: flexible model-based partitioning surrogate models for interpretable machine learning model
    Khasawneh, Tariq
    Azzeh, Mohammad
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [48] Explain the Explainer: Interpreting Model-Agnostic Counterfactual Explanations of a Deep Reinforcement Learning Agent
    Chen Z.
    Silvestri F.
    Tolomei G.
    Wang J.
    Zhu H.
    Ahn H.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (04): : 1443 - 1457
  • [49] Evaluation of the factors explaining the use of agricultural land: A machine learning and model-agnostic approach
    Viana, Claudia M.
    Santos, Mauricio
    Freire, Dulce
    Abrantes, Patricia
    Rocha, Jorge
    ECOLOGICAL INDICATORS, 2021, 131
  • [50] Learning in a black box
    Nax, Heinrich H.
    Burton-Chellew, Maxwell N.
    West, Stuart A.
    Young, H. Peyton
    JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2016, 127 : 1 - 15