Learning to Explain: A Model-Agnostic Framework for Explaining Black Box Models

Cited by: 2

Authors:

Barkan, Oren [1]

Asher, Yuval [2]

Eshel, Amit [2]

Elisha, Yehonatan [1]

Koenigstein, Noam [2]

Affiliations:

[1] Open Univ, Milton Keynes, England

[2] Tel Aviv Univ, Tel Aviv, Israel

Source:

23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, ICDM 2023 | 2023

Funding:

Israel Science Foundation;

Keywords:

Explainable AI; computer vision; transformers;

DOI:

10.1109/ICDM58522.2023.00105

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present Learning to Explain (LTX), a model-agnostic framework designed for providing post-hoc explanations for vision models. The LTX framework introduces an "explainer" model that generates explanation maps, highlighting the crucial regions that justify the predictions made by the model being explained. To train the explainer, we employ a two-stage process consisting of initial pretraining followed by per-instance finetuning. During both stages of training, we utilize a unique configuration where we compare the explained model's prediction for a masked input with its original prediction for the unmasked input. This approach enables the use of a novel counterfactual objective, which aims to anticipate the model's output using masked versions of the input image. Importantly, the LTX framework is not restricted to a specific model architecture and can provide explanations for both Transformer-based and convolutional models. Through our evaluations, we demonstrate that LTX significantly outperforms the current state-of-the-art in explainability across various metrics. Our code is available at: https://github.com/LTX-Code/LTX
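
The comparison described in the abstract, between the explained model's prediction on a masked input and its prediction on the unmasked input, can be illustrated with a small sketch. This is not the authors' implementation, and the abstract does not state the actual loss. The linear softmax classifier `toy_model`, the function `masked_prediction_loss`, the `sparsity_weight` parameter, and the toy data are all hypothetical stand-ins chosen for illustration.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D array of logits."""
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()


def toy_model(x, W):
    """Hypothetical stand-in for the frozen model being explained."""
    return softmax(W @ x)


def masked_prediction_loss(x, mask, W, sparsity_weight=0.1):
    """Illustrative objective (an assumption, not the loss from the paper).

    The fidelity term is low when the masked input still yields the class
    predicted on the unmasked input; the sparsity term prefers small masks.
    """
    original_probs = toy_model(x, W)
    target = int(np.argmax(original_probs))   # class predicted on the unmasked input
    masked_probs = toy_model(mask * x, W)     # prediction on the masked input
    fidelity = -np.log(masked_probs[target] + 1e-12)
    sparsity = sparsity_weight * mask.mean()
    return fidelity + sparsity


# Toy setup: feature 0 drives class 0, features 1-3 drive class 1.
W = np.array([[2.0, 0.0, 0.0, 0.0],
              [0.0, 0.5, 0.5, 0.5]])
x = np.array([3.0, 1.0, 1.0, 1.0])            # the unmasked input is predicted as class 0

keep_relevant = np.array([1.0, 0.0, 0.0, 0.0])    # keeps only the decisive feature
keep_irrelevant = np.array([0.0, 1.0, 1.0, 1.0])  # drops the decisive feature

loss_relevant = masked_prediction_loss(x, keep_relevant, W)
loss_irrelevant = masked_prediction_loss(x, keep_irrelevant, W)
print(loss_relevant < loss_irrelevant)
```

In this toy setting, the mask that keeps the decisive feature scores a lower loss than the mask that removes it. An explainer trained against an objective of this general shape would be pushed toward masks that preserve the original prediction. The sketch does not cover the two training stages (pretraining and per-instance finetuning) mentioned in the abstract.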


Pages: 944-949

Page count: 6

