Counterfactual explanations for misclassified images: How human and machine explanations differ

被引：6

作者：

Delaney, Eoin ^{[1
,2
,3
]}

Pakrashi, Arjun ^{[1
,3
]}

Greene, Derek ^{[1
,2
,3
]}

Keane, Mark T. ^{[1
,3
]}

机构：

[1] Univ Coll Dublin, Sch Comp Sci, Dublin, Ireland

[2] Insight Ctr Data Analyt, Dublin, Ireland

[3] VistaMilk SFI Res Ctr, Dublin, Ireland

来源：

ARTIFICIAL INTELLIGENCE | 2023年 / 324卷

基金：

爱尔兰科学基金会;

关键词：

XAI; Counterfactual explanation; User testing; BLACK-BOX;

D O I：

10.1016/j.artint.2023.103995

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Counterfactual explanations have emerged as a popular solution for the eXplainable AI (XAI) problem of elucidating the predictions of black-box deep-learning systems because people easily understand them, they apply across different problem domains and seem to be legally compliant. Although over 100 counterfactual methods exist in the XAI literature, each claiming to generate plausible explanations akin to those preferred by people, few of these methods have actually been tested on users (similar to 7%). Even fewer studies adopt a user-centered perspective; for instance, asking people for their counterfactual explanations to determine their perspective on a "good explanation". This gap in the literature is addressed here using a novel methodology that (i) gathers human-generated counterfactual explanations for misclassified images, in two user studies and, then, (ii) compares these human-generated explanations to computationally-generated explanations for the same misclassifications. Results indicate that humans do not "minimally edit" images when generating counterfactual explanations. Instead, they make larger, "meaningful" edits that better approximate prototypes in the counterfactual class. An analysis based on "explanation goals" is proposed to account for this divergence between human and machine explanations. The implications of these proposals for future work are discussed. (c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons .org /licenses /by /4 .0/).

引用

页数：25

共 50 条

[31] Categorical and Continuous Features in Counterfactual Explanations of AI Systems
Warren, Greta
Byrne, Ruth m. j.
Keane, Mark t.
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2024, 14 (04)
[32] CIRF: Importance of related features for plausible counterfactual explanations
Kim, Hee-Dong
Ju, Yeong-Joon
Hong, Jung-Ho
Lee, Seong-Whan
INFORMATION SCIENCES, 2024, 678
[33] Categorical and Continuous Features in Counterfactual Explanations of AI Systems
Warren, Greta
Byrne, Ruth M. J.
Keane, Mark T.
PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023, 2023, : 171 - 187
[34] Explaining recommendation system using counterfactual textual explanations
Ranjbar, Niloofar
Momtazi, Saeedeh
Homayoonpour, MohammadMehdi
MACHINE LEARNING, 2024, 113 (04) : 1989 - 2012
[35] Predicting Stress and Providing Counterfactual Explanations: A Pilot Study on Caregivers
Shibuya, Kei
King, Zachary D.
Khalid, Maryam
Yu, Han
Shen, Yufei
Zanna, Khadija
Brown, Ryan L.
Majd, Marzieh
Fagunders, Christopher P.
Sano, Akane
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
[36] Evolving Counterfactual Explanations with Particle Swarm Optimization and Differential Evolution
Andersen, Hayden
Lensen, Andrew
Browne, Will N.
Mei, Yi
2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
[37] CETD: Counterfactual Explanations by Considering Temporal Dependencies in Sequential Recommendation
He, Ming
An, Boyang
Wang, Jiwen
Wen, Hao
APPLIED SCIENCES-BASEL, 2023, 13 (20):
[38] Generally-Occurring Model Change for Robust Counterfactual Explanations
Xu, Ao
Wu, Tieru
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IV, 2024, 15019 : 215 - 229
[39] Counterfactual Explanations in Personal Informatics for Personalized Mental Health Management
Jung, Gyuwon
Lee, Uichin
COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 756 - 760
[40] On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations
Albini, Emanuele
Sharma, Shubham
Mishra, Saumitra
Dervovic, Danial
Magazzeni, Daniele
PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 411 - 431

← 1 2 3 4 5 →