Counterfactual explanations for misclassified images: How human and machine explanations differ

被引:6
作者
Delaney, Eoin [1 ,2 ,3 ]
Pakrashi, Arjun [1 ,3 ]
Greene, Derek [1 ,2 ,3 ]
Keane, Mark T. [1 ,3 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci, Dublin, Ireland
[2] Insight Ctr Data Analyt, Dublin, Ireland
[3] VistaMilk SFI Res Ctr, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
XAI; Counterfactual explanation; User testing; BLACK-BOX;
D O I
10.1016/j.artint.2023.103995
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Counterfactual explanations have emerged as a popular solution for the eXplainable AI (XAI) problem of elucidating the predictions of black-box deep-learning systems because people easily understand them, they apply across different problem domains and seem to be legally compliant. Although over 100 counterfactual methods exist in the XAI literature, each claiming to generate plausible explanations akin to those preferred by people, few of these methods have actually been tested on users (similar to 7%). Even fewer studies adopt a user-centered perspective; for instance, asking people for their counterfactual explanations to determine their perspective on a "good explanation". This gap in the literature is addressed here using a novel methodology that (i) gathers human-generated counterfactual explanations for misclassified images, in two user studies and, then, (ii) compares these human-generated explanations to computationally-generated explanations for the same misclassifications. Results indicate that humans do not "minimally edit" images when generating counterfactual explanations. Instead, they make larger, "meaningful" edits that better approximate prototypes in the counterfactual class. An analysis based on "explanation goals" is proposed to account for this divergence between human and machine explanations. The implications of these proposals for future work are discussed. (c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons .org /licenses /by /4 .0/).
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Categorical and Continuous Features in Counterfactual Explanations of AI Systems
    Warren, Greta
    Byrne, Ruth m. j.
    Keane, Mark t.
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2024, 14 (04)
  • [32] CIRF: Importance of related features for plausible counterfactual explanations
    Kim, Hee-Dong
    Ju, Yeong-Joon
    Hong, Jung-Ho
    Lee, Seong-Whan
    INFORMATION SCIENCES, 2024, 678
  • [33] Categorical and Continuous Features in Counterfactual Explanations of AI Systems
    Warren, Greta
    Byrne, Ruth M. J.
    Keane, Mark T.
    PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023, 2023, : 171 - 187
  • [34] Explaining recommendation system using counterfactual textual explanations
    Ranjbar, Niloofar
    Momtazi, Saeedeh
    Homayoonpour, MohammadMehdi
    MACHINE LEARNING, 2024, 113 (04) : 1989 - 2012
  • [35] Predicting Stress and Providing Counterfactual Explanations: A Pilot Study on Caregivers
    Shibuya, Kei
    King, Zachary D.
    Khalid, Maryam
    Yu, Han
    Shen, Yufei
    Zanna, Khadija
    Brown, Ryan L.
    Majd, Marzieh
    Fagunders, Christopher P.
    Sano, Akane
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
  • [36] Evolving Counterfactual Explanations with Particle Swarm Optimization and Differential Evolution
    Andersen, Hayden
    Lensen, Andrew
    Browne, Will N.
    Mei, Yi
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [37] CETD: Counterfactual Explanations by Considering Temporal Dependencies in Sequential Recommendation
    He, Ming
    An, Boyang
    Wang, Jiwen
    Wen, Hao
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [38] Generally-Occurring Model Change for Robust Counterfactual Explanations
    Xu, Ao
    Wu, Tieru
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IV, 2024, 15019 : 215 - 229
  • [39] Counterfactual Explanations in Personal Informatics for Personalized Mental Health Management
    Jung, Gyuwon
    Lee, Uichin
    COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 756 - 760
  • [40] On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations
    Albini, Emanuele
    Sharma, Shubham
    Mishra, Saumitra
    Dervovic, Danial
    Magazzeni, Daniele
    PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 411 - 431