Explaining Image Classifiers with Visual Debates

Authors
Kori, Avinash [1]
Glocker, Ben [1]
Toni, Francesca [1]
Affiliations
[1] Imperial College London, London, England
Keywords
Explainability; Debates; Image Classification
DOI
10.1007/978-3-031-78980-9_13
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current deep learning-based models for image classification are effective at making decisions, but their lack of transparency can be a significant concern in high-stakes settings. To address this issue, many state-of-the-art methods define (post-hoc) explanations as visual heatmaps or image segments deemed responsible for the classifiers' outputs. However, the static nature of these explanations often fails to align with human explanatory practices. To obtain human-oriented explanations, we propose an alternative, novel form of dialogue-based interactive explanations for image classifiers: visual debates between two fictional players who interact to argue for and against the classifiers' outputs. Specifically, in our method, the players propose arguments, which are (abstract) features drawn from the classifiers' latent representations, and each argument is countered by the opposing player. We present a realization of visual debates based on quantization for extracting arguments, recurrent networks for modelling player behaviour, and network dissection for argument visualization. Experimentally, we show that our visual debates satisfy the desiderata of dialecticity, convergence, and faithfulness.
Pages: 200-214
Page count: 15
Related papers
  • [21] Visual Probing: Cognitive Framework for Explaining Self-Supervised Image Representations
    Oleszkiewicz, Witold
    Basaj, Dominika
    Sieradzki, Igor
    Gorszczak, Michal
    Rychalska, Barbara
    Lewandowska, Koryna
    Trzcinski, Tomasz
    Zielinski, Bartosz
    IEEE ACCESS, 2023, 11 : 13028 - 13043
  • [22] Selection of image classifiers
    Giacinto, G
    Roli, F
    Fumera, G
    ELECTRONICS LETTERS, 2000, 36 (05) : 420 - 422
  • [23] Beyond visual features: A weak semantic image representation using exemplar classifiers for classification
    Zhang, Chunjie
    Liu, Jing
    Tian, Qi
    Liang, Chao
    Huang, Qingming
    NEUROCOMPUTING, 2013, 120 : 318 - 324
  • [24] Explaining Graph Classifiers by Unsupervised Node Relevance Attribution
    Fontanesi, Michele
    Micheli, Alessio
    Podda, Marco
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, PT II, XAI 2024, 2024, 2154 : 63 - 74
  • [25] Explaining Classifiers using Adversarial Perturbations on the Perceptual Ball
    Elliott, Andrew
    Law, Stephen
    Russell, Chris
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 10688 - 10697
  • [26] Explaining the Success of AdaBoost and Random Forests as Interpolating Classifiers
    Wyner, Abraham J.
    Olson, Matthew
    Bleich, Justin
    Mease, David
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18 : 1 - 33
  • [27] Explaining black-box classifiers: Properties and functions
    Amgoud, Leila
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 155 : 40 - 65
  • [29] A Novel Local Ablation Approach for Explaining Multimodal Classifiers
    Ellis, Charles A.
    Zhang, Rongen
    Calhoun, Vince D.
    Carbajal, Darwin A.
    Sendi, Mohammad S. E.
    Wang, May D.
    Miller, Robyn L.
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (IEEE BIBE 2021), 2021
  • [30] Visual exploration of an ensemble of classifiers
    Ribeiro, Paula Ceccon
    Schardong, Guilherme G.
    Barbosa, Simone D. J.
    de Souza, Clarisse Sieckenius
    Lopes, Helio
    COMPUTERS & GRAPHICS-UK, 2019, 85 : 23 - 41