Explainable Emotion Decoding for Human and Computer Vision

被引:1
作者
Borriero, Alessio [2 ,4 ]
Milazzo, Martina [1 ]
Diano, Matteo [2 ]
Orsenigo, Davide [2 ,3 ]
Villa, Maria Chiara [2 ,4 ]
DiFazio, Chiara [2 ]
Tamietto, Marco [2 ,5 ]
Perotti, Alan [3 ]
机构
[1] Univ Roma La Sapienza, Rome, Italy
[2] Univ Turin, Turin, Italy
[3] Centai Inst, Turin, Italy
[4] Univ Camerino, Camerino, Italy
[5] Tilburg Univ, Tilburg, Netherlands
来源
EXPLAINABLE ARTIFICIAL INTELLIGENCE, PT II, XAI 2024 | 2024年 / 2154卷
关键词
eXplainable Artificial Intelligence; NeuroImaging; Computer Vision; Emotion Recognition; Neuroscience; ANTERIOR CINGULATE; FUNCTIONAL NEUROANATOMY; PATTERN-ANALYSIS; NEURAL-NETWORKS; FMRI; CORTEX; REPRESENTATION; INFORMATION; CORTICES; FACE;
D O I
10.1007/978-3-031-63797-1_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern Machine Learning (ML) has significantly advanced various research fields, but the opaque nature of ML models hinders their adoption in several domains. Explainable AI (XAI) addresses this challenge by providing additional information to help users understand the internal decision-making process of ML models. In the field of neuroscience, enriching a ML model for brain decoding with attribution-based XAI techniques means being able to highlight which brain areas correlate with the task at hand, thus offering valuable insights to domain experts. In this paper, we analyze human and Computer Vision (CV) systems in parallel, training and explaining two ML models based respectively on functional Magnetic Resonance Imaging (fMRI) and movie frames. We do so by leveraging the "StudyForrest" dataset, which includes functional Magnetic Resonance Imaging (fMRI) scans of subjects watching the "Forrest Gump" movie, emotion annotations, and eye-tracking data. For human vision the ML task is to link fMRI data with emotional annotations, and the explanations highlight the brain regions strongly correlated with the label. On the other hand, for computer vision, the input data is movie frames, and the explanations are pixel-level heatmaps. We cross-analyzed our results, linking human attention (obtained through eye-tracking) with XAI saliency on CV models and brain region activations. We show how a parallel analysis of human and computer vision can provide useful information for both the neuroscience community (allocation theory) and the ML community (biological plausibility of convolutional models).
引用
收藏
页码:178 / 201
页数:24
相关论文
共 73 条
[1]   A systematic survey on multimodal emotion recognition using learning algorithms [J].
Ahmed, Naveed ;
Al Aghbari, Zaher ;
Girija, Shini .
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 17
[2]   Perceived Image Decoding From Brain Activity Using Shared Information of Multi-Subject fMRI Data [J].
Akamatsu, Yusuke ;
Harakawa, Ryosuke ;
Ogawa, Takahiro ;
Haseyama, Miki .
IEEE ACCESS, 2021, 9 :26593-26606
[3]   On testing for spatial correspondence between maps of human brain structure and function [J].
Alexander-Bloch, Aaron F. ;
Shou, Haochang ;
Liu, Siyuan ;
Satterthwaite, Theodore D. ;
Glahn, David C. ;
Shinohara, Russell T. ;
Vandekar, Simon N. ;
Raznahan, Armin .
NEUROIMAGE, 2018, 178 :540-551
[4]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[5]   Analyzing biological and artificial neural networks: challenges with opportunities for synergy? [J].
Barrett, David G. T. ;
Morcos, Ari S. ;
Macke, Jakob H. .
CURRENT OPINION IN NEUROBIOLOGY, 2019, 55 :55-64
[6]   AFFECT AS A PSYCHOLOGICAL PRIMITIVE [J].
Barrett, Lisa Feldman ;
Bliss-Moreau, Eliza .
ADVANCES IN EXPERIMENTAL SOCIAL PSYCHOLOGY, VOL 41, 2009, 41 :167-218
[7]   Decoding the neural representation of affective states [J].
Baucom, Laura B. ;
Wedell, Douglas H. ;
Wang, Jing ;
Blitzer, David N. ;
Shinkareva, Svetlana V. .
NEUROIMAGE, 2012, 59 (01) :718-727
[8]  
Bodria F, 2021, Arxiv, DOI [arXiv:2102.13076, 10.48550/arXiv.2102.13076]
[9]   The Representation of Biological Classes in the Human Brain [J].
Connolly, Andrew C. ;
Guntupalli, J. Swaroop ;
Gors, Jason ;
Hanke, Michael ;
Halchenko, Yaroslav O. ;
Wu, Yu-Chien ;
Abdi, Herve ;
Haxby, James V. .
JOURNAL OF NEUROSCIENCE, 2012, 32 (08) :2608-2618
[10]   The Organization and Operation of Inferior Temporal Cortex [J].
Conway, Bevil R. .
ANNUAL REVIEW OF VISION SCIENCE, VOL 4, 2018, 4 :381-402