A novel signal channel attention network for multi-modal emotion recognition

Cited by: 1
Authors
Du, Ziang [1 ]
Ye, Xia [1 ]
Zhao, Pujie [1 ]
Affiliations
[1] Xian Res Inst High Tech, Xian, Shaanxi, Peoples R China
Source
FRONTIERS IN NEUROROBOTICS, 2024, Vol. 18
Keywords
hypercomplex neural networks; physiological signals; attention fusion module; multi-modal fusion; emotion recognition;
DOI
10.3389/fnbot.2024.1442080
CLC Classification Number
TP18 [Theory of Artificial Intelligence]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Physiological signal recognition is crucial in emotion recognition, and recent advancements in multi-modal fusion have enabled the integration of various physiological signals for improved recognition tasks. However, current models for emotion recognition with hypercomplex multi-modal signals face limitations due to their fusion methods and insufficient attention mechanisms, preventing further gains in classification performance. To address these challenges, we propose a new model framework named Signal Channel Attention Network (SCA-Net), which comprises three main components: an encoder, an attention fusion module, and a decoder. In the attention fusion module, we developed five types of attention mechanisms inspired by existing research and performed comparative experiments on the public dataset MAHNOB-HCI. These experiments demonstrate that the attention modules we designed for our baseline model improve both accuracy and F1 score. We also conducted ablation experiments within the most effective attention fusion module to verify the benefits of multi-modal fusion. Additionally, we adjusted the training process for the different attention fusion modules by employing varying early-stopping parameters to prevent overfitting.
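The abstract does not specify the internals of SCA-Net's attention fusion module. As a rough, assumption-laden sketch of what channel attention over multi-modal physiological signals can look like, the following applies a squeeze-and-excitation-style gating per modality and concatenates the reweighted channels; all weight shapes and the two example modalities (EEG, peripheral ECG) are hypothetical, not taken from the paper.

```python
import numpy as np

def channel_attention(features, w1, w2):
    """SE-style channel attention: squeeze (global average pool over time),
    excite (two-layer bottleneck with ReLU + sigmoid), then rescale channels.
    features: (channels, length) array of one modality's signal features.
    w1, w2: bottleneck weight matrices (hypothetical; learned in practice)."""
    squeeze = features.mean(axis=1)                    # (C,) per-channel summary
    hidden = np.maximum(0.0, w1 @ squeeze)             # ReLU bottleneck, (C//r,)
    weights = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))     # sigmoid gate in (0, 1), (C,)
    return features * weights[:, None]                 # reweight each channel

def fuse_modalities(modality_feats, params):
    """Apply per-modality channel attention, then concatenate along channels."""
    attended = [channel_attention(f, w1, w2)
                for f, (w1, w2) in zip(modality_feats, params)]
    return np.concatenate(attended, axis=0)

rng = np.random.default_rng(0)
eeg = rng.standard_normal((8, 128))   # e.g. 8 EEG channels x 128 time steps
ecg = rng.standard_normal((2, 128))   # e.g. 2 peripheral channels
params = [(rng.standard_normal((4, 8)), rng.standard_normal((8, 4))),
          (rng.standard_normal((1, 2)), rng.standard_normal((2, 1)))]
fused = fuse_modalities([eeg, ecg], params)
print(fused.shape)  # (10, 128)
```

The sigmoid gate keeps every attention weight in (0, 1), so each channel is attenuated rather than amplified; a real model would learn these gates jointly with the encoder and decoder.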
Pages: 11
Related Papers (50 records)
  • [1] ATTENTION DRIVEN FUSION FOR MULTI-MODAL EMOTION RECOGNITION
    Priyasad, Darshana
    Fernando, Tharindu
    Denman, Simon
    Sridharan, Sridha
    Fookes, Clinton
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3227 - 3231
  • [2] Semantic Alignment Network for Multi-Modal Emotion Recognition
    Hou, Mixiao
    Zhang, Zheng
    Liu, Chang
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5318 - 5329
  • [3] Multi-modal Correlated Network for emotion recognition in speech
    Ren, Minjie
    Nie, Weizhi
    Liu, Anan
    Su, Yuting
    VISUAL INFORMATICS, 2019, 3 (03) : 150 - 155
  • [4] Structure Aware Multi-Graph Network for Multi-Modal Emotion Recognition in Conversations
    Zhang, Duzhen
    Chen, Feilong
    Chang, Jianlong
    Chen, Xiuyi
    Tian, Qi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3987 - 3997
  • [5] Multi-modal fusion network with complementarity and importance for emotion recognition
    Liu, Shuai
    Gao, Peng
    Li, Yating
    Fu, Weina
    Ding, Weiping
    INFORMATION SCIENCES, 2023, 619 : 679 - 694
  • [6] DeepVANet: A Deep End-to-End Network for Multi-modal Emotion Recognition
    Zhang, Yuhao
    Hossain, Md Zakir
    Rahman, Shafin
    HUMAN-COMPUTER INTERACTION, INTERACT 2021, PT III, 2021, 12934 : 227 - 237
  • [7] Multi-Modal Emotion Recognition Combining Face Image and EEG Signal
    Hu, Ying
    Wang, Feng
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (07)
  • [8] IS CROSS-ATTENTION PREFERABLE TO SELF-ATTENTION FOR MULTI-MODAL EMOTION RECOGNITION?
    Rajan, Vandana
    Brutti, Alessio
    Cavallaro, Andrea
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4693 - 4697
  • [9] Multi-Modal Emotion Recognition Fusing Video and Audio
    Xu, Chao
    Du, Pufeng
    Feng, Zhiyong
    Meng, Zhaopeng
    Cao, Tianyi
    Dong, Caichao
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 455 - 462
  • [10] Emotion recognition with multi-modal peripheral physiological signals
    Gohumpu, Jennifer
    Xue, Mengru
    Bao, Yanchi
    FRONTIERS IN COMPUTER SCIENCE, 2023, 5