A Review of Key Technologies for Emotion Analysis Using Multimodal Information

被引:16
作者
Zhu, Xianxun [1 ]
Guo, Chaopeng [1 ]
Feng, Heyang [1 ]
Huang, Yao [1 ]
Feng, Yichen [1 ]
Wang, Xiangyang [1 ]
Wang, Rui [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, 99 Shangda Rd, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal information; Emotional analysis; Multimodal fusion; ACUTE CORONARY SYNDROMES; SENTIMENT; FUSION; RECOGNITION; EXPRESSIONS; ATTENTION; STRENGTH; TRIGGERS; DATABASE; MODEL;
D O I
10.1007/s12559-024-10287-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion analysis, an integral aspect of human-machine interactions, has witnessed significant advancements in recent years. With the rise of multimodal data sources such as speech, text, and images, there is a profound need for a comprehensive review of pivotal elements within this domain. Our paper delves deep into the realm of emotion analysis, examining multimodal data sources encompassing speech, text, images, and physiological signals. We provide a curated overview of relevant literature, academic forums, and competitions. Emphasis is laid on dissecting unimodal processing methods, including preprocessing, feature extraction, and tools across speech, text, images, and physiological signals. We further discuss the nuances of multimodal data fusion techniques, spotlighting early, late, model, and hybrid fusion strategies. Key findings indicate the essentiality of analyzing emotions across multiple modalities. Detailed discussions on emotion elicitation, expression, and representation models are presented. Moreover, we uncover challenges such as dataset creation, modality synchronization, model efficiency, limited data scenarios, cross-domain applicability, and the handling of missing modalities. Practical solutions and suggestions are provided to address these challenges. The realm of multimodal emotion analysis is vast, with numerous applications ranging from driver sentiment detection to medical evaluations. Our comprehensive review serves as a valuable resource for both scholars and industry professionals. It not only sheds light on the current state of research but also highlights potential directions for future innovations. The insights garnered from this paper are expected to pave the way for subsequent advancements in deep multimodal emotion analysis tailored for real-world deployments.
引用
收藏
页码:1504 / 1530
页数:27
相关论文
共 50 条
[31]   A Review of Emotion Recognition Using Physiological Signals [J].
Shu, Lin ;
Xie, Jinyan ;
Yang, Mingyue ;
Li, Ziyi ;
Li, Zhenqi ;
Liao, Dan ;
Xu, Xiangmin ;
Yang, Xinyi .
SENSORS, 2018, 18 (07)
[32]   A review on EEG-based multimodal learning for emotion recognition [J].
Pillalamarri, Rajasekhar ;
Shanmugam, Udhayakumar .
ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (05)
[33]   The first look: a biometric analysis of emotion recognition using key facial features [J].
Gonzalez-Acosta, Ana M. S. ;
Vargas-Trevino, Marciano ;
Batres-Mendoza, Patricia ;
Guerra-Hernandez, Erick I. ;
Gutierrez-Gutierrez, Jaime ;
Cano-Perez, Jose L. ;
Solis-Arrazola, Manuel A. ;
Rostro-Gonzalez, Horacio .
FRONTIERS IN COMPUTER SCIENCE, 2025, 7
[34]   Multimodal Emotion Analysis of Robotic Assistance in Elderly Care [J].
Demaeght, Annebeth ;
Miclau, Christina ;
Hartmann, Julia ;
Markwardt, Janina ;
Korn, Oliver .
PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2022, 2022, :230-236
[35]   An algorithm for image recognition and analysis using neural information technologies [J].
Gafurov, O.M. ;
Syryamkin, V.I. ;
Gafurov, A.O. ;
Stolyarova, S.S. .
Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika), 2012, 71 (17) :1565-1574
[36]   Information Retrieval and Recommendation using Emotion from Speech Signals [J].
Iliev, Alexander ;
Stanchev, Peter L. .
IEEE 1ST CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2018), 2018, :222-225
[37]   A Hybrid Multimodal Emotion Recognition Framework for UX Evaluation Using Generalized Mixture Functions [J].
Razzaq, Muhammad Asif ;
Hussain, Jamil ;
Bang, Jaehun ;
Hua, Cam-Hao ;
Satti, Fahad Ahmed ;
Rehman, Ubaid Ur ;
Bilal, Hafiz Syed Muhammad ;
Kim, Seong Tae ;
Lee, Sungyoung .
SENSORS, 2023, 23 (09)
[38]   Expression-EEG Based Collaborative Multimodal Emotion Recognition Using Deep AutoEncoder [J].
Zhang, Hongli .
IEEE ACCESS, 2020, 8 :164130-164143
[39]   Attention-based multimodal sentiment analysis and emotion recognition using deep neural networks [J].
Aslam, Ajwa ;
Sargano, Allah Bux ;
Habib, Zulfiqar .
APPLIED SOFT COMPUTING, 2023, 144
[40]   Multimodal Information-Based Broad and Deep Learning Model for Emotion Understanding [J].
Li, Min ;
Chen, Luefeng ;
Wu, Min ;
Pedrycz, Witold ;
Hirota, Kaoru .
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, :7410-7414