A Review of Key Technologies for Emotion Analysis Using Multimodal Information

被引:16
作者
Zhu, Xianxun [1 ]
Guo, Chaopeng [1 ]
Feng, Heyang [1 ]
Huang, Yao [1 ]
Feng, Yichen [1 ]
Wang, Xiangyang [1 ]
Wang, Rui [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, 99 Shangda Rd, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal information; Emotional analysis; Multimodal fusion; ACUTE CORONARY SYNDROMES; SENTIMENT; FUSION; RECOGNITION; EXPRESSIONS; ATTENTION; STRENGTH; TRIGGERS; DATABASE; MODEL;
D O I
10.1007/s12559-024-10287-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion analysis, an integral aspect of human-machine interactions, has witnessed significant advancements in recent years. With the rise of multimodal data sources such as speech, text, and images, there is a profound need for a comprehensive review of pivotal elements within this domain. Our paper delves deep into the realm of emotion analysis, examining multimodal data sources encompassing speech, text, images, and physiological signals. We provide a curated overview of relevant literature, academic forums, and competitions. Emphasis is laid on dissecting unimodal processing methods, including preprocessing, feature extraction, and tools across speech, text, images, and physiological signals. We further discuss the nuances of multimodal data fusion techniques, spotlighting early, late, model, and hybrid fusion strategies. Key findings indicate the essentiality of analyzing emotions across multiple modalities. Detailed discussions on emotion elicitation, expression, and representation models are presented. Moreover, we uncover challenges such as dataset creation, modality synchronization, model efficiency, limited data scenarios, cross-domain applicability, and the handling of missing modalities. Practical solutions and suggestions are provided to address these challenges. The realm of multimodal emotion analysis is vast, with numerous applications ranging from driver sentiment detection to medical evaluations. Our comprehensive review serves as a valuable resource for both scholars and industry professionals. It not only sheds light on the current state of research but also highlights potential directions for future innovations. The insights garnered from this paper are expected to pave the way for subsequent advancements in deep multimodal emotion analysis tailored for real-world deployments.
引用
收藏
页码:1504 / 1530
页数:27
相关论文
共 50 条
[21]   Sentiment and emotion analysis from textual information: A systematic literature review [J].
Bermudez-Sosa, Herbert Jair ;
Olarte-Henao, Jonathan ;
Rojas-Berrio, Sandra .
JOURNAL OF INFORMATION SCIENCE, 2025,
[22]   Deep learning-based late fusion of multimodal information for emotion classification of music video [J].
Pandeya, Yagya Raj ;
Lee, Joonwhoan .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) :2887-2905
[23]   A systematic review of emotion recognition using cardio-based signals [J].
Ismail, Sharifah Noor Masidayu Sayed ;
Aziz, Nor Azlina Ab. ;
Ibrahim, Siti Zainab ;
Mohamad, Mohd Saberi .
ICT EXPRESS, 2024, 10 (01) :156-183
[24]   Multimodal sentiment analysis: A systematic review of history, datasets, multimodal fusion methods, applications, challenges and future directions [J].
Gandhi, Ankita ;
Adhvaryu, Kinjal ;
Poria, Soujanya ;
Cambria, Erik ;
Hussain, Amir .
INFORMATION FUSION, 2023, 91 :424-444
[25]   A Systematic Review on Multimodal Emotion Recognition: Building Blocks, Current State, Applications, and Challenges [J].
Kalateh, Sepideh ;
Estrada-Jimenez, Luis A. ;
Nikghadam-Hojjati, Sanaz ;
Barata, Jose .
IEEE ACCESS, 2024, 12 :103976-104019
[26]   Joint multimodal sentiment analysis based on information relevance [J].
Chen, Danlei ;
Su, Wang ;
Wu, Peng ;
Hua, Bolin .
INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
[27]   A two-stage multimodal emotion analysis using body actions and facial features [J].
Tseng, Hsiao-Ting ;
Hsieh, Chen-Chiung ;
Xu, Cheng-Hong .
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (04)
[28]   Joint Multi-Scale Multimodal Transformer for Emotion Using Consumer Devices [J].
Khan, Mustaqeem ;
Ahmad, Jamil ;
Gueaieb, Wail ;
De Masi, Giulia ;
Karray, Fakhri ;
El Saddik, Abdulmotaleb .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2025, 71 (01) :1092-1101
[29]   Enhancing emotion recognition using multimodal fusion of physiological, environmental, personal data [J].
Kim, Hakpyeong ;
Hong, Taehoon .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
[30]   MEAS: Multimodal Emotion Analysis System for Short Videos on Social Media Platforms [J].
Wei, Qinglan ;
Zhou, Yaqi ;
Xiang, Shenlian ;
Xiao, Longhui ;
Zhang, Yuan .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,