A systematic review of trimodal affective computing approaches: Text, audio, and visual integration in emotion recognition and sentiment analysis

被引:12
作者
Al-Saadawi, Hussein Farooq Tayeb [1 ]
Das, Bihter [1 ]
Das, Resul [1 ]
机构
[1] Firat Univ, Fac Technol, Dept Software Engn, TR-23119 Elazig, Turkiye
关键词
Multi-modal emotion recognition; Trimodal affective analysis; Multi-modal sentiment analysis; Multi-modal fusion; OF-THE-ART; INFORMATION FUSION; EXTRACTION;
D O I
10.1016/j.eswa.2024.124852
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At the heart of affective computing lies the crucial task of decoding human emotions, a field that expertly intertwines emotion identification with the nuances of sentiment analysis. This dynamic discipline harnesses an array of data sources, from the intricacies of textual information to the subtleties of auditory signals and the dynamic realm of visual cues. One of its primary challenges is discerning emotions from physical cues like facial expressions and vocal tones, especially when these emotions are subtly concealed. The precise information yielded by physiological signals is invaluable, yet the complexity of their acquisition in real-world settings remains a formidable challenge. Our comprehensive systematic review marks a significant foray into trimodal affective computing, integrating text, audio, and visual data to provide a holistic understanding. We analyzed over 410 research articles from prominent conferences and journals spanning the last two decades. This extensive study categorizes and critically evaluates a spectrum of affect recognition methods, from unimodal to multimodal approaches, including bimodal and trimodal, offering profound insights into their structural composition and practical effectiveness. In concluding our exploration, we highlight the pivotal aspects of affective computing and chart a course for future groundbreaking research. This includes refining data integration techniques, overcoming challenges in emotion recognition, and addressing the critical ethical dimensions inherent in this field.
引用
收藏
页数:23
相关论文
共 139 条
[1]   Lightweight Micro-Expression Recognition on Composite Database [J].
Ab Razak, Nur Aishah ;
Sahran, Shahnorbanun .
APPLIED SCIENCES-BASEL, 2023, 13 (03)
[2]  
Adesola F., 2023, 2023 INT C SCI ENG B, P1, DOI [10.1109/SEB-SDG57117.2023.10124472, DOI 10.1109/SEB-SDG57117.2023.10124472]
[3]   An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition [J].
Ahmed, Md. Rayhan ;
Islam, Salekul ;
Islam, A. K. M. Muzahidul ;
Shatabda, Swakkhar .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 218
[4]   A systematic survey on multimodal emotion recognition using learning algorithms [J].
Ahmed, Naveed ;
Al Aghbari, Zaher ;
Girija, Shini .
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 17
[5]   Systematic review of 3D facial expression recognition methods [J].
Alexandre, Gilderlane Ribeiro ;
Soares, Jose Marques ;
Pereira The, George Andre .
PATTERN RECOGNITION, 2020, 100
[6]  
Alsaadawi H., 2023, Balkan Journal of Electrical and Computer Engineering (BAJECE), V11
[7]   Human-Computer Interaction with a Real-Time Speech Emotion Recognition with Ensembling Techniques 1D Convolution Neural Network and Attention [J].
Alsabhan, Waleed .
SENSORS, 2023, 23 (03)
[8]   Multi-label emotion classification in texts using transfer learning [J].
Ameer, Iqra ;
Bolucu, Necva ;
Siddiqui, Muhammad Hammad Fahim ;
Can, Burcu ;
Sidorov, Grigori ;
Gelbukh, Alexander .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
[9]  
Amiriparian Shahin, 2024, The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition, DOI [10.13140/RG.2.2.31167.32166, DOI 10.13140/RG.2.2.31167.32166]
[10]   Recognizing Semi-Natural and Spontaneous Speech Emotions Using Deep Neural Networks [J].
Amjad, Ammar ;
Khan, Lal ;
Ashraf, Noman ;
Mahmood, Muhammad Bilal ;
Chang, Hsien-Tsung .
IEEE ACCESS, 2022, 10 :37149-37163