Emotion recognition using multi-modal data and machine learning techniques: A tutorial and review

被引:410
作者
Zhang, Jianhua [1 ]
Yin, Zhong [2 ]
Chen, Peng [3 ]
Nichele, Stefano [1 ]
机构
[1] Oslo Metropolitan Univ, Dept Comp Sci, Oslo, Norway
[2] Univ Shanghai Sci & Technol, Dept Control Sci & Engn, Shanghai, Peoples R China
[3] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Emotion recognition; Affective computing; Physiological signals; Feature dimensionality reduction; Data fusion; Machine learning; Deep learning; EEG SIGNAL CLASSIFICATION; NEURAL-NETWORK; FEATURE-EXTRACTION; FEATURE-SELECTION; MENTAL WORKLOAD; APPROXIMATE ENTROPY; FACIAL EXPRESSIONS; AUTOMATIC-ANALYSIS; SENTIMENT ANALYSIS; MIXTURE MODEL;
D O I
10.1016/j.inffus.2020.01.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the rapid advances in machine learning (ML) and information fusion has made it possible to endow machines/computers with the ability of emotion understanding, recognition, and analysis. Emotion recognition has attracted increasingly intense interest from researchers from diverse fields. Human emotions can be recognized from facial expressions, speech, behavior (gesture/posture) or physiological signals. However, the first three methods can be ineffective since humans may involuntarily or deliberately conceal their real emotions (so-called social masking). The use of physiological signals can lead to more objective and reliable emotion recognition. Compared with peripheral neurophysiological signals, electroencephalogram (EEG) signals respond to fluctuations of affective states more sensitively and in real time and thus can provide useful features of emotional states. Therefore, various EEG-based emotion recognition techniques have been developed recently. In this paper, the emotion recognition methods based on multi-channel EEG signals as well as multi-modal physiological signals are reviewed. According to the standard pipeline for emotion recognition, we review different feature extraction (e.g., wavelet transform and nonlinear dynamics), feature reduction, and ML classifier design methods (e.g., k-nearest neighbor (KNN), naive Bayesian (NB), support vector machine (SVM) and random forest (RF)). Furthermore, the EEG rhythms that are highly correlated with emotions are analyzed and the correlation between different brain areas and emotions is discussed. Finally, we compare different ML and deep learning algorithms for emotion recognition and suggest several open problems and future research directions in this exciting and fast-growing area of AI.
引用
收藏
页码:103 / 126
页数:24
相关论文
共 209 条
[1]   Domain Adversarial for Acoustic Emotion Recognition [J].
Abdelwahab, Mohammed ;
Busso, Carlos .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) :2423-2435
[2]  
Alam F, 2014, ADV HIGH ED PROF DEV, P1, DOI 10.4018/978-1-4666-5011-4
[3]   Emotion Recognition in Never-Seen Languages Using a Novel Ensemble Method with Emotion Profiles [J].
Albornoz, E. M. ;
Milone, D. H. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (01) :43-53
[4]  
Alhagry S, 2017, INT J ADV COMPUT SC, V8, P355, DOI 10.14569/IJACSA.2017.081046
[5]   A real-time automated system for the recognition of human facial expressions [J].
Anderson, K ;
McOwan, PW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2006, 36 (01) :96-105
[6]  
[Anonymous], 2007, ROBUST SPEECH RECOGN
[7]  
[Anonymous], 2007, Proc. Adv. Neural Inf. Process. Syst.
[8]  
[Anonymous], P ENT JUL
[9]  
[Anonymous], 2012, C RES PRACTICE INFRO
[10]  
[Anonymous], 1997, NIMH CTR STUDY EMOTI