Hierarchical fusion of visual and physiological signals for emotion recognition

被引:0
作者
Yuchun Fang
Ruru Rong
Jun Huang
机构
[1] Shanghai University,School of Computer Engineering and Science
[2] Chinese Academy of Science,Shanghai Advanced Research Institute
来源
Multidimensional Systems and Signal Processing | 2021年 / 32卷
关键词
Emotion recognition; Facial expression; Electroencephalogram;
D O I
暂无
中图分类号
学科分类号
摘要
Emotion recognition is an attractive and essential topic in image and signal processing. In this paper, we propose a multi-level fusion method to combine visual information and physiological signals for emotion recognition. For visual information, we propose a serial fusion of two-stage features to enhance the representation of facial expression in a video sequence. We propose to integrate the Neural Aggregation Network with Convolutional Neural Network feature map to reinforce the vital emotional frames. For physiological signals, we propose a parallel fusion scheme to widen the band of the annotation of the electroencephalogram signals. We extract the frequency feature with the Linear-Frequency Cepstral Coefficients and enhance it with the signal complexity denoted by Sample Entropy (SampEn). In the classification stage, we realize both feature level and decision level fusion of both visual and physiological information. Experimental results validate the effectiveness of the proposed multi-level multi-modal feature representation method.
引用
收藏
页码:1103 / 1121
页数:18
相关论文
共 102 条
[1]  
Bailenson JN(2008)Real-time classification of evoked emotions using facial feature tracking and physiological responses International Journal of Human-Computer Studies 66 303-317
[2]  
Pontikakis ED(2005)Multiscale entropy analysis of biological signals Physical Review E 71 021906-63
[3]  
Mauss IB(2013)Challenges in representation learning: A report on three machine learning contests Neural Network 64 59-136
[4]  
Gross JJ(2013)Categorical and dimensional affect analysis in continuous input: Current trends and future directions Image and Vision Computing 31 120-178
[5]  
Jabon ME(2010)Event-related beta oscillations are affected by emotional eliciting stimuli Neuroscience Letters 483 173-78
[6]  
Hutcherson CA(2019)Emotion recognition using deep learning approach from audio-visual emotional big data Information Fusion 49 69-124
[7]  
Costa M(2016)Multi-modal emotion analysis from facial expressions and electroencephalogram Computer Vision and Image Understanding 147 114-31
[8]  
Goldberger AL(2019)Combining facial expressions and electroencephalography to enhance emotion recognition Future Internet 11 105-174
[9]  
Peng CK(2011)Deap: A database for emotion analysis; using physiological signals IEEE transactions on affective computing 3 18-155736
[10]  
Goodfellow IJ(2013)Fusion of facial expressions and eeg for implicit affective tagging Image and Vision Computing 31 164-1806