Bimodal Emotion Recognition Based on Convolutional Neural Network

被引:3
作者
Chen, Mengmeng [1 ]
Jiang, Lifen [1 ]
Ma, Chunmei [1 ]
Sun, Huazhi [1 ]
机构
[1] Tianjin Normal Univ, 393,Binshui West Rd, Tianjin, Peoples R China
来源
ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING | 2019年
关键词
Emotion recognition; bimodal; D-CNN; convolutional neural network;
D O I
10.1145/3318299.3318347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computer emotion recognition plays an important role in the field of artificial intelligence and is a key technology to realize human machine interaction. Aiming at a cross-modal fusion problem of two nonlinear features of facial expression image and speech emotion, a bimodal fusion emotion recognition model (D-CNN) based on convolutional neural network is proposed. Firstly, a fine-grained feature extraction method based on convolutional neural network is proposed. Secondly, in order to obtain joint features representation, a feature fusion method based on the fine-grained features of bimodal is proposed. Finally, in order to verify the performance of the D-CNN model, experiments were conducted on the open source dataset eNTERFACE'05. The experimental results show that the multi-modal emotion recognition model D-CNN is more than 10% higher than the single emotion recognition model of speech and facial expression respectively. In addition, compared with the other commonly used bimodal emotion recognition methods(such as universal background model), the recognition rete of D-CNN is increased by 5%.
引用
收藏
页码:178 / 181
页数:4
相关论文
共 17 条
[1]  
Ajay S G, 2017, EXPLORING SIGNIFICAN
[2]   Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution [J].
Barsoum, Emad ;
Zhang, Cha ;
Ferrer, Cristian Canton ;
Zhang, Zhengyou .
ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, :279-283
[3]  
Chaudhari P. A. A., NEED BOOSTED GMM SPE, V3, P1
[4]  
CHEN Peng-zhan, 2017, J E CHINA JIAOTONG U, V34, P100
[5]   Editors' Introduction [J].
Gupta, Pamila ;
Lee, Christopher J. ;
Moorman, Marissa J. ;
Shukla, Sandhya .
RADICAL HISTORY REVIEW, 2018, (131) :1-12
[6]  
Hasani B., FACIAL EXPRESSION RE, P30
[7]   Speech Emotion Recognition Using CNN [J].
Huang, Zhengwei ;
Dong, Ming ;
Mao, Qirong ;
Zhan, Yongzhao .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :801-804
[8]   EEG-Based Emotion Recognition Using Deep Learning Network with Principal Component Based Covariate Shift Adaptation [J].
Jirayucharoensak, Suwicha ;
Pan-Ngum, Setha ;
Israsena, Pasin .
SCIENTIFIC WORLD JOURNAL, 2014,
[9]  
Lim WM, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), P1, DOI 10.1109/IEEM.2016.7797824
[10]   Multimodal information fusion application to human emotion recognition from face and speech [J].
Mansoorizadeh, Muharram ;
Charkari, Nasrollah Moghaddam .
MULTIMEDIA TOOLS AND APPLICATIONS, 2010, 49 (02) :277-297