Emotion Recognition from Face Images in an Unconstrained Environment for usage on Social Robots

被引:14
作者
Webb, Nicola [1 ]
Ruiz-Garcia, Ariel [2 ]
Elshaw, Mark [2 ]
Palade, Vasile [3 ]
机构
[1] Univ West England, Bristol Robot Lab, Coventry, W Midlands, England
[2] Coventry Univ, Comp Elect & Math, Coventry, W Midlands, England
[3] Coventry Univ, Ctr Data Sci, Coventry, W Midlands, England
来源
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2020年
关键词
Stacked Convolutional Autoencoders; Greedy Layer-Wise Training; Deep Learning; Emotion Recognition; Social Robotics;
D O I
10.1109/ijcnn48605.2020.9207494
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have proven to be efficient systems for learning complex data representations. However, one of their main constraints is their inability to deal with changes in the data distribution. For instance, in real-time facial expression recognition, the data used to evaluate a model commonly differs in quality compared to that used to train the model, leading to poor generalization performance. In this work we propose a novel Deep Convolutional Neural Network (CNN) architecture pre-trained as a Stacked Convolutional Autoencoder (SCAE) to address emotion recognition in unconstrained environments. The SCAE is trained in a greedy layer-wise unsupervised fashion, and combines convolutional and fully connected layers and learns to encode facial expression images as an illumination and facial pose invariant feature vector. The CNN offers state-of-the-art classification rate of 99.52% on a combined corpus of gamma corrected version of the CK+, JAFFE, FEEDTUM and KDEF datasets. When evaluated on unseen data obtained in unconstrained environments, our approach achieves 79.75%, an increase of over 28% compared to a CNN without our pretraining approach, supporting the methodology proposed in this work.
引用
收藏
页数:8
相关论文
共 30 条
[1]  
Ahn B., 2018, DEV EVALUATION HUMAN
[2]  
[Anonymous], 2013, COMBINING MODALITY S
[3]  
[Anonymous], 2008, Introduction: Caribbean Migrations to Western Europe and the United States: Essays on Incorporation, Identity, and Citizenship, DOI DOI 10.1109/CVPR.2008.4587369
[4]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[5]  
Burkert P., 2015, DEXPRESSION DEEP CON
[6]  
Ca P.V., 2010, J MACH LEARN RES, V11, P3371
[7]   Pose-and-illumination-invariant face representation via a triplet-loss trained deep reconstruction model [J].
Chen, Xingyu ;
Lan, Xuguang ;
Liang, Guoqiang ;
Liu, Jianyi ;
Zheng, Nanning .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (21) :22043-22058
[8]   Face recognition using Histograms of Oriented Gradients [J].
Deniz, O. ;
Bueno, G. ;
Salido, J. ;
De la Torre, F. .
PATTERN RECOGNITION LETTERS, 2011, 32 (12) :1598-1603
[9]  
Elissa K., 2009, SOCIALLY ASSISTIVE R
[10]   Blind inverse gamma correction [J].
Farid, H .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (10) :1428-1433