Deep peak-neutral difference feature for facial expression recognition

Cited by: 24
Authors
Chen, Jingying [1]
Xu, Ruyi [1]
Liu, Leyuan [1]
Affiliations
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Hubei, Peoples R China
Keywords
Facial expression recognition; Facial-expression feature; Deep neural network; Saliency
DOI
10.1007/s11042-018-5909-5
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Facial expression recognition (FER) is important in vision-related applications. Deep neural networks demonstrate impressive performance for face recognition; however, they rely heavily on large amounts of manually labeled training data, which are not available for facial expressions in real-world applications. Hence, we propose a powerful facial feature called deep peak-neutral difference (DPND) for FER. DPND is defined as the difference between the deep representations of the fully expressive (peak) frame and the neutral frame. The difference tends to emphasize the facial parts that change in the transition from the neutral to the expressive face, and to eliminate the face identity information retained in the deep neural network, which is trained on a large-scale face recognition dataset and then fine-tuned for facial expression. Furthermore, unsupervised clustering and semi-supervised classification methods are presented to automatically acquire the neutral and peak frames from the expression sequence. The proposed facial expression feature achieved encouraging results on public databases, which suggests that it has strong potential to recognize facial expressions in real-world applications.
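The DPND definition in the abstract can be illustrated with a minimal sketch. It assumes a toy feature extractor (`extract_features`, a single linear layer with ReLU) standing in for the fine-tuned deep network. The function names, weights, and 2x2 "frames" are hypothetical and are not taken from the paper.

```python
def extract_features(frame, weights):
    """Hypothetical stand-in for a deep network: one linear layer plus ReLU."""
    flat = [pixel for row in frame for pixel in row]
    return [max(sum(w * x for w, x in zip(unit, flat)), 0.0) for unit in weights]


def dpnd(peak_frame, neutral_frame, weights):
    """Difference between the representations of the peak and neutral frames."""
    peak_feat = extract_features(peak_frame, weights)
    neutral_feat = extract_features(neutral_frame, weights)
    return [p - n for p, n in zip(peak_feat, neutral_feat)]


# Toy "network" with three units over a flattened 2x2 frame (assumed values).
weights = [
    [1.0, 0.0, 0.0, 0.0],  # responds only to the first pixel
    [0.0, 0.0, 0.0, 1.0],  # responds only to the last pixel
    [0.5, 0.5, 0.5, 0.5],  # responds to the whole frame
]

neutral = [[0.2, 0.4], [0.6, 0.8]]
peak = [[0.2, 0.4], [0.6, 1.0]]  # only the last pixel differs from neutral

feature = dpnd(peak, neutral, weights)
```

In this toy setting, the unit that sees only the unchanged pixel contributes zero to `feature`, while the units that see the changed pixel contribute positive values. This mirrors the abstract's claim that the difference emphasizes the parts that change and suppresses content shared by both frames.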
Pages: 29871-29887
Number of pages: 17