A spatio-temporal RBM-based model for facial expression recognition

Cited by: 52
Authors
Elaiwat, S. [1]
Bennamoun, M. [1]
Boussaid, F. [2]
Affiliations
[1] Univ Western Australia, Sch Comp Sci & Software Engn, Crawley, WA, Australia
[2] Univ Western Australia, Sch Elect Elect & Comp Engn, Crawley, WA, Australia
Funding
Australian Research Council
Keywords
Face expression recognition; Restricted Boltzmann Machines; Spatio-temporal features; Image transformations
DOI
10.1016/j.patcog.2015.07.006
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The ability to recognize facial expressions will be an important characteristic of next-generation human-computer interfaces. Towards this goal, we propose a novel RBM-based model to effectively learn the relationships (or transformations) between image pairs associated with different facial expressions. The proposed model can disentangle these transformations (e.g. pose variations and facial expressions) by encoding them into two different hidden sets, namely facial-expression morphlets and non-facial-expression morphlets. The first hidden set encodes facial-expression morphlets through a factored four-way sub-model conditional on label units. The second hidden set encodes non-facial-expression morphlets through a factored three-way sub-model. With this strategy, the proposed model can learn transformations between image pairs while disentangling facial-expression transformations from non-facial-expression transformations. This is achieved using an algorithm dubbed Quadripartite Contrastive Divergence. Reported experiments demonstrate the superior performance of the proposed model compared to the state of the art. (C) 2015 Elsevier Ltd. All rights reserved.
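As a rough illustration of what a factored three-way sub-model computes, the sketch below evaluates hidden-unit probabilities for an image pair in the generic factored gated-RBM form, where each hidden unit sees the product of two factor projections. This is an assumption-laden sketch, not the authors' model: it omits the four-way label-conditioned sub-model and Quadripartite Contrastive Divergence, and every name (`hidden_probs`, `Wx`, `Wy`, `Wh`, `b_h`) and size is hypothetical.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def project(weights, vec):
    """Project vec onto factors: sum_i weights[i][f] * vec[i] for each factor f."""
    n_factors = len(weights[0])
    return [sum(weights[i][f] * vec[i] for i in range(len(vec)))
            for f in range(n_factors)]


def hidden_probs(x, y, Wx, Wy, Wh, b_h):
    """P(h_k = 1 | x, y) for a generic factored three-way (gated) RBM.

    x, y : the two images of a pair, flattened to lists of floats
    Wx   : [len(x)][F] weights from the first image to the factors
    Wy   : [len(y)][F] weights from the second image to the factors
    Wh   : [K][F] weights from the hidden units to the factors
    b_h  : [K] hidden biases
    """
    fx = project(Wx, x)
    fy = project(Wy, y)
    # Each factor multiplies its two projections, so a hidden unit
    # responds to how x and y relate rather than to either image alone.
    prod = [a * b for a, b in zip(fx, fy)]
    return [sigmoid(b_h[k] + sum(Wh[k][f] * prod[f] for f in range(len(prod))))
            for k in range(len(b_h))]


def random_matrix(rows, cols, rng, scale=0.1):
    """Small Gaussian-initialized weight matrix (illustrative only)."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


# Toy sizes, chosen only for illustration.
rng = random.Random(0)
n_pixels, n_factors, n_hidden = 6, 4, 3
Wx = random_matrix(n_pixels, n_factors, rng)
Wy = random_matrix(n_pixels, n_factors, rng)
Wh = random_matrix(n_hidden, n_factors, rng)
b_h = [0.0] * n_hidden

x = [rng.random() for _ in range(n_pixels)]
y = [rng.random() for _ in range(n_pixels)]
probs = hidden_probs(x, y, Wx, Wy, Wh, b_h)
print(probs)
```

Because the image projections enter multiplicatively, a blank second image with zero biases leaves every hidden unit at probability 0.5, which is one way to see that these units encode the relation between the two images.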
Pages: 152-161
Number of pages: 10