AU-inspired Deep Networks for Facial Expression Feature Learning

被引:165
作者
Liu, Mengyi [1 ]
Li, Shaoxin [1 ]
Shan, Shiguang [1 ]
Chen, Xilin [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
关键词
Facial expression recognition; AU-inspired Deep Networks (AUDN); Micro-Action-Pattern; Receptive field; Group-wise sub-network learning; RECOGNITION;
D O I
10.1016/j.neucom.2015.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing technologies for facial expression recognition utilize off-the-shelf feature extraction methods for classification. In this paper, aiming at learning better features specific for expression representation, we propose to construct a deep architecture, AU-inspired Deep Networks (AUDN), inspired by the psychological theory that expressions can be decomposed into multiple facial Action Units (AUs). To fully exploit this inspiration but avoid detecting AUs, we propose to automatically learn: (1) informative local appearance variation; (2) optimal way to combining local variation and (3) high level representation for final expression recognition. Accordingly, the proposed AUDN is composed of three sequential modules. Firstly, we build a convolutional layer and a max-pooling layer to learn the Micro-Action-Pattern (MAP) representation, which can explicitly depict local appearance variations caused by facial expressions. Secondly, feature grouping is applied to simulate larger receptive fields by combining correlated MAPs adaptively, aiming to generate more abstract mid-level semantics. Finally, a multi-layer learning process is employed in each receptive field respectively to construct group-wise sub-networks for higher-level representations. Experiments on three expression databases CK+, MMI and SFEW demonstrate that, by simply applying linear classifiers on the learned features, our method can achieve state-of-the-art results on all the databases, which validates the effectiveness of AUDN in both lab-controlled and wild environments. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 35 条
[1]  
[Anonymous], AS C COMP VIS ACCV
[2]  
[Anonymous], 2011, ADV NEURAL INFORM PR
[3]  
Bengio Yoshua, 2006, Advances in Neural Information Processing Systems 19, V19, P153
[4]   Convergence of Ant Colony Optimization on First-Order Deceptive Systems [J].
Chen, Yixin ;
Sun, Haiying .
2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, :158-+
[5]  
Coates A., 2011, International Conference on Machine Learning, V8, P10
[6]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[7]  
Dhall A, 2011, 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS)
[8]  
Ekman P., 1978, Manual for the Facial Action Coding System
[9]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[10]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507