Facial Expression Recognition Using Enhanced Deep 3D Convolutional Neural Networks

被引:163
作者
Hasani, Behzad [1 ]
Mahoor, Mohammad H. [1 ]
机构
[1] Univ Denver, Dept Elect & Comp Engn, Denver, CO 80208 USA
来源
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2017年
基金
美国国家科学基金会;
关键词
MODELS; FACE;
D O I
10.1109/CVPRW.2017.282
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Neural Networks (DNNs) have shown to outperform traditional methods in various visual recognition tasks including Facial Expression Recognition (FER). In spite of efforts made to improve the accuracy of FER systems using DNN, existing methods still are not generalizable enough in practical applications. This paper proposes a 3D Convolutional Neural Network method for FER in videos. This new network architecture consists of 3D Inception-ResNet layers followed by an LSTM unit that together extracts the spatial relations within facial images as well as the temporal relations between different frames in the video. Facial landmark points are also used as inputs to our network which emphasize on the importance of facial components rather than the facial regions that may not contribute significantly to generating facial expressions. Our proposed method is evaluated using four publicly available databases in subject-independent and cross-database tasks and outperforms state-of-the-art methods.
引用
收藏
页码:2278 / 2288
页数:11
相关论文
共 63 条
[1]  
[Anonymous], FACE ALIGNMENT IN 30
[2]  
[Anonymous], 1983, EMFACS EMOTIONAL FAC
[3]  
[Anonymous], ARXIV170306995
[4]  
[Anonymous], 1997, Neural Computation
[5]  
[Anonymous], 2006, Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition-Volume 2, Washington, DC, USA
[6]  
[Anonymous], 2016, IEEE C COMPUTER VISI
[7]  
[Anonymous], IEEE C COMP VIS PATT
[8]  
[Anonymous], 2016, APPL COMP VIS WACV 2
[9]  
[Anonymous], IEEE C COMP VIS PATT
[10]  
[Anonymous], 2016, ARXIV160800911