Facial Expression Recognition Using a Temporal Ensemble of Multi-Level Convolutional Neural Networks

被引:44
作者
Hai-Duong Nguyen [1 ]
Kim, Sun-Hee [2 ]
Lee, Guee-Sang [1 ]
Yang, Hyung-Jeong [1 ]
Na, In-Seop [3 ]
Kim, Soo-Hyung [1 ]
机构
[1] Chonnam Natl Univ, Sch Elect & Comp Engn, 77 Yongbong Ro, Gwangju 500757, South Korea
[2] Korea Univ, Dept Brain & Cognit Engn, Seoul 02841, South Korea
[3] Chosun Univ, Software Convergence Educ Inst, 309 Pilmun Daero, Gwangju 61452, South Korea
基金
新加坡国家研究基金会;
关键词
EmotiW challenge; ensemble model; facial expression recognition in the wild; FER2013; hierarchical features; multi-level convolutional neural networks;
D O I
10.1109/TAFFC.2019.2946540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition is indispensable in human-machine interaction systems. It comprises locating facial regions of interest in images and classifying them into one of seven classes: angry, disgust, fear, happy, neutral, sad, and surprise. Despite several breakthroughs in image classification, particularly in facial expression recognition, this research area is still challenging, as sampling in the wild is a demanding task. In this study, a two-stage method is proposed for recognizing facial expressions given a sequence of images. At the first stage, all face regions are extracted in each frame, and essential information that would be helpful and related to human emotion is obtained. Then, the extracted features from the previous step are considered temporal data and are assigned to one of the seven basic emotions. In addition, a study of multi-level features is conducted in a convolutional neural network for facial expression recognition. Moreover, various network connections are introduced to improve the classification task. By combining the proposed network connections, superior results are obtained compared to state-of-the-art methods on the FER2013 dataset. Furthermore, the performance of our temporal model is better than that of the single architecture of the 2017 EmotiW challenge winner on the AFEW 7.0 dataset.
引用
收藏
页码:226 / 237
页数:12
相关论文
共 38 条
[1]  
Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/2951913.2976746, 10.1145/3022670.2976746]
[2]  
[Anonymous], 2013, KAGGLE COMPETITION
[3]  
[Anonymous], 1977, FACS FACIAL ACTION C
[4]  
[Anonymous], 2013, ARXIV13060239
[5]  
[Anonymous], 2017, KERAS VIS
[6]  
Breeze A, 2016, ALASTAIR BREEZE COMP
[7]  
Chollet F., 2015, Keras
[8]  
Connie Tee, 2017, Multi-disciplinary Trends in Artificial Intelligence. 11th International Workshop, MIWAI 2017. Proceedings: LNAI 10607, P139, DOI 10.1007/978-3-319-69456-6_12
[9]  
Dhall A, 2017, PROCEEDINGS OF THE 19TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2017, P524, DOI 10.1145/3136755.3143004
[10]   DISTRIBUTION OF THE ESTIMATORS FOR AUTOREGRESSIVE TIME-SERIES WITH A UNIT ROOT [J].
DICKEY, DA ;
FULLER, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) :427-431