A facial expression recognition system using robust face features from depth videos and deep learning

被引:54
作者
Uddin, Md. Zia [1 ]
Hassan, Mohammed Mehedi [2 ]
Almogren, Ahmad [2 ]
Zuair, Mansour [2 ]
Fortino, Giancarlo [3 ]
Torresen, Jim [1 ]
机构
[1] Univ Oslo, Dept Informat, Oslo, Norway
[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[3] Univ Calabria, Dept Informat Modeling Elect & Syst, Commenda Di Rende, Italy
关键词
Depth image; Facial expression recognition; Modified local directional patterns; Generalized discriminant analysis; Deep belief network; INDEPENDENT COMPONENT ANALYSIS; ALGORITHM;
D O I
10.1016/j.compeleceng.2017.04.019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes a depth camera-based robust facial expression recognition (FER) system that can be adopted for better human machine interaction. Although video-based facial expression analysis has been focused on by many researchers, there are still various problems to be solved in this regard such as noise due to illumination variations over time. Depth video data in the helps to make an FER system person-independent as pixel values in depth images are distributed based on distances from a depth camera. Besides, depth images should resolve some privacy issues as real identity of a user can be hidden. The accuracy of an FER system is much dependent on the extraction of robust features. Here, we propose a novel method to extract salient features from depth faces that are further combined with deep learning for efficient training and recognition. Eight directional strengths are obtained for each pixel in a depth image where signs of some top strengths are arranged to represent unique as well as robust face features, which can be denoted as Modified Local Directional Patterns (MLDP). The MLDP features are further processed by Generalized Discriminant Analysis (GDA) for better face feature extraction. GDA is an efficient tool that helps distinguishing MLDP features of different facial expressions by clustering the features from the same expression as close as possible and separating the features from different expressions as much as possible in a non-linear space. Then, MLDP-GDA features are applied with Deep Belief Network (DBN) for training different facial expressions. Finally, the trained DBN is used to recognize facial expressions in a depth video for testing. The proposed approach was compared with other traditional approaches in a standalone system where the proposed one showed its superiority by achieving mean recognition rate of 96.25% where the other approaches could make 91.67% at the best. The deep learning-based training and recognition of the facial expression features can also be undertaken with cloud computing to support many users and make the system faster than a standalone system. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:114 / 125
页数:12
相关论文
共 28 条
[1]   Automatic facial expression recognition using facial animation parameters and multistream HMMs [J].
Aleksic, Petar S. ;
Katsaggelos, Aggelos K. .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2006, 1 (01) :3-11
[2]  
[Anonymous], ADV NEURAL INFORM PR
[3]  
[Anonymous], 2008, 2008 IEEE C COMP VIS, DOI DOI 10.1109/CVPR.2008.458
[4]  
Bartlett Marian Stewart, 1999, P 6 JOINT S NEUR COM, P8
[5]   Cognitive Architectures for Human-Robot Interaction [J].
Baxter, Paul ;
Trafton, Greg .
HRI'14: PROCEEDINGS OF THE 2014 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2014, :504-505
[6]  
Buciu I, 2003, 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 2, PROCEEDINGS, P855
[7]   Configural information in facial expression perception [J].
Calder, AJ ;
Young, AW ;
Keane, J ;
Dean, M .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2000, 26 (02) :527-551
[8]   Facial expression recognition by supervised independent component analysis using MAP estimation [J].
Chen, Fan ;
Kotani, Kazunori .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (02) :341-350
[9]   Recognizing facial action units using independent component analysis and support vector machine [J].
Chuang, Chao-Fa ;
Shih, Frank Y. .
PATTERN RECOGNITION, 2006, 39 (09) :1795-1798
[10]   Classifying facial actions [J].
Donato, G ;
Bartlett, MS ;
Hager, JC ;
Ekman, P ;
Sejnowski, TJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (10) :974-989