Robust dynamic facial expressions recognition using Lbp-Top descriptors and Bag-of-Words classification model

被引:3
作者
Spizhevoy A.S. [1 ]
机构
[1] Itseez Incorporated, Nizhny Novgorod
关键词
bag-of-words; dynamic facial expressions; local binary patterns;
D O I
10.1134/S1054661816010247
中图分类号
学科分类号
摘要
In this work we investigate the problem of robust dynamic facial expression recognition. We develop a complete pipeline that relies on the LBP-TOP descriptors and the Bag-of-Words (BoW) model for basic expressions classification. Experiments performed on the standard dataset such as the Extended Cohn-Kanade (CK+) database show that the developed approach achieves the average recognition rate of 97.7%, thus outperforming state-of-the-art methods in terms of accuracy. The proposed method is quite robust as it uses only relevant parts of video frames such as areas around mouth, noise, eyes, etc. Ability to work with arbitrary length sequence is also a plus for practical applications, since it means there is no need for complex temporal normalization methods. © 2016, Pleiades Publishing, Ltd.
引用
收藏
页码:216 / 220
页数:4
相关论文
共 16 条
[1]  
Cootes T.F., Taylor C.J., Cooper D.H., Graham J., Active shape models-their training and application, Comput. Vision Image Understand., 61, 1, pp. 38-59, (1995)
[2]  
Cootes T.F., Edwards G.J., Taylor C.J., Active appearance models, IEEE Trans. Pattern Anal. Mach. Intellig., 23, 6, pp. 681-685, (2001)
[3]  
Dollar P., Rabaud V., Cottrell G., Belongie S., Behavior recognition via sparse spatio-temporal features, Proc. 2nd Joint IEEE Int. Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, pp. 65-72, (2005)
[4]  
Mattivi R., Shao L., Human action recognition using LBP-TOP as sparse spatio-temporal feature descriptor, Computer Analysis of Images and Patterns, pp. 740-747, (2009)
[5]  
Sanin A., Sanderson C., Harandi M.T., Lovell B.C., Spatio-temporal covariance descriptors for action and gesture recognition, Proc. IEEE Workshop on Applications of Computer Vision, pp. 103-110, (2013)
[6]  
Wang Z., Wang S., Ji Q., Capturing complex spatio-temporal relations among facial muscles for facial expression recognition, Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3422-3429, (2013)
[7]  
Lucey P., Cohn J.F., Kanade T., Saragih J., Ambadar Z., Matthews I., The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotionspecified expression, Proc. IEEE Computer Society Conf. on Computer Vision and Pattern Recognition Workshops, pp. 94-101, (2010)
[8]  
Jain S., Hu C., Aggarwal J.K., Facial expression recognition with temporal modeling of shapes, Proc. IEEE Int. Computer Vision Workshops, pp. 1642-1649, (2011)
[9]  
Huang X., Zhao G., Pietikainen M., Zheng W., Dynamic facial expression recognition using boosted component-based spatiotemporal features and multiclassifier fusion, Advanced Concepts for Intelligent Vision Systems, pp. 312-322, (2010)
[10]  
Lorincz A., Jeni L., Szabo Z., Cohn J., Kanade T., Emotional expression classification using time-series kernels, Proc. IEEE Computer Vision and Pattern Recognition Workshops, pp. 889-895, (2013)