Recurrent neural network for facial landmark detection

Cited by: 24
Authors
Chen, Yu [1]
Yang, Jian [1]
Qian, Jianjun [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
Keywords
Facial landmark; Deep neural network; Recurrent neural network; Face alignment; Localization
DOI
10.1016/j.neucom.2016.09.015
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Facial landmark detection is an important issue in many computer vision applications about faces. It is very challenging as human faces in wild conditions often present large variations in shape due to different poses, occlusions or expressions. Deep neural networks have been applied to learn the map from face images to face shapes. To the best of our knowledge, Recurrent Neural Network (RNN) has not been used in this issue yet. In this paper, we propose a method which utilizes RNN and Deep Neural Network (DNN) to learn the face shape. First, we build a global network using Long Short Term Memory (LSTM) architecture of RNN to get the initial landmark estimation of faces. Then, we use feed-forward neural networks for local search where a component-based searching method is explored. By using LSTM-RNN, the initial estimation is more reliable which makes the following component-based search feasible and accurate. Experiments show that the global network using LSTM-RNN gets better results than previous networks in both videos and single image. Our method outperforms the state-of-the-art algorithms especially in terms of fine estimation of landmarks. (C) 2016 Published by Elsevier B.V.
Pages: 26-38
Number of pages: 13