Recurrent Shape Regression

被引:4
作者
Cui, Zhen [1 ]
Xiao, Shengtao [2 ]
Niu, Zhiheng [2 ]
Yan, Shuicheng [2 ,3 ]
Zheng, Wenming [4 ]
机构
[1] Nanjing Univ Sci & Technol, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119077, Singapore
[3] 360 AI Inst, Beijing 100016, Peoples R China
[4] Southeast Univ, Key Lab Child Dev & Learning Sci, Sch Biol Sci & Med Engn, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Shape regression; cascaded regression; recurrent neural network; shape detection; face alignment; FACE ALIGNMENT;
D O I
10.1109/TPAMI.2018.2828424
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An end-to-end network architecture, the Recurrent Shape Regression (RSR). is presented to deal with the task of facial shape detection, a crucial step in many computer vision problems. The RSR generalizes the conventional cascaded regression into a recurrent dynamic network through abstracting common latent models with stage-to-stage operations. Instead of invariant regression transformation, we construct shape-dependent dynamic regressors to attain the recurrence of regression action itself. The regressors can be stacked into a high-order regression network to represent more complex shape regression. By further integrating feature learning as well as global shape constraint, the RSR becomes more controllable in entire optimization of shape regression, where the gradient computation can be efficiently back-propagated through time. To handle the possible partial occlusions of shapes. we propose a mimic virtual occlusion strategy by randomly disturbing certain point cliques without the requirement of any annotations of occlusion information or even occluded training data. Extensive experiments on five face datasets demonstrate that the proposed RSR outperforms the recent state-of-the-art cascaded approaches.
引用
收藏
页码:1271 / 1278
页数:8
相关论文
共 46 条
[1]  
[Anonymous], 2016, Deep learning
[2]   Robust Discriminative Response Map Fitting with Constrained Local Models [J].
Asthana, Akshay ;
Zafeiriou, Stefanos ;
Cheng, Shiyang ;
Pantic, Maja .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3444-3451
[3]   Localizing Parts of Faces Using a Consensus of Exemplars [J].
Belhumeur, Peter N. ;
Jacobs, David W. ;
Kriegman, David J. ;
Kumar, Neeraj .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2930-2940
[4]   Robust face landmark estimation under occlusion [J].
Burgos-Artizzu, Xavier P. ;
Perona, Pietro ;
Dollar, Piotr .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1513-1520
[5]   Face Alignment by Explicit Shape Regression [J].
Cao, Xudong ;
Wei, Yichen ;
Wen, Fang ;
Sun, Jian .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (02) :177-190
[6]   Multi-view facial landmark detection by using a 3D shape model [J].
Cech, Jan ;
Franc, Vojtech ;
Uricar, Michal ;
Matas, Jiri .
IMAGE AND VISION COMPUTING, 2016, 47 :60-70
[7]   Active appearance models [J].
Cootes, TF ;
Edwards, GJ ;
Taylor, CJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) :681-685
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   M3 CSR: Multi-view, multi-scale and multi-component cascade shape regression [J].
Deng, Jiankang ;
Liu, Qingshan ;
Yang, Jing ;
Tao, Dacheng .
IMAGE AND VISION COMPUTING, 2016, 47 :19-26
[10]   Cascaded Pose Regression [J].
Dollar, Piotr ;
Welinder, Peter ;
Perona, Pietro .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1078-1085