Efficient generic face model fitting to images and videos

被引:15
作者
Unzueta, Luis [1 ]
Pimenta, Waldir [2 ]
Goenetxea, Jon [1 ]
Santos, Luis Paulo [2 ]
Dornaika, Fadi [3 ,4 ]
机构
[1] Vicomtech IK4, Donostia San Sebastian 20009, Spain
[2] Univ Minho, Dept Informat, P-4710057 Braga, Portugal
[3] Univ Basque Country, EHU UPV, Comp Engn Fac, Donostia San Sebastian 20018, Spain
[4] Ikerbasque, Basque Fdn Sci, Bilbao 48011, Spain
关键词
Face model fitting; Head pose estimation; Facial feature detection; Face tracking; ACTIVE APPEARANCE MODELS; POSE; FEATURES; TRACKING;
D O I
10.1016/j.imavis.2014.02.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a robust and lightweight method for the automatic fitting of deformable 3D face models on facial images. Popular fitting techniques such as those based on statistical models of shape and appearance require a training stage based on a set of facial images and their corresponding facial landmarks, which have to be manually labeled. Therefore, new images in which to fit the model cannot differ too much in shape and appearance (including illumination variation, facial hair, wrinkles, etc.) from those used for training. By contrast, our approach can fit a generic face model in two steps: (1) the detection of facial features based on local image gradient analysis and (2) the backprojection of a deformable 3D face model through the optimization of its deformation parameters. The proposed approach can retain the advantages of both learning-free and learning-based approaches. Thus, we can estimate the position, orientation, shape and actions of faces, and initialize user-specific face tracking approaches, such as Online Appearance Models (OAMs), which have shown to be more robust than generic user tracking approaches. Experimental results show that our method outperforms other fitting alternatives under challenging illumination conditions and with a computational cost that allows its implementation in devices with low hardware specifications, such as smartphones and tablets. Our proposed approach lends itself nicely to many frameworks addressing semantic inference in face images and videos. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:321 / 334
页数:14
相关论文
共 44 条
[1]  
Ahlberg J., 2001, CANDIDE-3 -- an updated parameterized face
[2]  
[Anonymous], P INT C COMP VIS
[3]  
[Anonymous], 2006, BRIT MACH VIS C
[4]  
[Anonymous], 2013, P IEEE C COMP VIS PA
[5]   Facial feature detection using distance vector fields [J].
Asteriadis, Stylianos ;
Nikolaidis, Nikos ;
Pitas, Ioannis .
PATTERN RECOGNITION, 2009, 42 (07) :1388-1398
[6]  
Asthana A., 2013, P IEEE C COMP VIS PA
[7]   Face recognition based on fitting a 3D morphable model [J].
Blanz, V ;
Vetter, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (09) :1063-1074
[8]  
Blanz V., 2004, COMPUTER VISION PATT, V2, P454
[9]  
Cao XD, 2012, PROC CVPR IEEE, P2887, DOI 10.1109/CVPR.2012.6248015
[10]   3D Active Appearance Model for Aligning Faces in 2D Images [J].
Chen, Chun-Wei ;
Wang, Chieh-Chih .
2008 IEEE/RSJ INTERNATIONAL CONFERENCE ON ROBOTS AND INTELLIGENT SYSTEMS, VOLS 1-3, CONFERENCE PROCEEDINGS, 2008, :3133-3139