Automatic Acquisition of High-fidelity Facial Performances Using Monocular Videos

被引:102
作者
Shi, Fuhao [1 ]
Wu, Hsiang-Tao [2 ]
Tong, Xin [2 ]
Chai, Jinxiang [1 ]
机构
[1] Texas A&M Univ, College Stn, TX 77843 USA
[2] Microsoft Res Asia, Beijing, Peoples R China
来源
ACM TRANSACTIONS ON GRAPHICS | 2014年 / 33卷 / 06期
基金
美国国家科学基金会;
关键词
facial performance capture; face animation; facial modeling; facial detection and tracking; facial editing; SHAPE; RECOGNITION;
D O I
10.1145/2661229.2661290
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents a facial performance capture system that automatically captures high-fidelity facial performances using uncontrolled monocular videos (e.g., Internet videos). We start the process by detecting and tracking important facial features such as the nose tip and mouth corners across the entire sequence and then use the detected facial features along with multilinear facial models to reconstruct 3D head poses and large-scale facial deformation of the subject at each frame. We utilize per-pixel shading cues to add fine-scale surface details such as emerging or disappearing wrinkles and folds into large-scale facial deformation. At a final step, we iterate our reconstruction procedure on large-scale facial geometry and fine-scale facial details to further improve the accuracy of facial reconstruction. We have tested our system on monocular videos downloaded from the Internet, demonstrating its accuracy and robustness under a variety of uncontrolled lighting conditions and overcoming significant shape differences across individuals. We show our system advances the state of the art in facial performance capture by comparing against alternative methods.
引用
收藏
页数:13
相关论文
共 37 条
[1]   Shape quantization and recognition with randomized trees [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1997, 9 (07) :1545-1588
[2]  
[Anonymous], 2013, ACM T GRAPHIC
[3]  
[Anonymous], 2009, P 2009 ACM SIGGRAPHE, DOI [DOI 10.1145/1599470.1599472, 10.1145/1599470.1599472]
[4]   Lambertian reflectance and linear subspaces [J].
Basri, R ;
Jacobs, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (02) :218-233
[5]   High-Quality Single-Shot Capture of Facial Geometry [J].
Beeler, Thabo ;
Bickel, Bernd ;
Beardsley, Paul ;
Sumner, Bob ;
Gross, Markus .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)
[6]   High-Quality Passive Facial Performance Capture using Anchor Frames [J].
Beeler, Thabo ;
Hahn, Fabian ;
Bradley, Derek ;
Bickel, Bernd ;
Beardsley, Paul ;
Gotsman, Craig ;
Sumner, Robert W. ;
Gross, Markus .
ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04)
[7]  
Belhumeur PN, 2011, PROC CVPR IEEE, P545, DOI 10.1109/CVPR.2011.5995602
[8]   Multi-scale capture of facial geometry and motion [J].
Bickel, Bernd ;
Botsch, Mario ;
Angst, Roland ;
Matusik, Wojciech ;
Otaduy, Miguel ;
Pfister, Hanspeter .
ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03)
[9]   Reanimating faces in images and video [J].
Blanz, V ;
Basso, C ;
Poggio, T ;
Vetter, T .
COMPUTER GRAPHICS FORUM, 2003, 22 (03) :641-650
[10]   Online Modeling For Realtime Facial Animation [J].
Bouaziz, Sofien ;
Wang, Yangang ;
Pauly, Mark .
ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)