Lightweight Binocular Facial Performance Capture under Uncontrolled Lighting

Cited by: 98

Authors:

Valgaerts, Levi [1]

Wu, Chenglei [1,2]

Bruhn, Andres [3]

Seidel, Hans-Peter [1]

Theobalt, Christian [1]

Affiliations:

[1] MPI Informat, Saarbrucken, Germany

[2] Intel Visual Comp Inst, Saarbrucken, Germany

[3] Univ Stuttgart, D-7000 Stuttgart, Germany

Source:

ACM TRANSACTIONS ON GRAPHICS | 2012 / Vol. 31 / Issue 06

Keywords:

Facial Performance Capture; Scene Flow; Shading-based Refinement; Uncontrolled Lighting; 3D MOTION; SHAPE; STEREO; IMAGES; FACES; VIDEO; FLOW;

DOI:

10.1145/2366145.2366206

Chinese Library Classification number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Recent progress in passive facial performance capture has shown impressively detailed results on highly articulated motion. However, most methods rely on complex multi-camera set-ups, controlled lighting or fiducial markers. This prevents them from being used in general environments, outdoor scenes, during live action on a film set, or by freelance animators and everyday users who want to capture their digital selves. In this paper, we therefore propose a lightweight passive facial performance capture approach that is able to reconstruct high-quality dynamic facial geometry from only a single pair of stereo cameras. Our method succeeds under uncontrolled and time-varying lighting, and also in outdoor scenes. Our approach builds upon and extends recent image-based scene flow computation, lighting estimation and shading-based refinement algorithms. It integrates them into a pipeline that is specifically tailored towards facial performance reconstruction from challenging binocular footage under uncontrolled lighting. In an experimental evaluation, the strong capabilities of our method become explicit: We achieve detailed and spatio-temporally coherent results for expressive facial motion in both indoor and outdoor scenes - even from low quality input images recorded with a hand-held consumer stereo camera. We believe that our approach is the first to capture facial performances of such high quality from a single stereo rig and we demonstrate that it brings facial performance capture out of the studio, into the wild, and within the reach of everybody.


Pages: 11

