Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction

Cited by: 271

Authors:

Gafni, Guy [1]

Thies, Justus [1]

Zollhoefer, Michael [2]

Niessner, Matthias [1]

Affiliations:

[1] Tech Univ Munich, Munich, Germany

[2] Facebook Real Labs Res, Pittsburgh, PA USA

Source:

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021

Funding:

European Research Council;

Keywords:

FACES;

DOI:

10.1109/CVPR46437.2021.00854

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building block for a variety of applications. In particular, telepresence applications in AR or VR require a faithful reproduction of the appearance, including novel viewpoints or head poses. In contrast to state-of-the-art approaches that model the geometry and material properties explicitly, or are purely image-based, we introduce an implicit representation of the head based on scene representation networks. To handle the dynamics of the face, we combine our scene representation network with a low-dimensional morphable model, which provides explicit control over pose and expressions. We use volumetric rendering to generate images from this hybrid representation and demonstrate that such a dynamic neural scene representation can be learned from monocular input data only, without the need for a specialized capture setup. In our experiments, we show that this learned volumetric representation allows for photorealistic image generation that surpasses the quality of state-of-the-art video-based reenactment methods.


Pages: 8645-8654

Page count: 10
