Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction

Cited by: 271
Authors
Gafni, Guy [1]
Thies, Justus [1]
Zollhoefer, Michael [2]
Niessner, Matthias [1]
Affiliations
[1] Tech Univ Munich, Munich, Germany
[2] Facebook Real Labs Res, Pittsburgh, PA USA
Source
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021
Funding
European Research Council;
Keywords
FACES;
DOI
10.1109/CVPR46437.2021.00854
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present dynamic neural radiance fields for modeling the appearance and dynamics of a human face. Digitally modeling and reconstructing a talking human is a key building block for a variety of applications. In particular, telepresence applications in AR or VR require a faithful reproduction of the appearance, including under novel viewpoints and head poses. In contrast to state-of-the-art approaches that model the geometry and material properties explicitly, or are purely image-based, we introduce an implicit representation of the head based on scene representation networks. To handle the dynamics of the face, we combine our scene representation network with a low-dimensional morphable model which provides explicit control over pose and expressions. We use volumetric rendering to generate images from this hybrid representation and demonstrate that such a dynamic neural scene representation can be learned from monocular input data only, without the need for a specialized capture setup. In our experiments, we show that this learned volumetric representation allows for photorealistic image generation that surpasses the quality of state-of-the-art video-based reenactment methods.
Pages: 8645-8654
Page count: 10
Related papers
52 in total
[1] Afchar D, 2018, IEEE INT WORKS INFOR
[2] Aneja Shivangi, 2020, GENERALIZED ZERO FEW
[3] [Anonymous], 2015, COMPUTER VISION PATT
[4] Bringing Portraits to Life [J]. Averbuch-Elor, Hadar; Cohen-Or, Daniel; Kopf, Johannes; Cohen, Michael F. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (06)
[5] High-Quality Passive Facial Performance Capture using Anchor Frames [J]. Beeler, Thabo; Hahn, Fabian; Bradley, Derek; Bickel, Bernd; Beardsley, Paul; Gotsman, Craig; Sumner, Robert W.; Gross, Markus. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04)
[6] A morphable model for the synthesis of 3D faces [J]. Blanz, V; Vetter, T. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999: 187-194
[7] Exchanging faces in images [J]. Blanz, V; Scherbaum, K; Vetter, T; Seidel, HP. COMPUTER GRAPHICS FORUM, 2004, 23 (03): 669-676
[8] Reanimating faces in images and video [J]. Blanz, V; Basso, C; Poggio, T; Vetter, T. COMPUTER GRAPHICS FORUM, 2003, 22 (03): 641-650
[9] Online Modeling For Realtime Facial Animation [J]. Bouaziz, Sofien; Wang, Yangang; Pauly, Mark. ACM TRANSACTIONS ON GRAPHICS, 2013, 32 (04)
[10] Accurate and Robust 3D Facial Capture Using a Single RGBD Camera [J]. Chen, Yen-Lin; Wu, Hsiang-Tao; Shi, Fuhao; Tong, Xin; Chai, Jinxiang. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013: 3615-3622