Authentic Volumetric Avatars from a Phone Scan

被引:68
作者
Cao, Chen [1 ]
Simon, Tomas [1 ]
Kim, Jin Kyu [1 ]
Schwartz, Gabe [1 ]
Zollhoefer, Michael [1 ]
Saito, Shun-Suke [1 ]
Lombardi, Stephen [1 ]
Wei, Shih-En [1 ]
Belko, Danielle [1 ]
Yu, Shoou-, I [1 ]
Sheikh, Yaser [1 ]
Saragih, Jason [1 ]
机构
[1] Real Labs, 131 15th St, Pittsburgh, PA 15222 USA
来源
ACM TRANSACTIONS ON GRAPHICS | 2022年 / 41卷 / 04期
关键词
3D Avatar Creation; Neural Rendering; OF-THE-ART; 3D;
D O I
10.1145/3528223.3530143
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Creating photorealistic avatars of existing people currently requires extensive person-specific data capture, which is usually only accessible to the VFX industry and not the general public. Our work aims to address this drawback by relying only on a short mobile phone capture to obtain a drivable 3D head avatar that matches a person's likeness faithfully. In contrast to existing approaches, our architecture avoids the complex task of directly modeling the entire manifold of human appearance, aiming instead to generate an avatar model that can be specialized to novel identities using only small amounts of data. The model dispenses with low-dimensional latent spaces that are commonly employed for hallucinating novel identities, and instead, uses a conditional representation that can extract person-specific information at multiple scales from a high resolution registered neutral phone scan. We achieve high quality results through the use of a novel universal avatar prior that has been trained on high resolution multi-view video captures of facial performances of hundreds of human subjects. By fine-tuning the model using inverse rendering we achieve increased realism and personalize its range of motion. The output of our approach is not only a high-fidelity 3D head avatar that matches the person's facial shape and appearance, but one that can also be driven using a jointly discovered shared global expression space with disentangled controls for gaze direction. Via a series of experiments we demonstrate that our avatars are faithful representations of the subject's likeness. Compared to other state-of-the-art methods for lightweight avatar creation, our approach exhibits superior visual quality and animateability.
引用
收藏
页数:19
相关论文
共 88 条
  • [1] Abdal R, 2020, PROC CVPR IEEE, P8293, DOI 10.1109/CVPR42600.2020.00832
  • [2] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
    Abdal, Rameen
    Qin, Yipeng
    Wonka, Peter
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4431 - 4440
  • [3] Near-Eye Varifocal Augmented Reality Display using See-Through Screens
    Aksit, Kaan
    Lopes, Ward
    Kim, Jonghyun
    Shirley, Peter
    Luebke, David
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (06):
  • [4] The Digital Emily Project: Achieving a Photorealistic Digital Actor
    Alexander, Oleg
    Rogers, Mike
    Lambeth, William
    Chiang, Jen-Yuan
    Ma, Wan-Chun
    Wang, Chuan-Chang
    Debevec, Paul
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2010, 30 (04) : 20 - 31
  • [5] Alexander Oleg, 2013, ACM SIGGRAPH 2013 PO, V1, P1
  • [6] Modeling Facial Geometry using Compositional VAEs
    Bagautdinov, Timur
    Wu, Chenglei
    Saragih, Jason
    Fua, Pascal
    Sheikh, Yaser
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3877 - 3886
  • [7] High-Quality Passive Facial Performance Capture using Anchor Frames
    Beeler, Thabo
    Hahn, Fabian
    Bradley, Derek
    Bickel, Bernd
    Beardsley, Paul
    Gotsman, Craig
    Sumner, Robert W.
    Gross, Markus
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (04):
  • [8] Multi-scale capture of facial geometry and motion
    Bickel, Bernd
    Botsch, Mario
    Angst, Roland
    Matusik, Wojciech
    Otaduy, Miguel
    Pfister, Hanspeter
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2007, 26 (03):
  • [9] A morphable model for the synthesis of 3D faces
    Blanz, V
    Vetter, T
    [J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
  • [10] A 3D Morphable Model learnt from 10,000 faces
    Booth, James
    Roussos, Anastasios
    Zafeiriou, Stefanos
    Ponniah, Allan
    Dunaway, David
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5543 - 5552