H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

被引:36
作者
Ramon, Eduard [1 ,2 ]
Triginer, Gil [1 ]
Escur, Janna [1 ]
Pumarola, Albert [3 ]
Garcia, Jaime [1 ]
Giro-i-Nieto, Xavier [2 ,3 ]
Moreno-Noguer, Francesc [3 ]
机构
[1] Crisalix SA, Manila, Philippines
[2] Univ Politecn Cataluna, Catalunya, Spain
[3] CSIC UPC, Inst Robot & Informat Ind, Madrid, Spain
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.00557
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent learning approaches that implicitly represent surface geometry using coordinate-based neural representations have shown impressive results in the problem of multi-view 3D reconstruction. The effectiveness of these techniques is, however, subject to the availability of a large number (several tens) of input views of the scene, and computationally demanding optimizations. In this paper, we tackle these limitations for the specific problem of few-shot full 3D head reconstruction, by endowing coordinate-based representations with a probabilistic shape prior that enables faster convergence and better generalization when using few input images (down to three). First, we learn a shape model of 3D heads from thousands of incomplete raw scans using implicit representations. At test time, we jointly overfit two coordinate-based neural networks to the scene, one modelling the geometry and another estimating the surface radiance, using implicit differentiable rendering. We devise a two-stage optimization strategy in which the learned prior is used to initialize and constrain the geometry during an initial optimization phase. Then, the prior is unfrozen and fine-tuned to the scene. By doing this, we achieve high-fidelity head reconstructions, including hair and shoulders, and with a high level of detail that consistently outperforms both state-of-the-art 3D Morphable Models methods in the few-shot scenario, and non-parametric methods when large sets of views are available.
引用
收藏
页码:5600 / 5609
页数:10
相关论文
共 56 条
[51]   Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images [J].
Wang, Nanyang ;
Zhang, Yinda ;
Li, Zhuwen ;
Fu, Yanwei ;
Liu, Wei ;
Jiang, Yu-Gang .
COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 :55-71
[52]  
Wei Huawei, 2019, ARXIV190405562
[53]  
Yan XC, 2016, ADV NEUR IN, V29
[54]  
Yariv Lior, 2020, NeurIPS, V33
[55]  
Yenamandra Tarun, 2020, ARXIV201114143
[56]   pixelNeRF: Neural Radiance Fields from One or Few Images [J].
Yu, Alex ;
Ye, Vickie ;
Tancik, Matthew ;
Kanazawa, Angjoo .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4576-4585