H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

被引:36
作者
Ramon, Eduard [1 ,2 ]
Triginer, Gil [1 ]
Escur, Janna [1 ]
Pumarola, Albert [3 ]
Garcia, Jaime [1 ]
Giro-i-Nieto, Xavier [2 ,3 ]
Moreno-Noguer, Francesc [3 ]
机构
[1] Crisalix SA, Manila, Philippines
[2] Univ Politecn Cataluna, Catalunya, Spain
[3] CSIC UPC, Inst Robot & Informat Ind, Madrid, Spain
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.00557
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent learning approaches that implicitly represent surface geometry using coordinate-based neural representations have shown impressive results in the problem of multi-view 3D reconstruction. The effectiveness of these techniques is, however, subject to the availability of a large number (several tens) of input views of the scene, and computationally demanding optimizations. In this paper, we tackle these limitations for the specific problem of few-shot full 3D head reconstruction, by endowing coordinate-based representations with a probabilistic shape prior that enables faster convergence and better generalization when using few input images (down to three). First, we learn a shape model of 3D heads from thousands of incomplete raw scans using implicit representations. At test time, we jointly overfit two coordinate-based neural networks to the scene, one modelling the geometry and another estimating the surface radiance, using implicit differentiable rendering. We devise a two-stage optimization strategy in which the learned prior is used to initialize and constrain the geometry during an initial optimization phase. Then, the prior is unfrozen and fine-tuned to the scene. By doing this, we achieve high-fidelity head reconstructions, including hair and shoulders, and with a high level of detail that consistently outperforms both state-of-the-art 3D Morphable Models methods in the few-shot scenario, and non-parametric methods when large sets of views are available.
引用
收藏
页码:5600 / 5609
页数:10
相关论文
共 56 条
[1]  
Amberg Brian, 2007, CVPR '07. IEEE Conference on Computer Vision and Pattern Recognition, P1
[2]   Extreme 3D Face Reconstruction: Seeing Through Occlusions [J].
Anh Tuan Tran ;
Hassner, Tal ;
Masi, Iacopo ;
Paz, Eran ;
Nirkin, Yuval ;
Medioni, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3935-3944
[3]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00899
[4]  
[Anonymous], 2017, 3DV, DOI DOI 10.1109/3DV.2017.00054
[5]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00375
[6]  
[Anonymous], 2019, P IEEECVF C COMPUTER, DOI DOI 10.1109/CVPR.2019.00105
[7]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00356
[8]  
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.580
[9]   SAL: Sign Agnostic Learning of Shapes from Raw Data [J].
Atzmon, Matan ;
Lipman, Yaron .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2562-2571
[10]  
Bai Ziqian, 2020, CVPR, V4