Towards Metrical Reconstruction of Human Faces

被引:68
作者
Zielonka, Wojciech [1 ]
Bolkart, Timo [1 ]
Thies, Justus [1 ]
机构
[1] Max Planck Inst Intelligent Syst, Tubingen, Germany
来源
COMPUTER VISION, ECCV 2022, PT XIII | 2022年 / 13673卷
关键词
D O I
10.1007/978-3-031-19778-9_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Face reconstruction and tracking is a building block of numerous applications in AR/VR, human-machine interaction, as well as medical applications. Most of these applications rely on a metrically correct prediction of the shape, especially, when the reconstructed subject is put into a metrical context (i.e., when there is a reference object of known size). A metrical reconstruction is also needed for any application that measures distances and dimensions of the subject (e.g., to virtually fit a glasses frame). State-of-the-art methods for face reconstruction from a single image are trained on large 2D image datasets in a self-supervised fashion. However, due to the nature of a perspective projection they are not able to reconstruct the actual face dimensions, and even predicting the average human face outperforms some of these methods in a metrical sense. To learn the actual shape of a face, we argue for a supervised training scheme. Since there exists no large-scale 3D dataset for this task, we annotated and unified small- and medium-scale databases. The resulting unified dataset is still a medium-scale dataset with more than 2k identities and training purely on it would lead to overfitting. To this end, we take advantage of a face recognition network pretrained on a large-scale 2D image dataset, which provides distinct features for different faces and is robust to expression, illumination, and camera changes. Using these features, we train our face shape estimator in a supervised fashion, inheriting the robustness and generalization of the face recognition network. Our method, which we call MICA (MetrIC fAce), outperforms the state-of-the-art reconstruction methods by a large margin, both on current non-metric benchmarks as well as on our metric benchmarks (15% and 24% lower average error on NoW, respectively). Project website: https://zielon.github.io/mica/
引用
收藏
页码:250 / 269
页数:20
相关论文
共 85 条
[1]   Cross-modal Deep Face Normals with Deactivable Skip Connections [J].
Abrevaya, Victoria Fernandez ;
Boukhayma, Adnane ;
Torr, Philip H. S. ;
Boyer, Edmond .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4978-4988
[2]  
An X., 2020, arXiv
[3]   Extreme 3D Face Reconstruction: Seeing Through Occlusions [J].
Anh Tuan Tran ;
Hassner, Tal ;
Masi, Iacopo ;
Paz, Eran ;
Nirkin, Yuval ;
Medioni, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3935-3944
[4]  
[Anonymous], 2016, PHOTOREALISTIC FACIA
[5]  
[Anonymous], 2019, 3D DENSE FACE ALIGNM
[6]  
Bagdanov A.D., 2011, P 2011 JOINT ACM WOR
[7]   What Does 2D Geometric Information Really Tell Us About 3D Face Shape? [J].
Bas, Anil ;
Smith, William A. P. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (10) :1455-1473
[8]  
BESL PJ, 1992, P SOC PHOTO-OPT INS, V1611, P586, DOI 10.1117/12.57955
[9]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[10]   Exchanging faces in images [J].
Blanz, V ;
Scherbaum, K ;
Vetter, T ;
Seidel, HP .
COMPUTER GRAPHICS FORUM, 2004, 23 (03) :669-676