Towards Metrical Reconstruction of Human Faces

被引：68

作者：

Zielonka, Wojciech ^{[1
]}

Bolkart, Timo ^{[1
]}

Thies, Justus ^{[1
]}

机构：

[1] Max Planck Inst Intelligent Syst, Tubingen, Germany

来源：

COMPUTER VISION, ECCV 2022, PT XIII | 2022年 / 13673卷

关键词：

D O I：

10.1007/978-3-031-19778-9_15

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Face reconstruction and tracking is a building block of numerous applications in AR/VR, human-machine interaction, as well as medical applications. Most of these applications rely on a metrically correct prediction of the shape, especially, when the reconstructed subject is put into a metrical context (i.e., when there is a reference object of known size). A metrical reconstruction is also needed for any application that measures distances and dimensions of the subject (e.g., to virtually fit a glasses frame). State-of-the-art methods for face reconstruction from a single image are trained on large 2D image datasets in a self-supervised fashion. However, due to the nature of a perspective projection they are not able to reconstruct the actual face dimensions, and even predicting the average human face outperforms some of these methods in a metrical sense. To learn the actual shape of a face, we argue for a supervised training scheme. Since there exists no large-scale 3D dataset for this task, we annotated and unified small- and medium-scale databases. The resulting unified dataset is still a medium-scale dataset with more than 2k identities and training purely on it would lead to overfitting. To this end, we take advantage of a face recognition network pretrained on a large-scale 2D image dataset, which provides distinct features for different faces and is robust to expression, illumination, and camera changes. Using these features, we train our face shape estimator in a supervised fashion, inheriting the robustness and generalization of the face recognition network. Our method, which we call MICA (MetrIC fAce), outperforms the state-of-the-art reconstruction methods by a large margin, both on current non-metric benchmarks as well as on our metric benchmarks (15% and 24% lower average error on NoW, respectively). Project website: https://zielon.github.io/mica/

引用

页码：250 / 269

页数：20

共 85 条

[1] Cross-modal Deep Face Normals with Deactivable Skip Connections [J].

Abrevaya, Victoria Fernandez ;

Boukhayma, Adnane ;

Torr, Philip H. S. ;

Boyer, Edmond .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4978-4988

[2]

An X., 2020, arXiv

[3] Extreme 3D Face Reconstruction: Seeing Through Occlusions [J].

Anh Tuan Tran ;

Hassner, Tal ;

Masi, Iacopo ;

Paz, Eran ;

Nirkin, Yuval ;

Medioni, Gerard .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3935-3944

[4]

[Anonymous], 2016, PHOTOREALISTIC FACIA

[5]

[Anonymous], 2019, 3D DENSE FACE ALIGNM

[6]

Bagdanov A.D., 2011, P 2011 JOINT ACM WOR

[7] What Does 2D Geometric Information Really Tell Us About 3D Face Shape? [J].

Bas, Anil ;

Smith, William A. P. .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (10) :1455-1473

[8]

BESL PJ, 1992, P SOC PHOTO-OPT INS, V1611, P586, DOI 10.1117/12.57955

[9] A morphable model for the synthesis of 3D faces [J].

Blanz, V ;

Vetter, T .

SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194

[10] Exchanging faces in images [J].

Blanz, V ;

Scherbaum, K ;

Vetter, T ;

Seidel, HP .

COMPUTER GRAPHICS FORUM, 2004, 23 (03) :669-676

← 1 2 3 4 5 6 7 8 9 →