SeIF: Semantic-Constrained Deep Implicit Function for Single-Image 3D Head Reconstruction

被引：1

作者：

Liu, Leyuan ^{[1
,2
]}

Liu, Xu ^{[1
,2
]}

Sun, Jianchi ^{[3
]}

Gao, Changxin ^{[4
]}

Chen, Jingying ^{[1
,2
]}

机构：

[1] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China

[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China

[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

[4] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430073, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Three-dimensional displays; Image reconstruction; Semantics; Solid modeling; Faces; Codes; Fires; 3D head reconstruction; 3D morphable model; deep implicit function; model animation; semantic constraint; FACE RECONSTRUCTION;

D O I：

10.1109/TMM.2024.3405721

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Various applications require realistic, artifact-free, and animatable 3D avatars. However, traditional 3D morphable models (3DMMs) produce animatable 3D heads but fail to capture accurate geometries and details, while existing deep implicit functions have been shown to achieve realistic reconstructions but suffer from artifacts and struggle to yield 3D heads that are easy to animate. To reconstruct high-fidelity, artifact-less, and animatable 3D heads from single-view images, we leverage semantics to bridge the best properties of 3DMMs and deep implicit functions and propose SeIF-a semantic-constrained deep implicit function. First, SeIF derives fine-grained semantics from a standard 3DMM (e.g., FLAME) and samples a semantic code for each query point in the query space to provide a soft constraint to the deep implicit function. The reconstruction results show that this semantic constraint does not weaken the powerful representation ability of the deep implicit function while significantly suppressing artifacts. Second, SeIF predicts a more accurate semantic code for each query point and utilizes the semantic codes to uniformize the structure of reconstructed 3D head meshes with the standard 3DMM. Since our reconstructed 3D head meshes have the same structure as the 3DMM, 3DMM-based animation approaches can be easily transferred to animate our reconstructed 3D heads. As a result, SeIF can reconstruct high-fidelity, artifact-less, and animatable 3D heads from single-view images of individuals with diverse ages, genders, races, and facial expressions. Quantitative and qualitative experimental results on seven datasets show that SeIF outperforms existing state-of-the-art methods by a large margin.

引用

页码：10106 / 10120

页数：15

共 61 条

[1] High-Fidelity 3D Digital Human Head Creation from RGB-D Selfies
Bao, Linchao
Lin, Xiangkai
Chen, Yajing
Zhang, Haoxian
Wang, Sheng
Zhe, Xuefei
Kang, Di
Huang, Haozhi
Jiang, Xinwei
Wang, Jue
Yu, Dong
Zhang, Zhengyou
[J]. ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (01):
[2] What Does 2D Geometric Information Really Tell Us About 3D Face Shape?
Bas, Anil
Smith, William A. P.
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (10) : 1455 - 1473
[3] BESL PJ, 1992, P SOC PHOTO-OPT INS, V1611, P586, DOI 10.1117/12.57955
[4] A morphable model for the synthesis of 3D faces
Blanz, V
Vetter, T
[J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
[5] Multi-NeuS: 3D Head Portraits From Single Image With Neural Implicit Functions
Burkov, Egor
Rakhimov, Ruslan
Safin, Aleksandr
Burnaev, Evgeny
Lempitsky, Victor
[J]. IEEE ACCESS, 2023, 11 : 95681 - 95691
[6] Expression-Aware Face Reconstruction via a Dual-Stream Network
Chai, Xiaoyu
Chen, Jun
Liang, Chao
Xu, Dongshu
Lin, Chia-Wen
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2998 - 3012
[7] Chaudhuri Bindita, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12350), P142, DOI 10.1007/978-3-030-58558-7_9
[8] Joint Face Detection and Facial Motion Retargeting for Multiple Faces
Chaudhuri, Bindita
Vesdapunt, Noranart
Wang, Baoyuan
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9711 - 9720
[9] Cheng S., 2020, P AS C COMP VIS, P188
[10] Capture, Learning, and Synthesis of 3D Speaking Styles
Cudeiro, Daniel
Bolkart, Timo
Laidlaw, Cassidy
Ranjan, Anurag
Black, Michael J.
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10093 - 10103

← 1 2 3 4 5 6 7 →