SeIF: Semantic-Constrained Deep Implicit Function for Single-Image 3D Head Reconstruction

被引:1
作者
Liu, Leyuan [1 ,2 ]
Liu, Xu [1 ,2 ]
Sun, Jianchi [3 ]
Gao, Changxin [4 ]
Chen, Jingying [1 ,2 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China
[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan 430079, Peoples R China
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[4] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430073, Peoples R China
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Image reconstruction; Semantics; Solid modeling; Faces; Codes; Fires; 3D head reconstruction; 3D morphable model; deep implicit function; model animation; semantic constraint; FACE RECONSTRUCTION;
D O I
10.1109/TMM.2024.3405721
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Various applications require realistic, artifact-free, and animatable 3D avatars. However, traditional 3D morphable models (3DMMs) produce animatable 3D heads but fail to capture accurate geometries and details, while existing deep implicit functions have been shown to achieve realistic reconstructions but suffer from artifacts and struggle to yield 3D heads that are easy to animate. To reconstruct high-fidelity, artifact-less, and animatable 3D heads from single-view images, we leverage semantics to bridge the best properties of 3DMMs and deep implicit functions and propose SeIF-a semantic-constrained deep implicit function. First, SeIF derives fine-grained semantics from a standard 3DMM (e.g., FLAME) and samples a semantic code for each query point in the query space to provide a soft constraint to the deep implicit function. The reconstruction results show that this semantic constraint does not weaken the powerful representation ability of the deep implicit function while significantly suppressing artifacts. Second, SeIF predicts a more accurate semantic code for each query point and utilizes the semantic codes to uniformize the structure of reconstructed 3D head meshes with the standard 3DMM. Since our reconstructed 3D head meshes have the same structure as the 3DMM, 3DMM-based animation approaches can be easily transferred to animate our reconstructed 3D heads. As a result, SeIF can reconstruct high-fidelity, artifact-less, and animatable 3D heads from single-view images of individuals with diverse ages, genders, races, and facial expressions. Quantitative and qualitative experimental results on seven datasets show that SeIF outperforms existing state-of-the-art methods by a large margin.
引用
收藏
页码:10106 / 10120
页数:15
相关论文
共 61 条
  • [1] High-Fidelity 3D Digital Human Head Creation from RGB-D Selfies
    Bao, Linchao
    Lin, Xiangkai
    Chen, Yajing
    Zhang, Haoxian
    Wang, Sheng
    Zhe, Xuefei
    Kang, Di
    Huang, Haozhi
    Jiang, Xinwei
    Wang, Jue
    Yu, Dong
    Zhang, Zhengyou
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (01):
  • [2] What Does 2D Geometric Information Really Tell Us About 3D Face Shape?
    Bas, Anil
    Smith, William A. P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (10) : 1455 - 1473
  • [3] BESL PJ, 1992, P SOC PHOTO-OPT INS, V1611, P586, DOI 10.1117/12.57955
  • [4] A morphable model for the synthesis of 3D faces
    Blanz, V
    Vetter, T
    [J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
  • [5] Multi-NeuS: 3D Head Portraits From Single Image With Neural Implicit Functions
    Burkov, Egor
    Rakhimov, Ruslan
    Safin, Aleksandr
    Burnaev, Evgeny
    Lempitsky, Victor
    [J]. IEEE ACCESS, 2023, 11 : 95681 - 95691
  • [6] Expression-Aware Face Reconstruction via a Dual-Stream Network
    Chai, Xiaoyu
    Chen, Jun
    Liang, Chao
    Xu, Dongshu
    Lin, Chia-Wen
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2998 - 3012
  • [7] Chaudhuri Bindita, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12350), P142, DOI 10.1007/978-3-030-58558-7_9
  • [8] Joint Face Detection and Facial Motion Retargeting for Multiple Faces
    Chaudhuri, Bindita
    Vesdapunt, Noranart
    Wang, Baoyuan
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9711 - 9720
  • [9] Cheng S., 2020, P AS C COMP VIS, P188
  • [10] Capture, Learning, and Synthesis of 3D Speaking Styles
    Cudeiro, Daniel
    Bolkart, Timo
    Laidlaw, Cassidy
    Ranjan, Anurag
    Black, Michael J.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10093 - 10103