Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement

被引:1
作者
Qin, Xujia [1 ]
Li, Xinyu [1 ]
Li, Mengjia [1 ]
Zheng, Hongbo [1 ]
Xu, Xiaogang [2 ,3 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] Zhejiang Lab, Inst Artificial Intelligence, Hangzhou, Peoples R China
[3] Zhejiang Gongshang Univ, Coll Comp & Informat Engn, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
3D face reconstruction; Attention mechanism; Self-supervised; Attribute refinement; Deep learning; SHAPE;
D O I
10.1007/s00371-024-03319-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Single-view 3D face reconstruction refers to recovering 3D information of a face, such as shape and texture, from a single image. With the wide application of deep learning in the image field, there have been a number of researches using this method to learn the 3D shape and texture of a face from image information. In this paper, we propose a self-supervised single-image 3D face reconstruction method based on the attention mechanism and attribute refinement, which incorporates the attention mechanism in the network structural model, allowing feature extraction to fuse the information of the channel domain and the spatial domain to enhance the feature extraction capability. Joint 2D image-level supervision and supervision between 3D attributes can better learn the 3D model of the face. In this paper, on the basis of using the traditional 2D image supervision, we design a variety of loss functions by combining the cyclic consistency, interpolation consistency, and landmark consistency to realize the 3D attribute level supervision. In order to strengthen the ability to characterize the details of the face, this paper proposes an attribute refinement network to enhance the ability of the model to reconstruct the details and make the reconstruction results more realistic. Based on the symmetry of the face, this paper constructs a deep learning network model to decouple the 3D information directly from the image, and finally realizes unsupervised 3D face reconstruction from a single image.
引用
收藏
页码:209 / 227
页数:19
相关论文
共 53 条
  • [1] [Anonymous], 2009, P 26 ANN INT C MACHI, DOI [DOI 10.1145/1553374.1553380, 10.1145/1553374.155338]
  • [2] A morphable model for the synthesis of 3D faces
    Blanz, V
    Vetter, T
    [J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
  • [3] Large Scale 3D Morphable Models
    Booth, James
    Roussos, Anastasios
    Ponniah, Allan
    Dunaway, David
    Zafeiriou, Stefanos
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (2-4) : 233 - 254
  • [4] FaceWarehouse: A 3D Facial Expression Database for Visual Computing
    Cao, Chen
    Weng, Yanlin
    Zhou, Shun
    Tong, Yiying
    Zhou, Kun
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (03) : 413 - 425
  • [5] Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
    Cao, Zhiwen
    Liu, Dongfang
    Wang, Qifan
    Chen, Yingjie
    [J]. COMPUTER VISION, ECCV 2022, PT XII, 2022, 13672 : 737 - 753
  • [6] A Vector-based Representation to Enhance Head Pose Estimation
    Cao, Zhiwen
    Chu, Zongcheng
    Liu, Dongfang
    Chen, Yingjie
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1187 - 1196
  • [7] Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set
    Deng, Yu
    Yang, Jiaolong
    Xu, Sicheng
    Chen, Dong
    Jia, Yunde
    Tong, Xin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 285 - 295
  • [8] Fast 3D face reconstruction from a single image combining attention mechanism and graph convolutional network
    Deng, Zhuoran
    Liang, Yan
    Pan, Jiahui
    Liao, Jiacheng
    Hao, Yan
    Wen, Xing
    [J]. VISUAL COMPUTER, 2023, 39 (11) : 5547 - 5561
  • [9] Faugeras O., 2001, GEOMETRY MULTIPLE IM, DOI [10.7551/mitpress/3259.001.0001, DOI 10.7551/MITPRESS/3259.001.0001]
  • [10] Learning an Animatable Detailed 3D Face Model from In-The-Wild Images
    Feng, Yao
    Feng, Haiwen
    Black, Michael J.
    Bolkart, Timo
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04):