Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement

被引：1

作者：

Qin, Xujia ^{[1
]}

Li, Xinyu ^{[1
]}

Li, Mengjia ^{[1
]}

Zheng, Hongbo ^{[1
]}

Xu, Xiaogang ^{[2
,3
]}

机构：

[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou, Peoples R China

[2] Zhejiang Lab, Inst Artificial Intelligence, Hangzhou, Peoples R China

[3] Zhejiang Gongshang Univ, Coll Comp & Informat Engn, Hangzhou, Peoples R China

来源：

VISUAL COMPUTER | 2025年 / 41卷 / 01期

基金：

中国国家自然科学基金;

关键词：

3D face reconstruction; Attention mechanism; Self-supervised; Attribute refinement; Deep learning; SHAPE;

D O I：

10.1007/s00371-024-03319-0

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Single-view 3D face reconstruction refers to recovering 3D information of a face, such as shape and texture, from a single image. With the wide application of deep learning in the image field, there have been a number of researches using this method to learn the 3D shape and texture of a face from image information. In this paper, we propose a self-supervised single-image 3D face reconstruction method based on the attention mechanism and attribute refinement, which incorporates the attention mechanism in the network structural model, allowing feature extraction to fuse the information of the channel domain and the spatial domain to enhance the feature extraction capability. Joint 2D image-level supervision and supervision between 3D attributes can better learn the 3D model of the face. In this paper, on the basis of using the traditional 2D image supervision, we design a variety of loss functions by combining the cyclic consistency, interpolation consistency, and landmark consistency to realize the 3D attribute level supervision. In order to strengthen the ability to characterize the details of the face, this paper proposes an attribute refinement network to enhance the ability of the model to reconstruct the details and make the reconstruction results more realistic. Based on the symmetry of the face, this paper constructs a deep learning network model to decouple the 3D information directly from the image, and finally realizes unsupervised 3D face reconstruction from a single image.

引用

页码：209 / 227

页数：19

共 53 条

[1] [Anonymous], 2009, P 26 ANN INT C MACHI, DOI [DOI 10.1145/1553374.1553380, 10.1145/1553374.155338]
[2] A morphable model for the synthesis of 3D faces
Blanz, V
Vetter, T
[J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
[3] Large Scale 3D Morphable Models
Booth, James
Roussos, Anastasios
Ponniah, Allan
Dunaway, David
Zafeiriou, Stefanos
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (2-4) : 233 - 254
[4] FaceWarehouse: A 3D Facial Expression Database for Visual Computing
Cao, Chen
Weng, Yanlin
Zhou, Shun
Tong, Yiying
Zhou, Kun
[J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (03) : 413 - 425
[5] Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
Cao, Zhiwen
Liu, Dongfang
Wang, Qifan
Chen, Yingjie
[J]. COMPUTER VISION, ECCV 2022, PT XII, 2022, 13672 : 737 - 753
[6] A Vector-based Representation to Enhance Head Pose Estimation
Cao, Zhiwen
Chu, Zongcheng
Liu, Dongfang
Chen, Yingjie
[J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1187 - 1196
[7] Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set
Deng, Yu
Yang, Jiaolong
Xu, Sicheng
Chen, Dong
Jia, Yunde
Tong, Xin
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 285 - 295
[8] Fast 3D face reconstruction from a single image combining attention mechanism and graph convolutional network
Deng, Zhuoran
Liang, Yan
Pan, Jiahui
Liao, Jiacheng
Hao, Yan
Wen, Xing
[J]. VISUAL COMPUTER, 2023, 39 (11) : 5547 - 5561
[9] Faugeras O., 2001, GEOMETRY MULTIPLE IM, DOI [10.7551/mitpress/3259.001.0001, DOI 10.7551/MITPRESS/3259.001.0001]
[10] Learning an Animatable Detailed 3D Face Model from In-The-Wild Images
Feng, Yao
Feng, Haiwen
Black, Michael J.
Bolkart, Timo
[J]. ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (04):

← 1 2 3 4 5 6 →