Fast 3D face reconstruction from a single image combining attention mechanism and graph convolutional network

Cited by: 11
Authors
Deng, Zhuoran [1]
Liang, Yan [1]
Pan, Jiahui [1]
Liao, Jiacheng [1]
Hao, Yan [1]
Wen, Xing [1]
Affiliations
[1] South China Normal University, School of Software, Foshan 528225, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
3D face reconstruction; Lightweight network; Attention mechanism; Graph convolutional network
DOI
10.1007/s00371-022-02679-9
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
In recent years, the rapid development of deep learning has driven significant progress in 3D face reconstruction. However, learning-based methods often incur high time and memory costs, and simply removing network layers hardly solves the problem. In this study, we propose a solution that achieves fast and robust 3D face reconstruction from a single image without requiring accurate 3D data for training. To increase speed, we use a lightweight network as the facial feature extractor. As a result, our method reduces the reliance on graphics processing units (GPUs), allowing fast inference on central processing units (CPUs) alone. To maintain robustness, we combine an attention mechanism and a graph convolutional network in parameter regression to concentrate on facial details. We experiment with different combinations of three loss functions to obtain the best results. In comparative experiments, we evaluate the proposed method against state-of-the-art methods on 3D face reconstruction and sparse face alignment. Experiments on a variety of datasets validate the effectiveness of our method.
Pages: 5547-5561
Number of pages: 15