Faster, Better and More Detailed: 3D Face Reconstruction with Graph Convolutional Networks

被引:2
作者
Cheng, Shiyang [1 ]
Tzimiropoulos, Georgios [1 ,3 ]
Shen, Jie [2 ]
Pantic, Maja [2 ]
机构
[1] Samsung AI Ctr, Cambridge, England
[2] Imperial Coll London, London, England
[3] Queen Mary Univ London, London, England
来源
COMPUTER VISION - ACCV 2020, PT V | 2021年 / 12626卷
关键词
D O I
10.1007/978-3-030-69541-5_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of 3D face reconstruction from a single image. While available solutions for addressing this problem do exist, to our knowledge, we propose the very first approach which is robust, lightweight and detailed i.e. it can reconstruct fine facial details. Our method is extremely simple and consists of 3 key components: (a) a lightweight non-parametric decoder based on Graph Convolutional Networks (GCNs) trained in a supervised manner to reconstruct coarse facial geometry from image-based ResNet features. (b) An extremely lightweight (35K parameters) subnetwork - also based on GCNs - which is trained in an unsupervised manner to refine the output of the first network. (c) A novel feature-sampling mechanism and adaptation layer which injects fine details from the ResNet features of the first network into the second one. Overall, our method is the first one (to our knowledge) to reconstruct detailed facial geometry relying solely on GCNs. We exhaustively compare our method with 7 state-of-the-art methods on 3 datasets reporting state-of-the-art results for all of our experiments, both qualitatively and quantitatively, with our approach being, at the same time, significantly faster.
引用
收藏
页码:188 / 205
页数:18
相关论文
共 57 条
[1]  
Abadi Martin, 2016, Proceedings of OSDI '16: 12th USENIX Symposium on Operating Systems Design and Implementation. OSDI '16, P265
[2]   Cross-modal Deep Face Normals with Deactivable Skip Connections [J].
Abrevaya, Victoria Fernandez ;
Boukhayma, Adnane ;
Torr, Philip H. S. ;
Boyer, Edmond .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4978-4988
[3]   Extreme 3D Face Reconstruction: Seeing Through Occlusions [J].
Anh Tuan Tran ;
Hassner, Tal ;
Masi, Iacopo ;
Paz, Eran ;
Nirkin, Yuval ;
Medioni, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3935-3944
[4]  
Bagdanov A.D., 2011, P ACM MULT INT WORKS
[5]  
Bai S, 2020, Arxiv, DOI arXiv:1901.08150
[6]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[7]   Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation [J].
Bouritsas, Giorgos ;
Bokhnyak, Sergiy ;
Ploumpis, Stylianos ;
Bronstein, Michael ;
Zafeiriou, Stefanos .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7212-7221
[8]   Photo-Realistic Facial Details Synthesis From Single Image [J].
Chen, Anpei ;
Chen, Zhang ;
Zhang, Guli ;
Mitchell, Kenny ;
Yu, Jingyi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9428-9438
[9]  
Cheng SY, 2019, Arxiv, DOI arXiv:1903.10384
[10]   4DFAB: A Large Scale 4D Database for Facial Expression Analysis and Biometric Applications [J].
Cheng, Shiyang ;
Kotsia, Irene ;
Pantic, Maja ;
Zafeiriou, Stefanos .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5117-5126