Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency

被引:88
作者
Shang, Jiaxiang [1 ]
Shen, Tianwei [1 ]
Li, Shiwei [1 ]
Zhou, Lei [1 ]
Zhen, Mingmin [1 ]
Fang, Tian [2 ]
Quan, Long [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
[2] Everest Innovat Technol, Kowloon, Hong Kong, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT XV | 2020年 / 12360卷
关键词
3D face reconstruction; Multi-view geometry consistency; MORPHABLE MODEL; IMAGE; EDGES;
D O I
10.1007/978-3-030-58555-6_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent learning-based approaches, in which models are trained by single-view images have shown promising results for monocular 3D face reconstruction, but they suffer from the ill-posed face pose and depth ambiguity issue. In contrast to previous works that only enforce 2D feature constraints, we propose a self-supervised training architecture by leveraging the multi-view geometry consistency, which provides reliable constraints on face pose and depth estimation. We first propose an occlusion-aware view synthesis method to apply multi-view geometry consistency to self-supervised learning. Then we design three novel loss functions for multi-view consistency, including the pixel consistency loss, the depth consistency loss, and the facial landmark-based epipolar loss. Our method is accurate and robust, especially under large variations of expressions, poses, and illumination conditions. Comprehensive experiments on the face alignment and 3D face reconstruction benchmarks have demonstrated superiority over state-of-the-art methods. Our code and model are released in https://github.com/ jiaxiangshang/MGCNet.
引用
收藏
页码:53 / 70
页数:18
相关论文
共 67 条
[1]  
Abadi Martin, 2016, Proceedings of OSDI '16: 12th USENIX Symposium on Operating Systems Design and Implementation. OSDI '16, P265
[2]   Inverse Rendering of Faces with a 3D Morphable Model [J].
Aldrian, Oswald ;
Smith, William A. P. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (05) :1080-1093
[3]   Extreme 3D Face Reconstruction: Seeing Through Occlusions [J].
Anh Tuan Tran ;
Hassner, Tal ;
Masi, Iacopo ;
Paz, Eran ;
Nirkin, Yuval ;
Medioni, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3935-3944
[4]  
[Anonymous], 2011, Visualsfm: A visual structure from motion system
[5]  
[Anonymous], 2003, Multiple View Geometry in Computer Vision
[6]  
Bagdanov A.D., 2011, P 2011 JOINT ACM WOR, P79, DOI 10.1145/2072572.2072597
[7]   Fitting a 3D Morphable Model to Edges: A Comparison Between Hard and Soft Correspondences [J].
Bas, Anil ;
Smith, William A. P. ;
Bolkart, Timo ;
Wuhrer, Stefanie .
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II, 2017, 10117 :377-391
[8]  
BERGEN JR, 1992, LECT NOTES COMPUT SC, V588, P237
[9]   Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses [J].
Bhagavatula, Chandrasekhar ;
Zhu, Chenchen ;
Luu, Khoa ;
Savvides, Marios .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4000-4009
[10]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194