Semi-Supervised Monocular 3D Face Reconstruction With End-to-End Shape-Preserved Domain Transfer

被引:24
作者
Piao, Jingtan [1 ]
Qian, Chen
Li, Hongsheng
机构
[1] Chinese Univ Hong Kong, CUHK SenseTime Joint Lab, Hong Kong, Peoples R China
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00949
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular face reconstruction is a challenging task in computer vision, which aims to recover 3D face geometry from a single RGB face image. Recently, deep learning methods have achieved great improvements on monocular face reconstruction. However, for these methods to reach optimal performance, it is paramount to have large-scale training images with ground-truth 3D face geometry, which is generally difficult for human to annotate. To tackle this problem, we propose a semi-supervised monocular reconstruction method, which jointly optimizes a shape-preserved domain-transfer CycleGAN and a shape estimation network. The framework is semi-supervisely trained with 3D rendered images with ground-truth shapes and in-the-wild face images without any extra annotation. The CycleGAN network transforms all realistic images into rendered style and is end-to-end trained in the overall framework. This is the key difference compared with existing CycleGAN-based learning methods, which just used CycleGAN as a separate training sample generator. Novel landmark consistency loss and edge-aware shape estimation loss are proposed for our two networks to jointly solve the challenging face reconstruction problem. Experiments on public face reconstruction datasets demonstrate the effectiveness of our overall method as well as the individual components.
引用
收藏
页码:9397 / 9406
页数:10
相关论文
共 27 条
[1]  
[Anonymous], 2011, P IEEE INT C COMPUTE, DOI DOI 10.1109/ICCVW.2011.6130513
[2]  
[Anonymous], 2017, ICCV
[3]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[4]  
[Anonymous], 2017, P ICCV
[5]  
Bagdanov A. D., 2011, JOINT ACM WORKSH HUM, P79
[6]  
Bhagavatula Chandrasekhar, 2017, P IEEE INT C COMP VI, P3980
[7]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[8]   Reanimating faces in images and video [J].
Blanz, V ;
Basso, C ;
Poggio, T ;
Vetter, T .
COMPUTER GRAPHICS FORUM, 2003, 22 (03) :641-650
[9]  
Blinn J. F., 1977, ACM SIGGRAPH COMPUTE, V11, P192, DOI DOI 10.1145/965141.563893
[10]   Displaced Dynamic Expression Regression for Real-time Facial Tracking and Animation [J].
Cao, Chen ;
Hou, Qiming ;
Zhou, Kun .
ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (04)