FaceScape: 3D Facial Dataset and Benchmark for Single-View 3D Face Reconstruction

被引:8
作者
Zhu, Hao [1 ]
Yang, Haotian [1 ]
Guo, Longwei [1 ]
Zhang, Yidi [1 ]
Wang, Yanru [1 ]
Huang, Mingkai [1 ]
Wu, Menghua [1 ]
Shen, Qiu [1 ]
Yang, Ruigang [2 ]
Cao, Xun [1 ]
机构
[1] Nanjing Univ, Nanjing 210093, Peoples R China
[2] Univ Kentucky, Lexington, KY 40506 USA
基金
中国国家自然科学基金;
关键词
3D face reconstruction; 3D morphable model; benchmark; dataset; DATABASE; IMAGE; GEOMETRY; ROBUST;
D O I
10.1109/TPAMI.2023.3307338
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inthis article, we present a large-scale detailed 3Dface dataset, FaceScape, and the corresponding benchmark to evaluate single-view facial 3D reconstruction. By training on FaceScape data, a novel algorithm is proposed to predict elaborate riggable 3D face models from a single image input. FaceScape dataset releases 16,940 textured 3D faces, captured from847 subjects and each with 20 specific expressions. The 3Dmodels contain the pore-level facial geometry that is also processed to be topologically uniform. These fine 3D facial models can be represented as a 3D morphable model for coarse shapes and displacement maps for detailed geometry. Taking advantage of the large-scale and high-accuracy dataset, a novel algorithm is further proposed to learn the expressionspecific dynamic details using a deep neural network. The learned relationship serves as the foundation of our 3D face prediction system from a single image input. Different from most previous methods, our predicted 3D models are riggable with highly detailed geometry under different expressions. We also use FaceScape data to generate the in-the-wild and in-the-lab benchmark to evaluate recent methods of single-view face reconstruction. The accuracy is reported and analyzed on the dimensions of camera pose and focal length, which provides a faithful and comprehensive evaluation and reveals new challenges. The unprecedented dataset, benchmark, and code have been released to the public for research purpose.
引用
收藏
页码:14528 / 14545
页数:18
相关论文
共 112 条
[1]   Multilinear Autoencoder for 3D Face Model Learning [J].
Abrevaya, Victoria Fernandez ;
Wuhrer, Stefanie ;
Boyer, Edmond .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :1-9
[2]   The Digital Emily Project: Achieving a Photorealistic Digital Actor [J].
Alexander, Oleg ;
Rogers, Mike ;
Lambeth, William ;
Chiang, Jen-Yuan ;
Ma, Wan-Chun ;
Wang, Chuan-Chang ;
Debevec, Paul .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2010, 30 (04) :20-31
[3]  
Amberg B, 2007, IEEE I CONF COMP VIS, P1326
[4]   Extreme 3D Face Reconstruction: Seeing Through Occlusions [J].
Anh Tuan Tran ;
Hassner, Tal ;
Masi, Iacopo ;
Paz, Eran ;
Nirkin, Yuval ;
Medioni, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3935-3944
[5]  
[Anonymous], 2006, Face Processing
[6]   LEAST-SQUARES FITTING OF 2 3-D POINT SETS [J].
ARUN, KS ;
HUANG, TS ;
BLOSTEIN, SD .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (05) :699-700
[7]   Modeling Facial Geometry using Compositional VAEs [J].
Bagautdinov, Timur ;
Wu, Chenglei ;
Saragih, Jason ;
Fua, Pascal ;
Sheikh, Yaser .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3877-3886
[8]  
Bagdanov AD, 2011, P 2011 JOINT ACM WOR, P79
[9]  
Baocai Yin., 2009, J. Comput. Res. Develop., V6, P020
[10]   High-Quality Single-Shot Capture of Facial Geometry [J].
Beeler, Thabo ;
Bickel, Bernd ;
Beardsley, Paul ;
Sumner, Bob ;
Gross, Markus .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04)