Detailed 3D human body reconstruction from multi-view images combining voxel super-resolution and learned implicit representation

被引:15
作者
Li, Zhongguo
Oskarsson, Magnus
Heyden, Anders
机构
[1] Lund, Sweden
关键词
Detailed 3D human body; Implicit representation; Multi-scale features; Multi-view images; Voxel super-resolution; SHAPE;
D O I
10.1007/s10489-021-02783-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of reconstructing detailed 3D human body models from images is interesting but challenging in computer vision due to the high freedom of human bodies. This work proposes a coarse-to-fine method to reconstruct detailed 3D human body from multi-view images combining Voxel Super-Resolution (VSR) based on learning the implicit representation. Firstly, the coarse 3D models are estimated by learning an Pixel-aligned Implicit Function based on Multi-scale Features (MF-PIFu) which are extracted by multi-stage hourglass networks from the multi-view images. Then, taking the low resolution voxel grids which are generated by the coarse 3D models as input, the VSR is implemented by learning an implicit function through a multi-stage 3D convolutional neural network. Finally, the refined detailed 3D human body models can be produced by VSR which can preserve the details and reduce the false reconstruction of the coarse 3D models. Benefiting from the implicit representation, the training process in our method is memory efficient and the detailed 3D human body produced by our method from multi-view images is the continuous decision boundary with high-resolution geometry. In addition, the coarse-to-fine method based on MF-PIFu and VSR can remove false reconstructions and preserve the appearance details in the final reconstruction, simultaneously. In the experiments, our method quantitatively and qualitatively achieves the competitive 3D human body models from images with various poses and shapes on both the real and synthetic datasets.
引用
收藏
页码:6739 / 6759
页数:21
相关论文
共 63 条
[1]   Tex2Shape: Detailed Full Human Body Geometry From a Single Image [J].
Alldieck, Thiemo ;
Pons-Moll, Gerard ;
Theobalt, Christian ;
Magnor, Marcus .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2293-2303
[2]   Video Based Reconstruction of 3D People Models [J].
Alldieck, Thiemo ;
Magnor, Marcus ;
Xu, Weipeng ;
Theobalt, Christian ;
Pons-Moll, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8387-8397
[3]   Detailed Human Avatars from Monocular Video [J].
Alldieck, Thiemo ;
Magnor, Marcus ;
Xu, Weipeng ;
Theobalt, Christian ;
Pons-Moll, Gerard .
2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, :98-109
[4]   SCAPE: Shape Completion and Animation of People [J].
Anguelov, D ;
Srinivasan, P ;
Koller, D ;
Thrun, S ;
Rodgers, J ;
Davis, J .
ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03) :408-416
[5]  
Balan A.O., 2007, P IEEE C COMP VIS PA, P1, DOI DOI 10.1109/CVPR.2007.383340
[6]   Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image [J].
Bogo, Federica ;
Kanazawa, Angjoo ;
Lassner, Christoph ;
Gehler, Peter ;
Romero, Javier ;
Black, Michael J. .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :561-578
[7]   Detailed Full-Body Reconstructions of Moving People from Monocular RGB-D Sequences [J].
Bogo, Federica ;
Black, Michael J. ;
Loper, Matthew ;
Romero, Javier .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2300-2308
[8]   OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J].
Cao, Zhe ;
Hidalgo, Gines ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) :172-186
[9]   Learning Implicit Fields for Generative Shape Modeling [J].
Chen, Zhiqin ;
Zhang, Hao .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5932-5941
[10]   Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion [J].
Chibane, Julian ;
Alldieck, Thiemo ;
Pons-Moll, Gerard .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6968-6979