An Embeddable Implicit IUVD Representation for Part-Based 3D Human Surface Reconstruction

Cited by: 1

Authors:

Li, Baoxing [1]

Deng, Yong [1]

Yang, Yehui [1]

Zhao, Xu [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024 / Vol. 33

Keywords:

Surface reconstruction; Image reconstruction; Surface treatment; Three-dimensional displays; Shape; Clothing; Redundancy; 3D human surface reconstruction; implicit representation; UV map; human body prior; acceleration;

DOI:

10.1109/TIP.2024.3430073

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To reconstruct a 3D human surface from a single image, it is crucial to simultaneously consider human pose, shape, and clothing details. Recent approaches have combined parametric body models (such as SMPL), which capture body pose and shape priors, with neural implicit functions that flexibly learn clothing details. However, this combined representation introduces additional computation, such as the signed distance calculation in 3D body feature extraction, which leads to redundancy in the implicit query-and-infer process and fails to preserve the underlying body shape prior. To address these issues, we propose a novel IUVD-Feedback representation, consisting of an IUVD occupancy function and a feedback query algorithm. This representation replaces the time-consuming signed distance calculation with a simple linear transformation in the IUVD space, leveraging the SMPL UV maps. Additionally, it reduces redundant query points through a feedback mechanism, leading to more reasonable 3D body features and more effective query points, thereby preserving the parametric body prior. Moreover, the IUVD-Feedback representation can be embedded into any existing implicit human reconstruction pipeline without requiring modifications to the trained neural networks. Experiments on the THuman2.0 dataset demonstrate that the proposed IUVD-Feedback representation improves the robustness of results and accelerates the query-and-infer process by a factor of three. Furthermore, this representation holds potential for generative applications by leveraging the semantic information inherent in the parametric body model.


Pages: 4334-4347

Page count: 14
