An Embeddable Implicit IUVD Representation for Part-Based 3D Human Surface Reconstruction

被引:1
作者
Li, Baoxing [1 ]
Deng, Yong [1 ]
Yang, Yehui [1 ]
Zhao, Xu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
Surface reconstruction; Image reconstruction; Surface treatment; Three-dimensional displays; Shape; Clothing; Redundancy; 3D human surface reconstruction; implicit representation; UV map; human body prior; acceleration;
D O I
10.1109/TIP.2024.3430073
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To reconstruct a 3D human surface from a single image, it is crucial to simultaneously consider human pose, shape, and clothing details. Recent approaches have combined parametric body models (such as SMPL), which capture body pose and shape priors, with neural implicit functions that flexibly learn clothing details. However, this combined representation introduces additional computation, e.g. signed distance calculation in 3D body feature extraction, leading to redundancy in the implicit query-and-infer process and failing to preserve the underlying body shape prior. To address these issues, we propose a novel IUVD-Feedback representation, consisting of an IUVD occupancy function and a feedback query algorithm. This representation replaces the time-consuming signed distance calculation with a simple linear transformation in the IUVD space, leveraging the SMPL UV maps. Additionally, it reduces redundant query points through a feedback mechanism, leading to more reasonable 3D body features and more effective query points, thereby preserving the parametric body prior. Moreover, the IUVD-Feedback representation can be embedded into any existing implicit human reconstruction pipeline without requiring modifications to the trained neural networks. Experiments on the THuman2.0 dataset demonstrate that the proposed IUVD-Feedback representation improves the robustness of results and achieves three times faster acceleration in the query-and-infer process. Furthermore, this representation holds potential for generative applications by leveraging its inherent semantic information from the parametric body model.
引用
收藏
页码:4334 / 4347
页数:14
相关论文
共 70 条
  • [1] Photorealistic Monocular 3D Reconstruction of Humans Wearing Clothing
    Alldieck, Thiemo
    Zanfir, Mihai
    Sminchisescu, Cristian
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1496 - 1505
  • [2] imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose
    Alldieck, Thiemo
    Xu, Hongyi
    Sminchisescu, Cristian
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5441 - 5450
  • [3] Tex2Shape: Detailed Full Human Body Geometry From a Single Image
    Alldieck, Thiemo
    Pons-Moll, Gerard
    Theobalt, Christian
    Magnor, Marcus
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2293 - 2303
  • [4] Video Based Reconstruction of 3D People Models
    Alldieck, Thiemo
    Magnor, Marcus
    Xu, Weipeng
    Theobalt, Christian
    Pons-Moll, Gerard
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8387 - 8397
  • [5] Detailed Human Avatars from Monocular Video
    Alldieck, Thiemo
    Magnor, Marcus
    Xu, Weipeng
    Theobalt, Christian
    Pons-Moll, Gerard
    [J]. 2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 98 - 109
  • [6] SCAPE: Shape Completion and Animation of People
    Anguelov, D
    Srinivasan, P
    Koller, D
    Thrun, S
    Rodgers, J
    Davis, J
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 408 - 416
  • [7] Multi-Garment Net: Learning to Dress 3D People from Images
    Bhatnagar, Bharat Lal
    Tiwari, Garvita
    Theobalt, Christian
    Pons-Moll, Gerard
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5419 - 5429
  • [8] BLINN JF, 1976, COMMUN ACM, V19, P542, DOI 10.1145/965143.563322
  • [9] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
    Bogo, Federica
    Kanazawa, Angjoo
    Lassner, Christoph
    Gehler, Peter
    Romero, Javier
    Black, Michael J.
    [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
  • [10] Chan K., 2022, P ADV NEUR INF PROC, P17373