Single-Image 3D Human Pose and Shape Estimation Enhanced by Clothed 3D Human Reconstruction

被引:0
作者
Liu, Leyuan [1 ,2 ]
Gao, Yunqi [1 ]
Sun, Jianchi [1 ]
Chen, Jingying [1 ,2 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Peoples R China
[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan, Peoples R China
来源
ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023 | 2024年 / 1998卷
基金
中国国家自然科学基金;
关键词
3D Human Pose and Shape Estimation; Clothed 3D Human Reconstruction; Graph Convolutional Network; SMPL Parameter Regression;
D O I
10.1007/978-981-99-9109-9_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D human pose and shape estimation and clothed 3D human reconstruction are two hot topics in the community of computer vision. 3D human pose and shape estimation aims to estimate the 3D poses and body shapes of "naked" humans under clothes, while clothed 3D human reconstruction refers to reconstructing the surfaces of humans wearing clothes. These two topics are closely related, but researchers usually study them separately. In this paper, we enhance the accuracy of the 3D human pose and body shape estimation by the reconstructed clothed 3D human models. Our method consists of two main components: the 3D body mesh recovery module and the clothed 3D human reconstruction module. In the 3D body mesh recovery module, an intermediate 3D body mesh is first recovered from the input image by a graph convolutional network (GCN), and then the 3D body pose and shape parameters are estimated by a regressor. In the clothed human reconstruction module, two clothed human surface models are respectively reconstructed under the guidance of the recovered 3D body mesh and the ground-truth 3D body mesh. At the training phase, losses which are described by the residuals among the two reconstructed clothed human models and ground truth are passed back into the 3D body mesh recovery module and used for boosting the body mesh recovery module. The quantitative and qualitative experimental results on THuman2.0, and LSP show that our method outperforms the current state-of-the-art 3D human pose and shape estimation methods.
引用
收藏
页码:33 / 44
页数:12
相关论文
共 28 条
  • [1] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
    Bogo, Federica
    Kanazawa, Angjoo
    Lassner, Christoph
    Gehler, Peter
    Romero, Javier
    Black, Michael J.
    [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
  • [2] Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes
    Choi, Hongsuk
    Moon, Gyeongsik
    Park, JoonKyu
    Lee, Kyoung Mu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1465 - 1474
  • [3] Choi Hongsuk, 2020, EUR C COMP VIS, V16, P769, DOI DOI 10.1007/978-3-030-58571-6_45
  • [4] Identity Mappings in Deep Residual Networks
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645
  • [5] ARCH plus plus : Animation-Ready Clothed Human Reconstruction Revisited
    He, Tong
    Xu, Yuanlu
    Saito, Shunsuke
    Soatto, Stefano
    Tung, Tony
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11026 - 11036
  • [6] Towards Accurate Marker-less Human Shape and Pose Estimation over Time
    Huang, Yinghao
    Bogo, Federica
    Lassner, Christoph
    Kanazawa, Angjoo
    Gehler, Peter, V
    Romero, Javier
    Akhter, Ijaz
    Black, Michael J.
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 421 - 430
  • [7] Johnson Sam, 2010, BRIT MACH VIS C, V2, P5
  • [8] End-to-end Recovery of Human Shape and Pose
    Kanazawa, Angjoo
    Black, Michael J.
    Jacobs, David W.
    Malik, Jitendra
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7122 - 7131
  • [9] Kingma D.P., 2014, arXiv, DOI [DOI 10.48550/ARXIV.1412.6980, 10.48550/arXiv.1412.6980]
  • [10] Kipf T. N., 2017, ICLR POSTER, DOI DOI 10.48550/ARXIV.1609.02907