Single-Image 3D Human Pose and Shape Estimation Enhanced by Clothed 3D Human Reconstruction

被引：0

作者：

Liu, Leyuan ^{[1
,2
]}

Gao, Yunqi ^{[1
]}

Sun, Jianchi ^{[1
]}

Chen, Jingying ^{[1
,2
]}

机构：

[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan, Peoples R China

[2] Cent China Normal Univ, Natl Engn Lab Educ Big Data, Wuhan, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023 | 2024年 / 1998卷

基金：

中国国家自然科学基金;

关键词：

3D Human Pose and Shape Estimation; Clothed 3D Human Reconstruction; Graph Convolutional Network; SMPL Parameter Regression;

D O I：

10.1007/978-981-99-9109-9_4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D human pose and shape estimation and clothed 3D human reconstruction are two hot topics in the community of computer vision. 3D human pose and shape estimation aims to estimate the 3D poses and body shapes of "naked" humans under clothes, while clothed 3D human reconstruction refers to reconstructing the surfaces of humans wearing clothes. These two topics are closely related, but researchers usually study them separately. In this paper, we enhance the accuracy of the 3D human pose and body shape estimation by the reconstructed clothed 3D human models. Our method consists of two main components: the 3D body mesh recovery module and the clothed 3D human reconstruction module. In the 3D body mesh recovery module, an intermediate 3D body mesh is first recovered from the input image by a graph convolutional network (GCN), and then the 3D body pose and shape parameters are estimated by a regressor. In the clothed human reconstruction module, two clothed human surface models are respectively reconstructed under the guidance of the recovered 3D body mesh and the ground-truth 3D body mesh. At the training phase, losses which are described by the residuals among the two reconstructed clothed human models and ground truth are passed back into the 3D body mesh recovery module and used for boosting the body mesh recovery module. The quantitative and qualitative experimental results on THuman2.0, and LSP show that our method outperforms the current state-of-the-art 3D human pose and shape estimation methods.

引用

页码：33 / 44

页数：12

共 28 条

[1] Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image
Bogo, Federica
Kanazawa, Angjoo
Lassner, Christoph
Gehler, Peter
Romero, Javier
Black, Michael J.
[J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 561 - 578
[2] Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes
Choi, Hongsuk
Moon, Gyeongsik
Park, JoonKyu
Lee, Kyoung Mu
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1465 - 1474
[3] Choi Hongsuk, 2020, EUR C COMP VIS, V16, P769, DOI DOI 10.1007/978-3-030-58571-6_45
[4] Identity Mappings in Deep Residual Networks
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 630 - 645
[5] ARCH plus plus : Animation-Ready Clothed Human Reconstruction Revisited
He, Tong
Xu, Yuanlu
Saito, Shunsuke
Soatto, Stefano
Tung, Tony
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11026 - 11036
[6] Towards Accurate Marker-less Human Shape and Pose Estimation over Time
Huang, Yinghao
Bogo, Federica
Lassner, Christoph
Kanazawa, Angjoo
Gehler, Peter, V
Romero, Javier
Akhter, Ijaz
Black, Michael J.
[J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 421 - 430
[7] Johnson Sam, 2010, BRIT MACH VIS C, V2, P5
[8] End-to-end Recovery of Human Shape and Pose
Kanazawa, Angjoo
Black, Michael J.
Jacobs, David W.
Malik, Jitendra
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7122 - 7131
[9] Kingma D.P., 2014, arXiv, DOI [DOI 10.48550/ARXIV.1412.6980, 10.48550/arXiv.1412.6980]
[10] Kipf T. N., 2017, ICLR POSTER, DOI DOI 10.48550/ARXIV.1609.02907

← 1 2 3 →