3D human pose and shape estimation with dense correspondence from a single depth image

被引：6

作者：

Wang, Kangkan ^{[1
,2
]}

Zhang, Guofeng ^{[3
]}

Yang, Jian ^{[1
,2
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ,PCA Lab, Nanjing 210094, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing 210094, Peoples R China

[3] Zhejiang Univ, State Key Lab CAD&CG, Zijingang Campus, Hangzhou 310058, Peoples R China

来源：

VISUAL COMPUTER | 2023年 / 39卷 / 01期

关键词：

3D human pose and shape; Dense correspondence; 3D model fitting; Depth image; Deep learning;

D O I：

10.1007/s00371-021-02339-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

We propose a novel approach to estimate the 3D pose and shape of human bodies with dense correspondence from a single depth image. In contrast to most current 3D body model recovery methods from depth images that employ motion information of depth sequences to compute point correspondences, we reconstruct 3D human body models from a single depth image by combining the correspondence learning and the parametric model fitting. Specifically, a novel multi-view coarse-to-fine correspondence network is proposed by projecting a 3D template model into multi-view depth images. The proposed correspondence network can predict 2D flows of the input depth relative to each projected depth in a coarse-to-fine manner. The predicted multi-view flows are then aggregated to establish accurate dense point correspondences between the 3D template and the input depth with the known 3D-to-2D projection. Based on the learnt correspondences, the 3D human pose and shape represented by a parametric 3D body model are recovered through a model fitting method that incorporates an adversarial prior. We conduct extensive experiments on SURREAL, Human3.6M, DFAUST, and real depth data of human bodies. The experimental results demonstrate that the proposed method outperforms the state-of-the-art methods in terms of reconstruction accuracy.

引用

页码：429 / 441

页数：13

共 50 条

[21] Dense 3D Face Correspondence
Gilani, Syed Zulqarnain
Mian, Ajmal
Shafait, Faisal
Reid, Ian
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (07) : 1584 - 1598
[22] Hierarchically constrained 3D hand pose estimation using regression forests from single frame depth data
Kirac, Furkan
Kara, Yunus Emre
Akarun, Lale
PATTERN RECOGNITION LETTERS, 2014, 50 : 91 - 100
[23] Fusing information from multiple 2D depth cameras for 3D human pose estimation in the operating room
Lasse Hansen
Marlin Siebert
Jasper Diesel
Mattias P. Heinrich
International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 1871 - 1879
[24] Fusing information from multiple 2D depth cameras for 3D human pose estimation in the operating room
Hansen, Lasse
Siebert, Marlin
Diesel, Jasper
Heinrich, Mattias P.
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (11) : 1871 - 1879
[25] 3D human modeling from a single depth image dealing with self-occlusion
Jang, In Yeop
Cho, Ji-Ho
Lee, Kwan H.
MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (01) : 267 - 288
[26] 3D human modeling from a single depth image dealing with self-occlusion
In Yeop Jang
Ji-Ho Cho
Kwan H. Lee
Multimedia Tools and Applications, 2012, 58 : 267 - 288
[27] 3D terrain estimation from a single landscape image
Takahashi, Haruka
Kanamori, Yoshihiro
Endo, Yuki
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (06)
[28] Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation
Liu, Zhiwei
Zhu, Xiangyu
Yang, Lu
Yan, Xiang
Tang, Ming
Lei, Zhen
Zhu, Guibo
Feng, Xuetao
Wang, Yan
Wang, Jinqiao
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1976 - 1984
[29] Deep neural networks for human pose estimation from a very low resolution depth image
Piotr Szczuko
Multimedia Tools and Applications, 2019, 78 : 29357 - 29377
[30] Deep neural networks for human pose estimation from a very low resolution depth image
Szczuko, Piotr
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (20) : 29357 - 29377

← 1 2 3 4 5 →