3D human pose and shape estimation with dense correspondence from a single depth image

被引:6
|
作者
Wang, Kangkan [1 ,2 ]
Zhang, Guofeng [3 ]
Yang, Jian [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Key Lab Intelligent Percept & Syst High Dimens In, Minist Educ,PCA Lab, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing 210094, Peoples R China
[3] Zhejiang Univ, State Key Lab CAD&CG, Zijingang Campus, Hangzhou 310058, Peoples R China
关键词
3D human pose and shape; Dense correspondence; 3D model fitting; Depth image; Deep learning;
D O I
10.1007/s00371-021-02339-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We propose a novel approach to estimate the 3D pose and shape of human bodies with dense correspondence from a single depth image. In contrast to most current 3D body model recovery methods from depth images that employ motion information of depth sequences to compute point correspondences, we reconstruct 3D human body models from a single depth image by combining the correspondence learning and the parametric model fitting. Specifically, a novel multi-view coarse-to-fine correspondence network is proposed by projecting a 3D template model into multi-view depth images. The proposed correspondence network can predict 2D flows of the input depth relative to each projected depth in a coarse-to-fine manner. The predicted multi-view flows are then aggregated to establish accurate dense point correspondences between the 3D template and the input depth with the known 3D-to-2D projection. Based on the learnt correspondences, the 3D human pose and shape represented by a parametric 3D body model are recovered through a model fitting method that incorporates an adversarial prior. We conduct extensive experiments on SURREAL, Human3.6M, DFAUST, and real depth data of human bodies. The experimental results demonstrate that the proposed method outperforms the state-of-the-art methods in terms of reconstruction accuracy.
引用
收藏
页码:429 / 441
页数:13
相关论文
共 50 条
  • [21] Dense 3D Face Correspondence
    Gilani, Syed Zulqarnain
    Mian, Ajmal
    Shafait, Faisal
    Reid, Ian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (07) : 1584 - 1598
  • [22] Hierarchically constrained 3D hand pose estimation using regression forests from single frame depth data
    Kirac, Furkan
    Kara, Yunus Emre
    Akarun, Lale
    PATTERN RECOGNITION LETTERS, 2014, 50 : 91 - 100
  • [23] Fusing information from multiple 2D depth cameras for 3D human pose estimation in the operating room
    Lasse Hansen
    Marlin Siebert
    Jasper Diesel
    Mattias P. Heinrich
    International Journal of Computer Assisted Radiology and Surgery, 2019, 14 : 1871 - 1879
  • [24] Fusing information from multiple 2D depth cameras for 3D human pose estimation in the operating room
    Hansen, Lasse
    Siebert, Marlin
    Diesel, Jasper
    Heinrich, Mattias P.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2019, 14 (11) : 1871 - 1879
  • [25] 3D human modeling from a single depth image dealing with self-occlusion
    Jang, In Yeop
    Cho, Ji-Ho
    Lee, Kwan H.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (01) : 267 - 288
  • [26] 3D human modeling from a single depth image dealing with self-occlusion
    In Yeop Jang
    Ji-Ho Cho
    Kwan H. Lee
    Multimedia Tools and Applications, 2012, 58 : 267 - 288
  • [27] 3D terrain estimation from a single landscape image
    Takahashi, Haruka
    Kanamori, Yoshihiro
    Endo, Yuki
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (06)
  • [28] Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation
    Liu, Zhiwei
    Zhu, Xiangyu
    Yang, Lu
    Yan, Xiang
    Tang, Ming
    Lei, Zhen
    Zhu, Guibo
    Feng, Xuetao
    Wang, Yan
    Wang, Jinqiao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1976 - 1984
  • [29] Deep neural networks for human pose estimation from a very low resolution depth image
    Piotr Szczuko
    Multimedia Tools and Applications, 2019, 78 : 29357 - 29377