Reconstructing 3D Human Pose from RGB-D Data with Occlusions

被引:1
作者
Dang, Bowen [1 ]
Zhao, Xi [1 ]
Zhang, Bowen [1 ]
Wang, He [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] UCL, London, England
基金
中国国家自然科学基金;
关键词
CCS Concepts; center dot Computing methodologies -> Shape modeling; Reconstruction;
D O I
10.1111/cgf.14982
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We propose a new method to reconstruct the 3D human body from RGB-D images with occlusions. The foremost challenge is the incompleteness of the RGB-D data due to occlusions between the body and the environment, leading to implausible reconstructions that suffer from severe human-scene penetration. To reconstruct a semantically and physically plausible human body, we propose to reduce the solution space based on scene information and prior knowledge. Our key idea is to constrain the solution space of the human body by considering the occluded body parts and visible body parts separately: modeling all plausible poses where the occluded body parts do not penetrate the scene, and constraining the visible body parts using depth data. Specifically, the first component is realized by a neural network that estimates the candidate region named the "free zone", a region carved out of the open space within which it is safe to search for poses of the invisible body parts without concern for penetration. The second component constrains the visible body parts using the "truncated shadow volume" of the scanned body point cloud. Furthermore, we propose to use a volume matching strategy, which yields better performance than surface matching, to match the human body with the confined region. We conducted experiments on the PROX dataset, and the results demonstrate that our method produces more accurate and plausible results compared with other methods.
引用
收藏
页数:13
相关论文
共 34 条
[11]   End-to-end Recovery of Human Shape and Pose [J].
Kanazawa, Angjoo ;
Black, Michael J. ;
Jacobs, David W. ;
Malik, Jitendra .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7122-7131
[12]   Grasping Field: Learning Implicit Representations for Human Grasps [J].
Karunratanakul, Korrawe ;
Yang, Jinlong ;
Zhang, Yan ;
Black, Michael J. ;
Muandet, Krikamol ;
Tang, Siyu .
2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, :333-344
[13]  
Kingma DP, 2014, ADV NEUR IN, V27
[14]   Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop [J].
Kolotouros, Nikos ;
Pavlakos, Georgios ;
Black, Michael J. ;
Daniilidis, Kostas .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2252-2261
[15]   MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes [J].
Li, Zhi ;
Shimada, Soshi ;
Schiele, Bernt ;
Theobalt, Christian ;
Golyanik, Vladislav .
2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, :1-11
[16]   End-to-End Human Pose and Mesh Reconstruction with Transformers [J].
Lin, Kevin ;
Wang, Lijuan ;
Liu, Zicheng .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1954-1963
[17]   SMPL: A Skinned Multi-Person Linear Model [J].
Loper, Matthew ;
Mahmood, Naureen ;
Romero, Javier ;
Pons-Moll, Gerard ;
Black, Michael J. .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (06)
[18]   AMASS: Archive of Motion Capture as Surface Shapes [J].
Mahmood, Naureen ;
Ghorbani, Nima ;
Troje, Nikolaus F. ;
Pons-Moll, Gerard ;
Black, Michael J. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5441-5450
[19]   Occupancy Networks: Learning 3D Reconstruction in Function Space [J].
Mescheder, Lars ;
Oechsle, Michael ;
Niemeyer, Michael ;
Nowozin, Sebastian ;
Geiger, Andreas .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4455-4465
[20]  
Park JJ, 2019, PROC CVPR IEEE, P165, DOI [10.18429/jacow-fel2019-tup054, 10.1109/CVPR.2019.00025]