CNN-Based Denoising, Completion, and Prediction of Whole-Body Human-Depth Images

被引:3
作者
Jang, Jae Won [1 ]
Kwon, Young Chan [1 ]
Lim, Hwasup [2 ]
Choi, Ouk [1 ]
机构
[1] Incheon Natl Univ, Dept Elect Engn, Incheon 22012, South Korea
[2] Korea Inst Sci & Technol, Ctr Imaging Media Res, Seoul 02792, South Korea
关键词
3D human shape; convolutional neural networks; deep learning; single depth image; synthetic data generation; HUMAN BODIES; RECONSTRUCTION; SHAPE;
D O I
10.1109/ACCESS.2019.2957862
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional human shape reconstruction is important in many applications, such as virtual or augmented reality (VR/AR), virtual clothing fitting, and healthcare. In this paper, we propose a learning-based method for reconstructing a whole-body point cloud from a single front-view human-depth image. Because actual depth images typically suffer from noise and missing data, an accurate point cloud cannot be reasonably obtained by simply predicting a back-view depth image. To solve this problem, we propose to use convolutional neural networks that not only predict a back-view depth image but also refine the input front-view depth image. To train the networks, we propose a carefully designed method for generating synthetic but realistic human-depth images with noise and missing data. Experiments show that the proposed method is effective for obtaining seamless whole-body point clouds. In addition, the experiments show that the networks trained on the synthetic depth images are ready for application to actual depth images.
引用
收藏
页码:175842 / 175856
页数:15
相关论文
共 78 条
[41]   Multilinear Pose and Body Shape Estimation of Dressed Subjects from Image Sets [J].
Hasler, Nils ;
Ackermann, Hanno ;
Rosenhahn, Bodo ;
Thormaehlen, Thorsten ;
Seidel, Hans-Peter .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1823-1830
[42]   MovieReshape: Tracking and Reshaping of Humans in Videos [J].
Jain, Arjun ;
Thormaehlen, Thorsten ;
Seidel, Hans-Peter ;
Theobalt, Christian .
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (06)
[43]   Robust alternating optimisation for extrinsic calibration of RGB-D cameras [J].
Jang, J. W. ;
Kwon, Y. C. ;
Hwang, W. ;
Choi, O. .
ELECTRONICS LETTERS, 2019, 55 (18) :992-994
[44]   End-to-end Recovery of Human Shape and Pose [J].
Kanazawa, Angjoo ;
Black, Michael J. ;
Jacobs, David W. ;
Malik, Jitendra .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7122-7131
[45]   Instance-Level Human Parsing via Part Grouping Network [J].
Gong, Ke ;
Liang, Xiaodan ;
Li, Yicheng ;
Chen, Yimin ;
Yang, Ming ;
Lin, Liang .
COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 :805-822
[46]   Multi-Cue-Based Circle Detection and Its Application to Robust Extrinsic Calibration of RGB-D Cameras [J].
Kwon, Young Chan ;
Jang, Jae Won ;
Hwang, Youngbae ;
Choi, Ouk .
SENSORS, 2019, 19 (07)
[47]   ArticulatedFusion: Real-Time Reconstruction of Motion, Geometry and Segmentation Using a Single Depth Camera [J].
Li, Chao ;
Zhao, Zheheng ;
Guo, Xiaohu .
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 :324-340
[48]   SMPL: A Skinned Multi-Person Linear Model [J].
Loper, Matthew ;
Mahmood, Naureen ;
Romero, Javier ;
Pons-Moll, Gerard ;
Black, Michael J. .
ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (06)
[49]   Deep Learning Whole Body Point Cloud Scans from a Single Depth Map [J].
Lunscher, Nolan ;
Zelek, John .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :1208-1215
[50]   Point Cloud Completion of Foot Shape from a Single Depth Map for Fit Matching using Deep Learning View Synthesis [J].
Lunscher, Nolan ;
Zelek, John .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2300-2305