3D Human Pose Estimation from RGB-D Images Using Deep Learning Method

被引：4

作者：

Chun, Junchul ^{[1
]}

Park, Seohee ^{[1
]}

Ji, Myunggeun ^{[1
]}

机构：

[1] Kyonggi Univ, CS Dept, 154-42 Gwanggyosan Ro, Suwon, South Korea

来源：

2018 INTERNATIONAL CONFERENCE ON SENSORS, SIGNAL AND IMAGE PROCESSING (SSIP 2018) | 2018年

基金：

新加坡国家研究基金会;

关键词：

3D Human Pose Estimation; RGB; D Model; 2D Joint Detection; CNN (Convolution Neural Network); DNN (Deep Neural Network); NEURAL-NETWORKS;

D O I：

10.1145/3290589.3290591

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In human activity recognition system, detecting the human and estimating the pose of 2D or 3D human correctly is critical issue. In this paper, we propose a novel approach for 3D human pose estimation from RGB-D CCTV images based on a deep learning approach. By using the RGB-D model rather than the conventional RGB image which has a limitation in detecting object due to the lack of topological information, we can resolve the self-occlusion problem and improve the object detection ratio from efficiently. Subsequently, the position of a human joint is localized with a Convolutional Neural Network (CNN) from the detected person. In this phase, we utilize CPM (Convolutional Pose Machine), to generate belief maps to predict the positions of key-point representing human body parts and estimate 2D human pose by detected key-points. In final stage, we estimate 3D human pose from the 2D joint information based on Deep Neural Network (DNN). From the experiment, we prove that the proposed method detects human objects robustly in occlusion and the estimated 3D human pose are very accurate comparing the previously introduced methods. As for the future work, the estimated 3D human pose will be used for human activity recognition.

引用

页码：51 / 55

页数：5

共 18 条

[1] [Anonymous], 2015, ARXIV PREPRINT ARXIV
[2] [Anonymous], 2017, ARXIV170101779
[3] [Anonymous], 2016, ARXIV161201465
[4] [Anonymous], J HEALTHCARE ENG
[5] [Anonymous], 2017, ARXIV170100295
[6] Chen Chun-Jung, 2017, Collection and Research (Taichung), P1, DOI 10.6693/CAR201712_30(1).0001
[7] 박서희, 2017, [Journal of Internet Computing and Services, 인터넷정보학회논문지], V18, P61, DOI 10.7472/jksii.2017.18.4.61
[8] Marker-Less 3D Human Motion Capture with Monocular Image Sequence and Height-Maps
Du, Yu
Wong, Yongkang
Liu, Yonghao
Han, Feilin
Gui, Yilin
Wang, Zhen
Kankanhalli, Mohan
Geng, Weidong
[J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 20 - 36
[9] Crowd Scene Understanding from Video: A Survey
Grant, Jason M.
Flynn, Patrick J.
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2017, 13 (02)
[10] Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
Ionescu, Catalin
Papava, Dragos
Olaru, Vlad
Sminchisescu, Cristian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) : 1325 - 1339

← 1 2 →