3D Human Pose Estimation from RGB-D Images Using Deep Learning Method

被引:4
作者
Chun, Junchul [1 ]
Park, Seohee [1 ]
Ji, Myunggeun [1 ]
机构
[1] Kyonggi Univ, CS Dept, 154-42 Gwanggyosan Ro, Suwon, South Korea
来源
2018 INTERNATIONAL CONFERENCE ON SENSORS, SIGNAL AND IMAGE PROCESSING (SSIP 2018) | 2018年
基金
新加坡国家研究基金会;
关键词
3D Human Pose Estimation; RGB; D Model; 2D Joint Detection; CNN (Convolution Neural Network); DNN (Deep Neural Network); NEURAL-NETWORKS;
D O I
10.1145/3290589.3290591
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In human activity recognition system, detecting the human and estimating the pose of 2D or 3D human correctly is critical issue. In this paper, we propose a novel approach for 3D human pose estimation from RGB-D CCTV images based on a deep learning approach. By using the RGB-D model rather than the conventional RGB image which has a limitation in detecting object due to the lack of topological information, we can resolve the self-occlusion problem and improve the object detection ratio from efficiently. Subsequently, the position of a human joint is localized with a Convolutional Neural Network (CNN) from the detected person. In this phase, we utilize CPM (Convolutional Pose Machine), to generate belief maps to predict the positions of key-point representing human body parts and estimate 2D human pose by detected key-points. In final stage, we estimate 3D human pose from the 2D joint information based on Deep Neural Network (DNN). From the experiment, we prove that the proposed method detects human objects robustly in occlusion and the estimated 3D human pose are very accurate comparing the previously introduced methods. As for the future work, the estimated 3D human pose will be used for human activity recognition.
引用
收藏
页码:51 / 55
页数:5
相关论文
共 18 条
  • [1] [Anonymous], 2015, ARXIV PREPRINT ARXIV
  • [2] [Anonymous], 2017, ARXIV170101779
  • [3] [Anonymous], 2016, ARXIV161201465
  • [4] [Anonymous], J HEALTHCARE ENG
  • [5] [Anonymous], 2017, ARXIV170100295
  • [6] Chen Chun-Jung, 2017, Collection and Research (Taichung), P1, DOI 10.6693/CAR201712_30(1).0001
  • [7] 박서희, 2017, [Journal of Internet Computing and Services, 인터넷정보학회논문지], V18, P61, DOI 10.7472/jksii.2017.18.4.61
  • [8] Marker-Less 3D Human Motion Capture with Monocular Image Sequence and Height-Maps
    Du, Yu
    Wong, Yongkang
    Liu, Yonghao
    Han, Feilin
    Gui, Yilin
    Wang, Zhen
    Kankanhalli, Mohan
    Geng, Weidong
    [J]. COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 : 20 - 36
  • [9] Crowd Scene Understanding from Video: A Survey
    Grant, Jason M.
    Flynn, Patrick J.
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2017, 13 (02)
  • [10] Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments
    Ionescu, Catalin
    Papava, Dragos
    Olaru, Vlad
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) : 1325 - 1339