3D Human Pose Estimation from RGB plus D Images with Convolutional Neural Networks

被引:3
作者
Cai, Yiheng [1 ]
Wang, Xueyan [1 ]
Kong, Xinran [1 ]
机构
[1] Beijing Univ Technol, Dept Informat, PingLeyuan 100, Beijing, Peoples R China
来源
2018 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND BIOINFORMATICS (ICBEB 2018) | 2018年
关键词
Human Pose Estimation; Deep Learning; RGB plus D Images;
D O I
10.1145/3278198.3278225
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we explore 3D human pose estimation on the RGB+D images. While many researchers try to directly predict 3D pose from single RGB image, we propose a simple framework that could predict 3D pose predictions with the RGB image and depth image. Our approach is based on two aspects. On the one hand, we predicted accurate 2D joint locations from RGB image by applying the stacked hourglass networks based on the improved residual architecture. On the other hand, in view of obtained 2D joint locations, we could estimate 3D pose with the depth after calculating depth image patches. In general, compared with the state-of-the-art approaches, our model achieves signification improvement on benchmark dataset.
引用
收藏
页码:64 / 69
页数:6
相关论文
共 29 条
  • [1] Adrian Bulat, 2017, ICCV
  • [2] Akhter I, 2015, PROC CVPR IEEE, P1446, DOI 10.1109/CVPR.2015.7298751
  • [3] Amir Shahroudy, 2016, IEEE C COMP VIS PATT
  • [4] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
    Andriluka, Mykhaylo
    Pishchulin, Leonid
    Gehler, Peter
    Schiele, Bernt
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
  • [5] [Anonymous], 2016, Proceedings of the 24th ACM international conference on Multimedia
  • [6] Cao Z., 2016, P IEEE C COMP VIS PA, P7291, DOI DOI 10.1109/CVPR.2017.1432-S2.0-85042428134
  • [7] Carreira J., 2016, COMPUTER VISION PATT
  • [8] Catalin Ionescu, 2011, INT C COMP VIS
  • [9] Chen Ching-Hang, 2017, CVPR
  • [10] Collobert R, 2011, BIGLEARN NIPS WORKSH, P1