Optimized 4D DPM for Pose Estimation on RGBD Channels using Polisphere Models

被引:1
作者
Martinez, Enrique [1 ]
Nina, Oliver [2 ]
Sanchez, Antonio J. [1 ]
Ricolfe, Carlos [1 ]
机构
[1] Univ Politecn Valencia, Inst AI2, C Vera S-N, Valencia, Spain
[2] Univ Cent Florida, CRCV, Orlando, FL 32816 USA
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5 | 2017年
关键词
Human Pose Estimation; DPM; RGBD Images; Inverse Kinematics;
D O I
10.5220/0006133702810288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Deformable Parts Model (DPM) is a standard method to perform human pose estimation on RGB images, 3 channels. Although there has been much work to improve such method, little work has been done on utilizing DPM on other types of imagery such as RGBD data. In this paper, we describe a formulation of the DPM model that makes use of depth information channels in order to improve joint detection and pose estimation using 4 channels. In order to offset the time complexity and overhead added to the model due to extra channels to process, we propose an optimization for the proposed algorithm based on solving direct and inverse kinematic equations, that form we can reduce the interested points reducing, at the same time, the time complexity. Our results show a significant improvement on pose estimation over the standard DPM model on our own RGBD dataset and on the public CAD60 dataset.
引用
收藏
页码:281 / 288
页数:8
相关论文
共 20 条
  • [1] [Anonymous], ROM J TECH SCI APPL
  • [2] [Anonymous], KINEMATICS
  • [3] [Anonymous], IEEE RO MAN 14 IEEE
  • [4] [Anonymous], IEEE WORKSH ADV ROB
  • [5] [Anonymous], 2004, MODELING IDENTIFICAT
  • [6] Everingham Mark, 2010, INT J COMPUT VISION, V88, P303, DOI DOI 10.1007/s11263-009-0275-4
  • [7] Felzenszwalb P., 2008, 2013 IEEE C COMPUTER, P1
  • [8] Object Detection with Discriminatively Trained Part-Based Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    Ramanan, Deva
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (09) : 1627 - 1645
  • [9] Pictorial structures for object recognition
    Felzenszwalb, PF
    Huttenlocher, DP
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (01) : 55 - 79
  • [10] Gupta Raj., 2013, P 21 ACM INT C MULTI, P283