Latent Regression Forest: Structured Estimation of 3D Articulated Hand Posture

Cited by: 232
Authors
Tang, Danhang [1]
Chang, Hyung Jin [1]
Tejani, Alykhan [1]
Kim, Tae-Kyun [1]
Affiliations
[1] Univ London Imperial Coll Sci Technol & Med, London, England
Source
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK;
Keywords
DOI
10.1109/CVPR.2014.490
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present the Latent Regression Forest (LRF), a novel framework for real-time 3D hand pose estimation from a single depth image. In contrast to prior forest-based methods, which take dense pixels as input, classify them independently, and only afterwards estimate joint positions, our method can be considered a structured coarse-to-fine search that starts from the centre of mass of a point cloud and proceeds until all the skeletal joints are located. The search is guided by a learnt Latent Tree Model that reflects the hierarchical topology of the hand. Our main contributions can be summarised as follows: (i) learning the topology of the hand in an unsupervised, data-driven manner; (ii) a new forest-based, discriminative framework for structured search in images, together with an error regression step to avoid error accumulation; (iii) a new multi-view hand pose dataset containing 180K annotated images from 10 different subjects. Our experiments show that the LRF outperforms state-of-the-art methods in both accuracy and efficiency.
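The coarse-to-fine search described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Node` tree, the `TOY_OFFSETS` table, and the `toy_offsets` function are hypothetical stand-ins for the learnt Latent Tree Model and the trained regression forest, and the error regression step is omitted. The sketch only shows the control flow: start at the centre of mass of the point cloud, then repeatedly move from a parent position to its child positions until every leaf (joint) has been placed.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Point = Tuple[float, float, float]


@dataclass
class Node:
    """Vertex of a hypothetical hand-topology tree; leaves stand for joints."""
    name: str
    children: List["Node"] = field(default_factory=list)


def centre_of_mass(cloud: List[Point]) -> Point:
    """Mean of all 3D points; the search starts here."""
    n = len(cloud)
    return tuple(sum(p[i] for p in cloud) / n for i in range(3))


# Assumption: a predictor that returns one 3D offset per child of a node.
# In the paper this role is played by a learnt forest; here it is a stub.
OffsetFn = Callable[[Node, Point], List[Point]]


def coarse_to_fine(root: Node, start: Point, predict: OffsetFn) -> Dict[str, Point]:
    """Walk the tree from the root, placing each child at parent + offset."""
    joints: Dict[str, Point] = {}
    stack = [(root, start)]
    while stack:
        node, pos = stack.pop()
        if not node.children:          # leaf reached: record a joint position
            joints[node.name] = pos
            continue
        for child, off in zip(node.children, predict(node, pos)):
            child_pos = tuple(p + o for p, o in zip(pos, off))
            stack.append((child, child_pos))
    return joints


# Toy topology and hand-written offsets (illustrative values only).
tree = Node("hand", [Node("thumb"),
                     Node("fingers", [Node("index"), Node("pinky")])])
TOY_OFFSETS = {
    "hand": [(-2.0, 0.0, 0.0), (2.0, 1.0, 0.0)],
    "fingers": [(-1.0, 3.0, 0.0), (1.0, 2.0, 0.0)],
}


def toy_offsets(node: Node, pos: Point) -> List[Point]:
    """Stub predictor: ignores the image and looks offsets up by node name."""
    return TOY_OFFSETS[node.name]


cloud = [(0.0, 0.0, 10.0), (2.0, 0.0, 10.0), (0.0, 2.0, 10.0), (2.0, 2.0, 10.0)]
start = centre_of_mass(cloud)
joints = coarse_to_fine(tree, start, toy_offsets)
```

In a real system the predictor would depend on the depth image around the current position, so the offsets would change from frame to frame; the fixed lookup table here only keeps the example self-contained.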
Pages: 3786-3793
Page count: 8