Real-Time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Cited by: 160
Authors
Sridhar, Srinath [1]
Mueller, Franziska [1]
Zollhoefer, Michael [1]
Casas, Dan [1]
Oulasvirta, Antti [2]
Theobalt, Christian [1]
Affiliations
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Aalto Univ, Espoo, Finland
Source
COMPUTER VISION - ECCV 2016, PT II | 2016 / Vol. 9906
Keywords
MOTION; MODEL; SHAPE; POSE
DOI
10.1007/978-3-319-46475-6_19
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Real-time simultaneous tracking of hands manipulating and interacting with external objects has many potential applications in augmented reality, tangible computing, and wearable computing. However, due to difficult occlusions, fast motions, and uniform hand appearance, jointly tracking hand and object pose is more challenging than tracking either of the two separately. Many previous approaches resort to complex multi-camera setups to remedy the occlusion problem and often employ expensive segmentation and optimization steps which makes real-time tracking impossible. In this paper, we propose a real-time solution that uses a single commodity RGB-D camera. The core of our approach is a 3D articulated Gaussian mixture alignment strategy tailored to hand-object tracking that allows fast pose optimization. The alignment energy uses novel regularizers to address occlusions and hand-object contacts. For added robustness, we guide the optimization with discriminative part classification of the hand and segmentation of the object. We conducted extensive experiments on several existing datasets and introduce a new annotated hand-object dataset. Quantitative and qualitative results show the key advantages of our method: speed, accuracy, and robustness.
Pages: 294-310
Page count: 17
Related papers
43 in total
[1]  
[Anonymous], 2014, ACM T GRAPHIC, DOI 10.1145/2601097.2601165
[2]  
[Anonymous], 2013, P GRAPHICS INTERFACE
[3]  
[Anonymous], 2011, BMVC
[4]  
Athitsos V, 2003, PROC CVPR IEEE, P432
[5]  
Badami I., 2013, WORKSH SEM PERC MAPP
[6]  
Ballan L, 2012, LECT NOTES COMPUT SC, V7577, P640, DOI 10.1007/978-3-642-33783-3_46
[7]   Smart particle filtering for 3D hand tracking [J].
Bray, M ;
Koller-Meier, E ;
Van Gool, L .
SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, :675-680
[8]  
Campbell D., 2016, ARXIV160300150
[9]   Model-Based 3D Hand Pose Estimation from Monocular Video [J].
de La Gorce, Martin ;
Fleet, David J. ;
Paragios, Nikos .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (09) :1793-1805
[10]   Tracking a Hand Manipulating an Object [J].
Hamer, Henning ;
Schindler, Konrad ;
Koller-Meier, Esther ;
Van Gool, Luc .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1475-1482