First-Person Palm Pose Tracking and Gesture Recognition in Augmented Reality

被引:6
作者
Thalmann, Daniel [1 ]
Liang, Hui [1 ]
Yuan, Junsong [2 ]
机构
[1] Nanyang Technol Univ, Inst Media Innovat, 50 Nanyang Ave, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, 50 Nanyang Ave, Singapore 639798, Singapore
来源
COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS | 2016年 / 598卷
关键词
D O I
10.1007/978-3-319-29971-6_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an Augmented Reality solution to allow users to manipulate and inspect 3D virtual objects freely with their bare hands on wearable devices. To this end, we use a head-mounted depth camera to capture the RGB-D hand images from egocentric view, and propose a unified framework to jointly recover the 6D palm pose and recognize the hand gesture from the depth images. The random forest is utilized to regress for the palm pose and classify the hand gesture simultaneously via a spatial-voting framework. With a real-world annotated training dataset, the proposed method shows to predict the palm pose and gesture accurately. The output of the forest is used to render the 3D virtual objects, which are overlaid onto the hand region in input RGB images with camera calibration parameters to provide seamless virtual and real scene synthesis.
引用
收藏
页码:3 / 15
页数:13
相关论文
共 34 条
  • [11] Hand gesture recognition using a real-time tracking method and hidden Markov models
    Chen, FS
    Fu, CM
    Huang, CL
    [J]. IMAGE AND VISION COMPUTING, 2003, 21 (08) : 745 - 758
  • [12] Chen Q., 2007, IEEE INSTRUMENTATION, P1
  • [13] The representation and recognition of human movement using temporal templates
    Davis, JW
    Bobick, AF
    [J]. 1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, : 928 - 934
  • [14] Model-Based 3D Hand Pose Estimation from Monocular Video
    de La Gorce, Martin
    Fleet, David J.
    Paragios, Nikos
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (09) : 1793 - 1805
  • [15] Freeman W. T., 1995, IEEE INT WORKSH AUT, V12, P296
  • [16] Guan HY, 2006, PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, P263
  • [17] Herdtweck C, 2013, IEEE INT VEH SYM, P403, DOI 10.1109/IVS.2013.6629502
  • [18] CONDENSATION - Conditional density propagation for visual tracking
    Isard, M
    Blake, A
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 29 (01) : 5 - 28
  • [19] Keskin C, 2012, LECT NOTES COMPUT SC, V7577, P852, DOI 10.1007/978-3-642-33783-3_61
  • [20] Hierarchically constrained 3D hand pose estimation using regression forests from single frame depth data
    Kirac, Furkan
    Kara, Yunus Emre
    Akarun, Lale
    [J]. PATTERN RECOGNITION LETTERS, 2014, 50 : 91 - 100