A template matching approach of one-shot-learning gesture recognition

被引:27
作者
Mahbub, Upal [1 ]
Imtiaz, Hafiz [1 ]
Roy, Tonmoy [1 ]
Rahman, Md. Shafiur [1 ]
Ahad, Md. Atiqur Rahman [2 ]
机构
[1] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh
[2] Univ Dhaka, Dept Appl Phys Elect & Commun Engn, Dhaka, Bangladesh
关键词
Gesture recognition; Depth image; Motion history image; 2D Fourier transform; MOTION; SEGMENTATION;
D O I
10.1016/j.patrec.2012.09.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel approach for gesture recognition from motion depth images based on template matching. Gestures can be represented with image templates, which in turn can be used to compare and match gestures. The proposed method uses a single example of an action as a query to find similar matches and thus termed one-shot-learning gesture recognition. It does not require prior knowledge about actions, foreground/background segmentation, or any motion estimation or tracking. The proposed method makes a novel approach to separate different gestures from a single video. Moreover, this method is based on the computation of space-time descriptors from the query video which measures the likeness of a gesture in a lexicon. These descriptor extraction methods include the standard deviation of the depth images of a gesture as well as the motion history image. Furthermore, two dimensional discrete Fourier transform is employed to reduce the effect of camera shift. The comparison is done based on correlation coefficient of the image templates and an intelligent classifier is proposed to ensure better recognition accuracy. Extensive experimentation is done on a very complicated and diversified dataset to establish the effectiveness of employing the proposed methods. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1780 / 1788
页数:9
相关论文
共 33 条
  • [21] Mahbub U., 2011, 2011 14th International Conference on Computer and Information Technology (ICCIT), P646, DOI 10.1109/ICCITechn.2011.6164868
  • [22] COMPUTATION OF NORMALIZED EDIT DISTANCE AND APPLICATIONS
    MARZAL, A
    VIDAL, E
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (09) : 926 - 932
  • [23] THRESHOLD SELECTION METHOD FROM GRAY-LEVEL HISTOGRAMS
    OTSU, N
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (01): : 62 - 66
  • [24] Ren Z, 2011, PROC FL STATE HORTIC, V124, P1
  • [25] Ren Z, 2011, P 19 ACM INT C MULT, P1093, DOI DOI 10.1145/2072298.2071946
  • [26] Action Recognition from One Example
    Seo, Hae Jong
    Milanfar, Peyman
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 867 - 882
  • [27] Shao L, 2011, IEEE IMAGE PROC, P209, DOI 10.1109/ICIP.2011.6116023
  • [28] Human action segmentation and recognition via motion and shape analysis
    Shao, Ling
    Ji, Ling
    Liu, Yan
    Zhang, Jianguo
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (04) : 438 - 445
  • [29] Song Y., 2011, Proceedings 2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011), P500, DOI 10.1109/FG.2011.5771448
  • [30] Weilong Yang, 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, P482, DOI 10.1109/ICCVW.2009.5457663