Kinect-based hand gesture recognition using trajectory information, hand motion dynamics and neural networks

Cited by: 0
Authors
Fenglin Liu
Wei Zeng
Chengzhi Yuan
Qinghui Wang
Ying Wang
Affiliations
[1] Longyan University, School of Mechanical and Electrical Engineering
[2] University of Rhode Island, Department of Mechanical, Industrial and Systems Engineering
Source
Artificial Intelligence Review | 2019 / Vol. 52
Keywords
Hand gesture recognition; Kinect; Hand motion dynamics; RBF neural networks
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Hand gestures are spatio-temporal patterns which can be characterized by collections of spatio-temporal features. Recognizing hand gestures means finding the re-occurrences of such spatio-temporal patterns through pattern matching. However, accurate recognition of dynamic hand gestures faces many obstacles, including poor lighting conditions, the camera’s inability to keep a dynamic gesture in focus, occlusion due to finger movement, and color variations due to lighting conditions. The Microsoft Kinect device provides an effective way to address these issues and also provides the skeleton for more convenient hand localization and tracking. The aim of this study is to develop a new trajectory-based method for hand gesture recognition using Kinect. In the first step, trajectory-based hand gesture features, including the spatial position and direction of the fingertips, are derived from Kinect. The properties associated with the hand motion dynamics are preserved in these features. In the second step, radial basis function (RBF) neural networks are employed to model and approximate the hand motion dynamics derived from different hand gestures which represent Arabic numerals (0–9) and English letters (A–Z). The trained patterns of the approximated hand motion dynamics are stored in constant RBF networks. In the last step, a bank of dynamical estimators is constructed for all the training patterns, in which the constant RBF networks are embedded. By comparing the set of estimators with a test gesture pattern, a set of recognition errors is generated, and the average L1 norms of the errors are taken as the recognition measure based on the smallest error principle.
Finally, experiments are carried out to assess the performance of the proposed method compared with other state-of-the-art approaches. Under twofold and tenfold cross-validation, the correct recognition rates are reported to be 95.83% and 97.25%, respectively, for Arabic numerals (0–9), and 91.35% and 92.63%, respectively, for English letters (A–Z).
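The recognition step described in the abstract (a bank of estimators with embedded constant RBF networks, scored by the average L1 norm of the errors) can be illustrated with a small sketch. This is not the authors' implementation: the two synthetic 2D trajectories, the grid of RBF centers, the Gaussian width, the ridge least-squares fit, and all function names below are assumptions made only for illustration, and the "dynamics" are simplified to one-step position increments.

```python
import numpy as np

def rbf_features(x, centers, width):
    """Gaussian RBF activations of points x (n, d) against centers (m, d)."""
    d2 = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / width ** 2)

def fit_constant_rbf(traj, centers, width, ridge=1e-3):
    """Fit fixed weights W so that rbf(x_k) @ W approximates x_{k+1} - x_k.
    Ridge least squares is an assumed stand-in for the paper's training."""
    phi = rbf_features(traj[:-1], centers, width)
    target = np.diff(traj, axis=0)
    gram = phi.T @ phi + ridge * np.eye(phi.shape[1])
    return np.linalg.solve(gram, phi.T @ target)

def average_l1_error(traj, weights, centers, width):
    """Average L1 norm of the mismatch between observed and predicted increments."""
    phi = rbf_features(traj[:-1], centers, width)
    residual = np.diff(traj, axis=0) - phi @ weights
    return np.abs(residual).sum(axis=1).mean()

def recognize(traj, bank, centers, width):
    """Smallest error principle: return the label whose estimator fits best."""
    errors = {label: average_l1_error(traj, w, centers, width)
              for label, w in bank.items()}
    return min(errors, key=errors.get), errors

# Hypothetical setup: a 7 x 7 grid of centers covering [-1.5, 1.5]^2.
grid = np.linspace(-1.5, 1.5, 7)
centers = np.array([[gx, gy] for gx in grid for gy in grid])
width = 0.5

# Synthetic training "gestures": a circle standing in for "0", a stroke for "1".
t = np.linspace(0.0, 2.0 * np.pi, 200)
s = np.linspace(-1.0, 1.0, 200)
train = {
    "0": np.column_stack([np.cos(t), np.sin(t)]),
    "1": np.column_stack([s, np.zeros_like(s)]),
}
bank = {label: fit_constant_rbf(traj, centers, width)
        for label, traj in train.items()}

# Test gestures: slightly perturbed versions of the training shapes.
test_circle = 1.05 * train["0"]
test_line = train["1"] + np.array([0.0, 0.05])

label_circle, errors_circle = recognize(test_circle, bank, centers, width)
label_line, errors_line = recognize(test_line, bank, centers, width)
print(label_circle, errors_circle)
print(label_line, errors_line)
```

Each entry in `bank` plays the role of one estimator with a fixed (constant) network; a test trajectory is compared against every entry, and the label with the smallest average L1 error is returned. Because the sketch models increments per sample, it is sensitive to gesture speed and sampling rate, which a real pipeline would have to normalize.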
Pages: 563-583
Number of pages: 20