Exploiting Recurrent Neural Networks and Leap Motion Controller for the Recognition of Sign Language and Semaphoric Hand Gestures

被引：128

作者：

Avola, Danilo ^{[1
]}

Bernardi, Marco ^{[2
]}

Cinque, Luigi ^{[2
]}

Foresti, Gian Luca ^{[1
]}

Massaroni, Cristiano ^{[2
]}

机构：

[1] Univ Udine Polo Sci Matemat Informat & Multimedia, Dept Math & Compr Sci, I-33100 Udine, Italy

[2] Univ Roma La Sapienza, Fac Ingn Informaz Informat & Stat, Dept Comp Sci, I-00185 Rome, Italy

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2019年 / 21卷 / 01期

关键词：

Hand gesture recognition; sign language; semaphoric gestures; Leap Motion Controller (LMC); Recurrent Neural Network (RNN); Long Short Term Memory (LSTM); DESCRIPTORS; INTERFACES; DESIGN; SENSOR; TIME;

D O I：

10.1109/TMM.2018.2856094

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hand gesture recognition is still a topic of great interest for the computer vision community. In particular, sign language and semaphoric hand gestures are two foremost areas of interest due to their importance in human-human communication and human-computer interaction, respectively. Any hand gesture can be represented by sets of feature vectors that change over time. Recurrent neural networks (RNNs) are suited to analyze this type of set thanks to their ability to model the long-term contextual information of temporal sequences. In this paper, an RNN is trained by using as features the angles formed by the finger bones of the human hands. The selected features, acquired by a leap motion controller sensor, are chosen because the majority of human hand gestures produce joint movements that generate truly characteristic corners. The proposed method, including the effectiveness of the selected angles, was initially tested by creating a very challenging dataset composed by a large number of gestures defined by the American sign language. On the latter, an accuracy of over 96% was achieved. Afterwards, by using the Shape Retrieval Contest (SHREC) dataset, a wide collection of semaphoric hand gestures, the method was also proven to outperform in accuracy competing approaches of the current literature.

引用

页码：234 / 245

页数：12

共 59 条

[1]

[Anonymous], 2013, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2012.59

[2]

[Anonymous], 2006, Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2, Washington, DC, USA

[3]

[Anonymous], 2012, P 27 C IMAGE VISION

[4]

[Anonymous], 2012, SUPERVISED SEQUENCE

[5]

Athitsos V., 2008, PROC IEEE WORKSHOP C, P1

[6] Design of an efficient framework for fast prototyping of customized human-computer interfaces and virtual environments for rehabilitation [J].

Avola, Danilo ;

Spezialetti, Matteo ;

Placidi, Giuseppe .

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2013, 110 (03) :490-502

[7]

Barrientos F.A., 2002, international conference on Collaborative virtual environments, P113

[8]

BISHOP C. M., 2006, Pattern recognition and machine learning, DOI [DOI 10.1117/1.2819119, 10.1007/978-0-387-45528-0]

[9]

Bridle J. S., 1990, Neurocomputing, P227

[10]

Calinon S., 2007, 2007 2nd Annual Conference on Human-Robot Interaction (HRI), P255

← 1 2 3 4 5 6 →