Deep learning in vision-based static hand gesture recognition

Cited by: 219

Authors:

Oyedotun, Oyebade K. [1]

Khashman, Adnan [1,2]

Affiliations:

[1] ECRAA, Mersin 10, Lefkosa, Northern Cyprus, Turkey

[2] Univ Kyrenia, Mersin 10, Kyrenia, Northern Cyprus, Turkey

Source:

NEURAL COMPUTING & APPLICATIONS | 2017, Vol. 28, No. 12

Keywords:

Hand gesture recognition; Human-computer interaction; Neural network; Deep learning; NETWORK;

DOI:

10.1007/s00521-016-2294-8

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Communication through hand gestures has proven effective for humans, and active research is under way to replicate that success in computer vision systems. Human-computer interaction can be significantly improved by advances in systems capable of recognizing different hand gestures. In contrast to many earlier works, which consider only easily distinguishable hand gestures and therefore often select just a few gestures from the American Sign Language (ASL) for recognition, we propose applying deep learning to the recognition of all 24 hand gestures in Thomas Moeslund's gesture recognition database. We show that more biologically inspired, deep neural networks such as the convolutional neural network and the stacked denoising autoencoder are capable of learning the complex hand gesture classification task with lower error rates. The considered networks are trained and tested on data obtained from the above-mentioned public database; the results are then compared against earlier works in which only small subsets of the ASL hand gestures are considered for recognition.


Pages: 3941-3951

Page count: 11

