Deep learning in vision-based static hand gesture recognition

被引:219
作者
Oyedotun, Oyebade K. [1 ]
Khashman, Adnan [1 ,2 ]
机构
[1] ECRAA, Mersin 10, Lefkosa, Northern Cyprus, Turkey
[2] Univ Kyrenia, Mersin 10, Kyrenia, Northern Cyprus, Turkey
关键词
Hand gesture recognition; Human-computer interaction; Neural network; Deep learning; NETWORK;
D O I
10.1007/s00521-016-2294-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand gesture for communication has proven effective for humans, and active research is ongoing in replicating the same success in computer vision systems. Human-computer interaction can be significantly improved from advances in systems that are capable of recognizing different hand gestures. In contrast to many earlier works, which consider the recognition of significantly differentiable hand gestures, and therefore often selecting a few gestures from the American Sign Language (ASL) for recognition, we propose applying deep learning to the problem of hand gesture recognition for the whole 24 hand gestures obtained from the Thomas Moeslund's gesture recognition database. We show that more biologically inspired and deep neural networks such as convolutional neural network and stacked denoising autoencoder are capable of learning the complex hand gesture classification task with lower error rates. The considered networks are trained and tested on data obtained from the above-mentioned public database; results comparison is then made against earlier works in which only small subsets of the ASL hand gestures are considered for recognition.
引用
收藏
页码:3941 / 3951
页数:11
相关论文
共 34 条
[31]  
Vincent P, 2010, J MACH LEARN RES, V11, P3371
[32]   Face Recognition Based on Deep Learning [J].
Wang, Weihong ;
Yang, Jie ;
Xiao, Jianwei ;
Li, Sheng ;
Zhou, Dixin .
HUMAN CENTERED COMPUTING, HCC 2014, 2015, 8944 :812-820
[33]  
Yewale S. K., 2011, 2011 Proceedings of International Conference on Emerging Trends in Networks and Computer Communications (ETNCC 2011), P287, DOI 10.1109/ETNCC.2011.6255906
[34]   Visualizing and Understanding Convolutional Networks [J].
Zeiler, Matthew D. ;
Fergus, Rob .
COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :818-833