Deep learning in vision-based static hand gesture recognition

被引:219
作者
Oyedotun, Oyebade K. [1 ]
Khashman, Adnan [1 ,2 ]
机构
[1] ECRAA, Mersin 10, Lefkosa, Northern Cyprus, Turkey
[2] Univ Kyrenia, Mersin 10, Kyrenia, Northern Cyprus, Turkey
关键词
Hand gesture recognition; Human-computer interaction; Neural network; Deep learning; NETWORK;
D O I
10.1007/s00521-016-2294-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand gesture for communication has proven effective for humans, and active research is ongoing in replicating the same success in computer vision systems. Human-computer interaction can be significantly improved from advances in systems that are capable of recognizing different hand gestures. In contrast to many earlier works, which consider the recognition of significantly differentiable hand gestures, and therefore often selecting a few gestures from the American Sign Language (ASL) for recognition, we propose applying deep learning to the problem of hand gesture recognition for the whole 24 hand gestures obtained from the Thomas Moeslund's gesture recognition database. We show that more biologically inspired and deep neural networks such as convolutional neural network and stacked denoising autoencoder are capable of learning the complex hand gesture classification task with lower error rates. The considered networks are trained and tested on data obtained from the above-mentioned public database; results comparison is then made against earlier works in which only small subsets of the ASL hand gestures are considered for recognition.
引用
收藏
页码:3941 / 3951
页数:11
相关论文
共 34 条
[1]  
[Anonymous], COMPUTER VISION BASE
[2]  
[Anonymous], 2013, INT C MACHINE LEARNI
[3]  
[Anonymous], 2013, Journal of Image and Graphics, DOI DOI 10.12720/JOIG.1.1.34-38
[4]  
[Anonymous], 2012, INT J COMPUTER SCI E
[5]  
[Anonymous], 2012, International Journal of Computer Applications
[6]  
[Anonymous], 2013, International Journal of Computer Applications
[7]  
[Anonymous], 2014, INT J SCI ENG RES
[8]  
Avraam Marimpis, 2014, International Journal of Advanced Research in Artificial Intelligence, V3, P1
[9]  
Baldi, 2012, P ICML WORKSH UNS TR, V27, P37, DOI DOI 10.1561/2200000006
[10]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493