Convolutional neural network with spatial pyramid pooling for hand gesture recognition

Cited by: 0
Authors
Yong Soon Tan
Kian Ming Lim
Connie Tee
Chin Poo Lee
Cheng Yaw Low
Affiliation
[1] Multimedia University, Faculty of Information Science and Technology (FIST)
Source
Neural Computing and Applications | 2021 / Vol. 33
Keywords
Convolutional neural network (CNN); Spatial pyramid pooling (SPP); Hand gesture recognition; Sign language recognition
DOI
Not available
Abstract
Hand gestures provide a means for humans to interact with one another and with machines. While hand gestures play a significant role in human–computer interaction, they also break down the communication barrier and simplify communication between the general public and the hearing-impaired community. This paper presents a convolutional neural network (CNN) integrated with spatial pyramid pooling (SPP), dubbed CNN–SPP, for vision-based hand gesture recognition. SPP mitigates a limitation of conventional pooling by stacking multi-level pooling to extend the features fed into a fully connected layer. Given inputs of varying sizes, SPP also yields a fixed-length feature representation. Extensive experiments were conducted to evaluate CNN–SPP on two well-known American Sign Language (ASL) datasets and one NUS hand gesture dataset. The empirical results show that CNN–SPP outperforms the other deep learning-based methods considered.
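The fixed-length property described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `spatial_pyramid_pool`, the pyramid levels (1, 2, 4), and the use of max pooling are assumptions chosen for illustration, since the abstract does not state the configuration used in the paper.

```python
import numpy as np


def ceil_div(a, b):
    """Integer ceiling division for non-negative a and positive b."""
    return -(-a // b)


def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map over an n x n grid for each n in
    `levels`, then concatenate everything into one vector.

    The output length is C * sum(n * n for n in levels), regardless of H and W.
    The levels (1, 2, 4) are an illustrative assumption, not the paper's setting.
    """
    channels, height, width = feature_map.shape
    pooled = []
    for n in levels:
        for i in range(n):
            # Floor for the start and ceiling for the end, so the bins cover
            # the whole map and are never empty (they may overlap slightly).
            r0, r1 = (i * height) // n, ceil_div((i + 1) * height, n)
            for j in range(n):
                c0, c1 = (j * width) // n, ceil_div((j + 1) * width, n)
                # One max value per channel for this spatial bin.
                pooled.append(feature_map[:, r0:r1, c0:c1].max(axis=(1, 2)))
    return np.concatenate(pooled)


# Two feature maps with different spatial sizes give vectors of equal length.
rng = np.random.default_rng(0)
for shape in [(8, 13, 17), (8, 32, 20)]:
    vec = spatial_pyramid_pool(rng.standard_normal(shape))
    print(shape, "->", vec.shape)
```

With 8 channels and levels (1, 2, 4), each input yields 8 × (1 + 4 + 16) = 168 features, which is what allows a fully connected layer of fixed input size to follow the pooling stage.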
Pages: 5339-5351
Number of pages: 12