Enhancement of Hand Gesture Recognition Using Convolutional Neural Networks Integrating a Combination of an Autoencoder Network and PCA

被引:3
作者
Bousbai, Khalil [1 ]
Merah, Mostefa [1 ]
机构
[1] Mostaganem Univ, Dept Elect Engn, Signals & Syst Lab, Site 1 Route Belahcel, Mostaganem, Algeria
关键词
Hand gesture recognition; American sign language; deep learning; capsule networks; convolutional networks; autoencoders;
D O I
10.1142/S0218001422560158
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hand gestures offer people a convenient way to interact with computers, in addition to give them the ability to communicate without physical contact and at a distance, which is essential in today's health conditions, especially during an epidemic infectious viruses such as the COVID-19 coronavirus. However, factors, such as the complexity of hand gesture patterns, differences in hand size and position, and other aspects, can affect the performance of hand gesture recognition and classification algorithms. Some deep learning approaches such as convolutional neural networks (CNN), capsule networks (CapsNets) and autoencoders have been proposed by researchers to improve the performance of image recognition systems in this particular field: While CNNs are arguably the most widely used networks for object detection and image classification, CapsNets and Autoencoder seem to resolve some of the limitations identified in the first approach. For this reason, in this work, a specific combination of these networks is proposed to effectively solve the ASL problem. The results obtained in this work show that the proposed group with a simple data augmentation process improves precision performance by 99.43%.
引用
收藏
页数:16
相关论文
共 27 条
[1]   A convolutional neural network to classify American Sign Language fingerspelling from depth and colour images [J].
Ameen, Salem ;
Vadera, Sunil .
EXPERT SYSTEMS, 2017, 34 (03)
[2]  
[Anonymous], 2017, SIGN LANG MNIST
[3]  
Bilgin M., 2019, P 2019 3 INT S MULT, P1, DOI DOI 10.1109/ISMSIT.2019.8932829
[4]  
Bousbai K., 2019, P 2019 6 INT C IMAGE, P1
[5]  
Brown G., 2005, Information Fusion, V6, P5, DOI 10.1016/j.inffus.2004.04.004
[6]   A Hybrid Autoencoder Network for Unsupervised Image Clustering [J].
Chen, Pei-Yin ;
Huang, Jih-Jeng .
ALGORITHMS, 2019, 12 (06)
[7]   Laser stripe image denoising using convolutional autoencoder [J].
Fang, Zhuoqun ;
Jia, Tong ;
Chen, Qiusheng ;
Xu, Ming ;
Yuan, Xi ;
Wu, Chengdong .
RESULTS IN PHYSICS, 2018, 11 :96-104
[8]   Variational autoencoder: An unsupervised model for encoding and decoding fMRI activity in visual cortex [J].
Han, Kuan ;
Wen, Haiguang ;
Shi, Junxing ;
Lu, Kun-Han ;
Zhang, Yizhen ;
Fu, Di ;
Liu, Zhongming .
NEUROIMAGE, 2019, 198 :125-136
[9]   Unsupervised feature extraction with autoencoder trees [J].
Irsoy, Ozan ;
Alpaydin, Ethem .
NEUROCOMPUTING, 2017, 258 :63-73
[10]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90