Enhancement of Hand Gesture Recognition Using Convolutional Neural Networks Integrating a Combination of an Autoencoder Network and PCA

Cited by: 3
Authors
Bousbai, Khalil [1]
Merah, Mostefa [1]
Affiliations
[1] Mostaganem Univ, Dept Elect Engn, Signals & Syst Lab, Site 1 Route Belahcel, Mostaganem, Algeria
Keywords
Hand gesture recognition; American sign language; deep learning; capsule networks; convolutional networks; autoencoders
DOI
10.1142/S0218001422560158
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hand gestures offer people a convenient way to interact with computers and let them communicate at a distance without physical contact, which is essential under current health conditions, especially during epidemics of infectious viruses such as the COVID-19 coronavirus. However, factors such as the complexity of hand gesture patterns, differences in hand size and position, and other aspects can affect the performance of hand gesture recognition and classification algorithms. Researchers have proposed deep learning approaches such as convolutional neural networks (CNNs), capsule networks (CapsNets) and autoencoders to improve the performance of image recognition systems in this field. While CNNs are arguably the most widely used networks for object detection and image classification, CapsNets and autoencoders seem to resolve some of the limitations identified in CNNs. For this reason, this work proposes a specific combination of these networks to effectively solve the American Sign Language (ASL) recognition problem. The results show that the proposed combination, with a simple data augmentation process, improves precision performance to 99.43%.
Pages: 16