An Embedded Neural Network Approach for Reinforcing Deep Learning: Advancing Hand Gesture Recognition

被引:1
作者
Mira, Anwar [1 ]
Hellwich, Olaf [2 ]
机构
[1] Univ Babylon, Coll Informat Technol, Babylon, Iraq
[2] Tech Univ Berlin, Comp Vis & Remote Sensing, Berlin, Germany
关键词
Deep Neural Network; Radial Basis Function Neural Network; Self Organizing Maps Network; K-Means clustering; CLASSIFICATION;
D O I
10.3897/jucs.110291
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep neural networks (DNNs) can face limitations during training for recognition, motivating this study to improve recognition capabilities by optimizing deep learning features for hand gesture image recognition. We propose a novel approach that enhances features from well-trained DNNs using an improved radial basis function (RBF) neural network, targeting recognition within individual gesture categories. We achieve this by clustering images with a self-organizing map (SOM) network to identify optimal centers for RBF training. Our enhanced SOM, employing the Hassanat distance metric, outperforms the traditional K-Means method across a comparative analysis of various distance functions and the expanded number of cluster centers, accurately identifying hand gestures in images. Our training pipeline learns from hand gesture videos and static images, addressing the growing need for machines to interact with gestures. Despite challenges posed by gesture videos, such as sensitivity to hand pose sequences within a single gesture category and overlapping hand poses due to the high similarities and repetitions, our pipeline achieved significant enhancement without requiring time-related training data. We also improve the recognition of static hand pose images within the same category. Our work advances DNNs by integrating deep learning features and incorporating SOM for RBF training.
引用
收藏
页码:957 / 977
页数:21
相关论文
共 30 条
[1]  
[Anonymous], 2013, International Journal of Engineering and Computer Science
[2]  
Hassanat AB, 2014, Arxiv, DOI arXiv:1409.0923
[3]   Rapid and efficient hand gestures recognizer based on classes discriminator wavelet networks [J].
Bouchrika, Tahani ;
Jemai, Olfa ;
Zaied, Mourad ;
Ben Amar, Chokri .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (05) :5995-6016
[4]  
Brixy Mareva, 2017, Self-organising Map for handwritten number classification
[5]  
Chase L. D., 2008, Euclidean Distance, P824
[6]   Multi-feature fusion: Graph neural network and CNN combining for hyperspectral image classification [J].
Ding, Yao ;
Zhang, Zhili ;
Zhao, Xiaofeng ;
Hong, Danfeng ;
Cai, Wei ;
Yu, Chengguo ;
Yang, Nengjun ;
Cai, Weiwei .
NEUROCOMPUTING, 2022, 501 :246-257
[7]   ChaLearn Looking at People Challenge 2014: Dataset and Results [J].
Escalera, Sergio ;
Baro, Xavier ;
Gonzalez, Jordi ;
Bautista, Miguel A. ;
Madadi, Meysam ;
Reyes, Miguel ;
Ponce-Lopez, Victor ;
Escalante, Hugo J. ;
Shotton, Jamie ;
Guyon, Isabelle .
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 :459-473
[8]  
Gan G., 2020, Data clustering: theory, algorithms, and applications
[9]   Hybrid neural networks for big data classification [J].
Hernandez, Gerardo ;
Zamora, Erik ;
Sossa, Humberto ;
Tellez, German ;
Furlan, Federico .
NEUROCOMPUTING, 2020, 390 :327-340
[10]  
Jaccard P., 1901, B SOCIT VAUDOISEDES, V37, P547