EnGesto: An Ensemble Learning Approach for Classification of Hand Gestures

被引：0

作者：

Raj, Amrutha, V ^{[1
]}

Malu, G. ^{[1
]}

机构：

[1] Digital Univ Kerala, Sch Comp Sci & Engn, Thiruvananthapuram 695317, Kerala, India

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Humanities; Feature extraction; Lighting; Data mining; Computational modeling; Cultural aspects; Convolutional neural network; data augmentation; deep architecture; ensemble learning; gesture recognition; Indian classical dance; IDENTIFICATION; DANCE;

D O I：

10.1109/ACCESS.2024.3411155

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recognizing intricate hand gestures is challenging in various applications like human-computer interaction and e-learning. Accurate classification is crucial for preserving cultural practices and enabling intuitive interaction with machines. However, existing models often struggle with challenges like dynamic lighting, complex backgrounds, varying camera angles, intricate hand poses, noise, and diverse hand attributes. To address the aforementioned challenges, we introduce EnGesto, an ensemble learning model for categorizing intricate hand gestures. EnGesto comprises three major modules: Data Augmentation (DAug), Extract Visual Geometry Group (EVGG), and Multistage Hand Gesture Classification (MuGest). DAug simulates real-world imaging conditions, enhancing accuracy in detecting hand movements and improving resilience to unexpected events, strengthening reliability and capacity. EVGG extracts feature maps using a customized VGG16 model. Within the MuGest module, advanced components such as Fully Convolutional Network (FCN), Region Proposal Network (RPN), Convolutional Neural Network (CNN), Global Max Pooling, Attention layer, and a Fully Connected (FC) layer are employed to carefully select relevant features from EVGG to achieve precise, robust hand gesture classification. Research showcased exemplary performance of the proposed model, surpassing its counterparts in classification accuracy of 97.85%, outperforms VGGNet, ResNet, EfficientNet, and CNN even under demanding image conditions, as in Indian classical dance, Bharatanatyam, with its core mudras-precise gestures conveying a range of emotions and ideas. EnGesto excels in accurate gesture classification, enhancing precision, and efficiency in preserving and facilitating e-learning of treasured art forms while promoting cultural significance, enabling natural, intuitive interaction with machines, and opening avenues for further research and development in this domain.

引用

页码：85709 / 85723

页数：15

共 33 条

[1]

Aashish, 2022, Bharatanatyam Recital by Shobhana on 10 Th Day of 45th Soorya Dance and Music Festival, # Trivandrum

[2] Comparing deep learning models for low-light natural scene image enhancement and their impact on object detection and classification: Overview, empirical evaluation, and challenges [J].

Al Sobbahi, Rayan ;

Tekli, Joe .

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 109

[3] Combined Hu moments, orientation knowledge, and grid intersections feature based identification of Bharatanatyam mudra images [J].

Anami, Basavaraj S. ;

Bhandage, Venkatesh A. .

PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (04) :1439-1454

[4] A Comparative Study of Suitability of Certain Features in Classification of Bharatanatyam Mudra Images Using Artificial Neural Network [J].

Anami, Basavaraj S. ;

Bhandage, Venkatesh A. .

NEURAL PROCESSING LETTERS, 2019, 50 (01) :741-769

[5] A vertical-horizontal-intersections feature based method for identification of bharatanatyam double hand mudra images [J].

Anami, Basavaraj S. ;

Bhandage, Venkatesh A. .

MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (23) :31021-31040

[6] On the Classification of Kathakali Hand Gestures Using Support Vector Machines and Convolutional Neural Networks [J].

Bhavanam, Lakshmi Tulasi ;

Iyer, Ganesh Neelakanta .

2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,

[7]

Biswas S, 2021, 2021 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), P278, DOI [10.1109/WISPNET51692.2021.9419426, 10.1109/WiSPNET51692.2021.9419426]

[8]

Camurri A, 2003, LECT NOTES ARTIF INT, V2915, P20

[9]

Cho M. G., 2017, J. Inst. Control Robot. Syst., V23, P11

[10]

Deepam, 2019, Hasta Bheda: Speaking Through Mudras

← 1 2 3 4 →