Spatiotemporal features representation with dynamic mode decomposition for hand gesture recognition using deep neural networks

被引:3
作者
Sharma, Bhavana [1 ]
Panda, Jeebananda [1 ]
机构
[1] DTU, Dept Elect & Commun Engn, New Delhi 110042, India
关键词
Hand gesture recognition; Dynamic mode decomposition (DMD); Time dynamics; Spatiotemporal features; Deep neural network;
D O I
10.1007/s11760-024-03038-y
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hand Gesture Recognition (HGR) with complexity and diversity of hand images in uncontrolled environment is a challenging task because of complex backgrounds, light illumination, strong occlusions, blur motion. This work provides a thorough examination of spatiotemporal feature extraction with deep learning model in order to overcome practical variations in lighting and fluctuations of physical hand's movement in both space and time. The hand skin color is first filtered through YCbCr color space and in order to train the hand images, MediaPipe is used to distinguish the specific gesture region. With respect to spatial variations, the spatiotemporal features extraction is done by Dynamic Mode Decomposition (DMD) technique, where hand key features are decoupled with time dynamics and modes in order to obtain time-frequency analysis. Thus, the received reconstructed signal has an enhanced visibility of skin-color pixels. The extensive experiment is demonstrated by deep neural network ResNet18 for better classification on three publicly available datasets, namely, Ego hand dataset, American Sign Language (ASL) dataset and Senz3D dataset. This work outplays existing state-of-arts methods remarkable regarding spatiotemporal features extraction with an accuracy of Ego hand dataset is 97.85% and ASL dataset is 98.49% at specific dynamic modes three, whereas Senz3D dataset achieves 98.51% classification accuracy at dynamic mode two. We have obtained a competitive outcome when comparing the State-Of-The-Art (SOTA) techniques available for HGR.
引用
收藏
页码:3745 / 3759
页数:15
相关论文
共 31 条
[1]   Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training [J].
Abavisani, Mahdi ;
Joze, Hamid Reza Vaezi ;
Patel, Vishal M. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1165-1174
[2]   American Sign Language Words Recognition Using Spatio-Temporal Prosodic and Angle Features: A Sequential Learning Approach [J].
Abdullahi, Sunusi Bala ;
Chamnongthai, Kosin .
IEEE ACCESS, 2022, 10 :15911-15923
[3]   Robust Dynamic Mode Decomposition [J].
Abolmasoumi, Amir Hossein ;
Netto, Marcos ;
Mili, Lamine .
IEEE ACCESS, 2022, 10 :65473-65484
[4]   Deep Learning-Based Approach for Sign Language Gesture Recognition With Efficient Hand Gesture Representation [J].
Al-Hammadi, Muneer ;
Muhammad, Ghulam ;
Abdul, Wadood ;
Alsulaiman, Mansour ;
Bencherif, Mohammed A. ;
Alrayes, Tareq S. ;
Mathkour, Hassan ;
Mekhtiche, Mohamed Amine .
IEEE ACCESS, 2020, 8 :192527-192542
[5]   DeepArSLR: A Novel Signer-Independent Deep Learning Framework for Isolated Arabic Sign Language Gestures Recognition [J].
Aly, Saleh ;
Aly, Walaa .
IEEE ACCESS, 2020, 8 :83199-83212
[6]   Dynamic Mode Decomposition Based Video Shot Detection [J].
Bi, Chongke ;
Yuan, Ye ;
Zhang, Jiawan ;
Shi, Yun ;
Xiang, Yiqing ;
Wang, Yuehuan ;
Zhang, Ronghui .
IEEE ACCESS, 2018, 6 :21397-21407
[7]   In-situ recognition of hand gesture via Enhanced Xception based single-stage deep convolutional neural network [J].
Bose, S. Rubin ;
Kumar, V. Sathiesh .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
[8]   Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition [J].
Chalasani, Tejo ;
Smolic, Aljosa .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :4367-4375
[9]   Variants of Dynamic Mode Decomposition: Boundary Condition, Koopman, and Fourier Analyses [J].
Chen, Kevin K. ;
Tu, Jonathan H. ;
Rowley, Clarence W. .
JOURNAL OF NONLINEAR SCIENCE, 2012, 22 (06) :887-915
[10]   A Comparative Analysis of Signal Decomposition Techniques for Structural Health Monitoring on an Experimental Benchmark [J].
Civera, Marco ;
Surace, Cecilia .
SENSORS, 2021, 21 (05) :1-35