Dynamic Gesture Recognition using a Transformer and Mediapipe

Cited by: 0
Authors
Althubiti, Asma H. [1]
Algethami, Haneen [1]
Affiliations
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
Keywords
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
DOI
10.14569/IJACSA.2024.01506143
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
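As a rough illustration of how self-attention can capture temporal dependencies across a skeleton sequence, the sketch below passes one 30-frame clip through a single untrained attention layer and a 5-class head. It is not the authors' model: the 63 features per frame (21 hand landmarks with x, y, z coordinates), the embedding size, the random weights, and all function names are assumptions made for illustration, and a full transformer encoder would also include multiple heads, positional encoding, layer normalisation, and a feed-forward block.

```python
import math
import random

NUM_FRAMES = 30     # frames per video (from the abstract)
NUM_CLASSES = 5     # gestures to recognise (from the abstract)
NUM_FEATURES = 63   # assumption: 21 hand landmarks x (x, y, z)
MODEL_DIM = 16      # assumption: small embedding size for illustration


def random_matrix(rows, cols, rng):
    """Untrained weights drawn uniformly; stands in for learned parameters."""
    scale = 1.0 / math.sqrt(rows)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matmul(a, b):
    """Plain matrix product of two row-major matrices."""
    b_cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols] for row in a]


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product attention over the frame axis."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    scale = math.sqrt(len(q[0]))
    k_transposed = [list(col) for col in zip(*k)]
    scores = matmul(q, k_transposed)  # frame-to-frame similarity
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


def classify(frames, params):
    """Embed frames, attend over time, mean-pool, and score each gesture."""
    x = matmul(frames, params["embed"])
    attended, weights = self_attention(x, params["w_q"], params["w_k"], params["w_v"])
    # Residual connection: keep the per-frame embedding alongside its context.
    x = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(x, attended)]
    pooled = [sum(col) / len(x) for col in zip(*x)]
    logits = matmul([pooled], params["head"])[0]
    return softmax(logits), weights


rng = random.Random(0)
params = {
    "embed": random_matrix(NUM_FEATURES, MODEL_DIM, rng),
    "w_q": random_matrix(MODEL_DIM, MODEL_DIM, rng),
    "w_k": random_matrix(MODEL_DIM, MODEL_DIM, rng),
    "w_v": random_matrix(MODEL_DIM, MODEL_DIM, rng),
    "head": random_matrix(MODEL_DIM, NUM_CLASSES, rng),
}
# Synthetic stand-in for one clip of normalised landmark coordinates.
frames = [[rng.random() for _ in range(NUM_FEATURES)] for _ in range(NUM_FRAMES)]
probs, weights = classify(frames, params)
```

Each row of `weights` shows how strongly one frame attends to every other frame in the clip, which is the mechanism that lets the model relate distant frames directly. With untrained weights the class probabilities in `probs` carry no meaning; only the data flow is being shown.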
Pages: 1424-1439
Page count: 16
Related papers
50 in total
  • [31] Hand gesture recognition using subunit-based dynamic time warping
    Wang, Yanrung
    Shimada, Atsushi
    Yamashita, Takayoshi
    Taniguchi, Rin-ichiro
    PROCEEDINGS OF THE EIGHTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 18TH '13), 2013, : 480 - 483
  • [32] Dynamic gesture recognition based on temporal modeling
    Wang Rui
    Wang Zhonghua
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4829 - 4832
  • [33] Dyhand: dynamic hand gesture recognition using BiLSTM and soft attention methods
    Singh, Rohit Pratap
    Singh, Laiphrakpam Dolendro
    VISUAL COMPUTER, 2025, 41 (01) : 41 - 51
  • [34] Dynamic gesture recognition using signal processing based on fuzzy nominal scales
    Allevard, T
    Benoit, E
    Foulloy, L
    MEASUREMENT, 2005, 38 (04) : 303 - 312
  • [35] Visual Gesture Recognition for Human Robot Interaction Using Dynamic Movement Primitives
    Liu, Zhan
    Hu, Fan
    Luo, Dingsheng
    Wu, Xihong
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 2094 - 2100
  • [36] Sparsity aware dynamic gesture recognition using radar sensors with angular diversity
    Yang, Le
    Li, Gang
    IET RADAR SONAR AND NAVIGATION, 2018, 12 (10) : 1114 - 1120
  • [37] A Local Spatial-Temporal Synchronous Network to Dynamic Gesture Recognition
    Zhao, Dongdong
    Yang, Qinglian
    Zhou, Xingwen
    Li, Hongli
    Yan, Shi
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (05) : 2226 - 2233
  • [38] Gesture Recognition for Home Automation using Transfer Learning
    Yang, Sehoon
    Lee, Sangjoon
    Byun, Yungcheol
    2018 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2018, : 136 - 137
  • [39] Gesture Recognition Method Combining Dense Convolutional with Spatial Transformer Networks
    Ma Jie
    Zhang Xiudan
    Yang Nan
    Tian Yalei
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (04) : 951 - 956
  • [40] MediaPipe with LSTM Architecture for Real-Time Hand Gesture Recognization
    Biswas, Sougatamoy
    Nandy, Anup
    Naskar, Asim Kumar
    Saw, Rahul
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 422 - 431