Dynamic Gesture Recognition using a Transformer and Mediapipe

被引:0
|
作者
Althubiti, Asma H. [1 ]
Algethami, Haneen [1 ]
机构
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
关键词
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
D O I
10.14569/IJACSA.2024.01506143
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
引用
收藏
页码:1424 / 1439
页数:16
相关论文
共 50 条
  • [1] Dynamic Hand Gesture Recognition Using Kinect
    Kadethankar, Atharva Ajit
    Joshi, Apurv Dilip
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [2] A Convolutional-Transformer-Based Approach for Dynamic Gesture Recognition of Data Gloves
    Tang, Yingzhe
    Pan, Mingzhang
    Li, Hongqi
    Cao, Xinxin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13
  • [3] Dynamic Hand Gesture Recognition Using the Skeleton of the Hand
    Bogdan Ionescu
    Didier Coquin
    Patrick Lambert
    Vasile Buzuloiu
    EURASIP Journal on Advances in Signal Processing, 2005
  • [4] Dynamic Hand Gesture Recognition in In-Vehicle Environment Based on FMCW Radar and Transformer
    Zheng, Lianqing
    Bai, Jie
    Zhu, Xichan
    Huang, Libo
    Shan, Chewu
    Wu, Qiong
    Zhang, Lei
    SENSORS, 2021, 21 (19)
  • [5] Dynamic hand gesture recognition using the skeleton of the hand
    Ionescu, B
    Coquin, D
    Lambert, P
    Buzuloiu, V
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (13) : 2101 - 2109
  • [6] Convolutional Transformer Fusion Blocks for Multi-Modal Gesture Recognition
    Hampiholi, Basavaraj
    Jarvers, Christian
    Mader, Wolfgang
    Neumann, Heiko
    IEEE ACCESS, 2023, 11 : 34094 - 34103
  • [7] Integration of Convolutional Neural Network and Vision Transformer for gesture recognition using sEMG
    Liu, Xiaoguang
    Hu, Lijian
    Tie, Liang
    Jun, Li
    Wang, Xiaodong
    Liu, Xiuling
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 98
  • [8] A dynamic gesture recognition and prediction system using the convexity approach
    Barros, Pablo
    Maciel-Junior, Nestor T.
    Fernandes, Bruno J. T.
    Bezerra, Byron L. D.
    Fernandes, Sergio M. M.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 155 : 139 - 149
  • [9] Human gesture recognition using a simplified dynamic Bayesian network
    Myung-Cheol Roh
    Seong-Whan Lee
    Multimedia Systems, 2015, 21 : 557 - 568
  • [10] A CNN-Transformer Hybrid Recognition Approach for sEMG-Based Dynamic Gesture Prediction
    Liu, Yanhong
    Li, Xingyu
    Yang, Lei
    Bian, Guibin
    Yu, Hongnian
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72