Dynamic Gesture Recognition using a Transformer and Mediapipe

Authors
Althubiti, Asma H. [1]
Algethami, Haneen [1]
Affiliations
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
Keywords
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
DOI
10.14569/IJACSA.2024.01506143
Chinese Library Classification number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
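The idea of using self-attention to capture temporal dependencies across a skeleton sequence can be sketched as follows. This is a minimal illustration, not the authors' model: it assumes each of the 30 frames is reduced to 63 features (21 hand landmarks with x, y, z coordinates, as a MediaPipe hand tracker would produce), uses a single attention head with random, untrained weights, and omits positional encoding, feed-forward layers, and training. The names `self_attention`, `classify`, `NUM_FEATURES`, and `D_MODEL` are hypothetical.

```python
import math
import random

NUM_FRAMES = 30    # frames per video, as stated in the abstract
NUM_CLASSES = 5    # gestures, as stated in the abstract
NUM_FEATURES = 63  # assumption: 21 hand landmarks x (x, y, z)
D_MODEL = 16       # assumption: illustrative projection width


def matmul(a, b):
    """Multiply matrix a (n x k) by matrix b (k x m), both lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng):
    """Random weights standing in for trained parameters."""
    scale = 1.0 / math.sqrt(rows)
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over the frame axis."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]
    scores = matmul(q, k_t)  # frame-to-frame similarity, shape T x T
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights


def classify(sequence, params):
    """Attend over frames, mean-pool over time, project to class probabilities."""
    attended, weights = self_attention(
        sequence, params["w_q"], params["w_k"], params["w_v"]
    )
    pooled = [sum(col) / len(attended) for col in zip(*attended)]
    logits = matmul([pooled], params["w_out"])[0]
    return softmax(logits), weights


rng = random.Random(0)
params = {
    "w_q": random_matrix(NUM_FEATURES, D_MODEL, rng),
    "w_k": random_matrix(NUM_FEATURES, D_MODEL, rng),
    "w_v": random_matrix(NUM_FEATURES, D_MODEL, rng),
    "w_out": random_matrix(D_MODEL, NUM_CLASSES, rng),
}
# Synthetic stand-in for one video's landmark sequence (30 frames x 63 features).
sequence = [[rng.random() for _ in range(NUM_FEATURES)] for _ in range(NUM_FRAMES)]
probabilities, attention = classify(sequence, params)
```

Each row of `attention` shows how strongly one frame draws on every other frame, which is the mechanism that lets an encoder relate distant frames without recurrence. With untrained weights the class probabilities are meaningless; the sketch only shows the data flow.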
Pages: 1424-1439
Page count: 16