Dynamic Gesture Recognition using a Transformer and Mediapipe

被引:0
|
作者
Althubiti, Asma H. [1 ]
Algethami, Haneen [1 ]
机构
[1] Taif Univ, Dept Comp Sci, Coll Comp & Informat Technol, Taif 21944, Saudi Arabia
关键词
Gesture recognition; self-attention; transformer encoder; skeleton; transfer learning;
D O I
10.14569/IJACSA.2024.01506143
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is a rising interest in dynamic gesture recognition as a research area. This is the result of emerging global pandemics as well as the need to avoid touching different surfaces. Most of the previous research has focused on implementing deep learning algorithms for the RGB modality. However, despite its potential to enhance the algorithm's performance, gesture recognition has not widely utilised the concept of attention. Most research also used three-dimensional convolutional networks with long short-term memory networks for gesture recognition. However, these networks can be computationally expensive. As a result, this paper employs pre-trained models in conjunction with the skeleton modality to address the challenges posed by background noise. The goal is to present a comparative analysis of various gesture recognition models, divided based on video frames or skeletons. The performance of different models was evaluated using a dataset taken from Kaggle with a size of 2 GB. Each video contains 30 frames (or images) to recognise five gestures. The transformer model for skeleton-based gesture recognition achieves 0.99 accuracy and can be used to capture temporal dependencies in sequential data.
引用
收藏
页码:1424 / 1439
页数:16
相关论文
共 50 条
  • [21] Gesture Recognition using Dynamic Time Warping and Kinect: A Practical Approach
    Riofrio, Santiago
    Pozo, David
    Rosero, Jorge
    Vasquez, Juan
    2017 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER SCIENCE (INCISCOS), 2017, : 302 - 308
  • [22] End-to-End Dynamic Gesture Recognition Using MmWave Radar
    Ali, Anum
    Parida, Priyabrata
    Va, Vutha
    Ni, Saifeng
    Nguyen, Khuong Nhat
    Ng, Boon Loong
    Zhang, Jianzhong Charlie
    IEEE ACCESS, 2022, 10 : 88692 - 88706
  • [23] Dynamic gesture recognition using a model-based temporal self-similarity and its application to taebo gesture recognition
    Lee, Kyoung-Mi
    Won, Hey-Min
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2013, 7 (11): : 2824 - 2838
  • [24] Dynamic gesture recognition based on gesture contour model of time series
    Pang, Hai-Bo
    Li, Zhan-Bo
    Ding, You-Dong
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2015, 43 (01): : 140 - 146
  • [25] SwinFMCW: A Joint Swin Transformer and LSTM Method for Gesture and Identity Recognition Using FMCW Radar
    Sun, Beichen
    Xu, Zhimeng
    Wu, Zhenbin
    Zhang, Shanshan
    2022 CROSS STRAIT RADIO SCIENCE & WIRELESS TECHNOLOGY CONFERENCE, CSRSWTC, 2022,
  • [26] Gesture recognition matching based on dynamic skeleton
    Wang Jingyao
    Yu Naigong
    Firdaous, Essaf
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1680 - 1685
  • [27] Robotic control of dynamic and static gesture recognition
    Zhang, Xuexiang
    Wu, Xuncheng
    2019 2ND WORLD CONFERENCE ON MECHANICAL ENGINEERING AND INTELLIGENT MANUFACTURING (WCMEIM 2019), 2019, : 474 - 478
  • [28] Dynamic Gesture Recognition Based on MEMP Network
    Zhang, Xinyu
    Li, Xiaoqiang
    FUTURE INTERNET, 2019, 11 (04):
  • [29] Framework for dynamic hand gesture recognition using Grassmann manifold for intelligent vehicles
    Verma, Bindu
    Choudhary, Ayesha
    IET INTELLIGENT TRANSPORT SYSTEMS, 2018, 12 (07) : 721 - 729
  • [30] Dynamic Vision Sensor Based Gesture Recognition Using Liquid State Machine
    Xiao, Xun
    Wang, Lei
    Chen, Xiaofan
    Qu, Lianhua
    Guo, Shasha
    Wang, Yao
    Kang, Ziyang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 618 - 629