Isolated sign language recognition through integrating pose data and motion history images

被引:0
|
作者
Akdag, Ali [1 ]
Baykan, Omer Kaan [2 ]
机构
[1] Tokat Gaziosmanpasa Univ, Dept Comp Engn, Tokat, Turkiye
[2] Konya Tech Univ, Dept Comp Engn, Konya, Turkiye
关键词
Sign language recognition; Deep learning; Motion history image; Feature fusion;
D O I
10.7717/peerj-cs.2054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents an innovative approach for the task of isolated sign language recognition (SLR); this approach centers on the integration of pose data with motion history images (MHIs) derived from these data. Our research combines spatial information obtained from body, hand, and face poses with the comprehensive details provided by three-channel MHI data concerning the temporal dynamics of the sign. Particularly, our developed finger pose-based MHI (FP-MHI) feature significantly enhances the recognition success, capturing the nuances of finger movements and gestures, unlike existing approaches in SLR. This feature improves the accuracy and reliability of SLR systems by more accurately capturing the fine details and richness of sign language. Additionally, we enhance the overall model accuracy by predicting missing pose data through linear interpolation. Our study, based on the randomized leaky rectified linear unit (RReLU) enhanced ResNet-18 model, successfully handles the interaction between manual and non-manual features through the fusion of extracted features and classification with a support vector machine (SVM). This innovative integration demonstrates competitive and superior results compared to current methodologies in the field of SLR across various datasets, including BosphorusSign22k-general, BosphorusSign22k, LSA64, and GSL, in our experiments.
引用
收藏
页数:44
相关论文
共 50 条
  • [21] Occlusion robust sign language recognition system for indian sign language using CNN and pose features
    Das S.
    Biswas S.K.
    Purkayastha B.
    Multimedia Tools and Applications, 2024, 83 (36) : 84141 - 84160
  • [22] Sign Pose-based Transformer for Word-level Sign Language Recognition
    Bohacek, Matyas
    Hruz, Marek
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 182 - 191
  • [23] Chinese Sign Language Recognition with 3D Hand Motion Trajectories and Depth Images
    Geng, Lubo
    Ma, Xin
    Wang, Haibo
    Gu, Jason
    Li, Yibin
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1457 - 1461
  • [24] Pose-based Sign Language Recognition using GCN and BERT
    Tunga, Anirudh
    Nuthalapati, Sai Vidyaranya
    Wachs, Juan
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2021), 2021, : 31 - 40
  • [25] Optimized wavelet hand pose estimation for American sign language recognition
    Isaacs, J
    Foo, S
    CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, : 797 - 802
  • [26] Enhancing Indian sign language recognition through data augmentation and visual transformer
    Singla V.
    Bawa S.
    Singh J.
    Neural Computing and Applications, 2024, 36 (24) : 15103 - 15116
  • [27] Continuous sign language recognition using isolated signs data and deep transfer learning
    Sharma, S.
    Gupta, R.
    Kumar, A.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 1531 - 1542
  • [28] Continuous sign language recognition using isolated signs data and deep transfer learning
    S. Sharma
    R. Gupta
    A. Kumar
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 1531 - 1542
  • [29] A framework for motion recognition with applications to American sign language and gait recognition
    Vogler, C
    Sun, H
    Metaxas, D
    WORKSHOP ON HUMAN MOTION, PROCEEDINGS, 2000, : 33 - 38
  • [30] Real-Time Isolated Sign Language Recognition
    Hori, Noriaki
    Yamamoto, Masahito
    FRONTIERS OF ARTIFICIAL INTELLIGENCE, ETHICS, AND MULTIDISCIPLINARY APPLICATIONS, FAIEMA 2023, 2024, : 445 - 458