Deep Learning in Sign Language Recognition: A Hybrid Approach for the Recognition of Static and Dynamic Signs

被引:9
作者
Buttar, Ahmed Mateen [1 ]
Ahmad, Usama [1 ]
Gumaei, Abdu H. [2 ]
Assiri, Adel [3 ]
Akbar, Muhammad Azeem [4 ]
Alkhamees, Bader Fahad [5 ]
机构
[1] Univ Agr Faisalabad, Dept Comp Sci, Faisalabad 38000, Pakistan
[2] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Al Kharj 11942, Saudi Arabia
[3] King Khalid Univ, Coll Business, Management Informat Syst Dept, Abha 61421, Saudi Arabia
[4] LUT Univ, Software Engn Dept, Lahti 15210, Finland
[5] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
关键词
You Only Look Once (YOLO); Long Short-Term Memory (LSTM); deep learning; confusion matrix; convolutional neural network (CNN); MediaPipe holistic;
D O I
10.3390/math11173729
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
A speech impairment limits a person's capacity for oral and auditory communication. A great improvement in communication between the deaf and the general public would be represented by a real-time sign language detector. This work proposes a deep learning-based algorithm that can identify words from a person's gestures and detect them. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty is in obtaining an appropriate model that addresses the challenges of continuous signs that are independent of the signer. Different signers' speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. We create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in sign language in most cases using a deep learning approach. In the second deep learning approach, we create a sign language detector for static signs using YOLOv6. This application is very helpful for sign language users and learners to practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92% accuracy for different continuous signs, and the YOLOv6 model achieves 96% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs according to sign language and shows remarkable accuracy in real time.
引用
收藏
页数:20
相关论文
共 24 条
  • [1] Agarwal S.R., 2015, INT J COMPUTER APPL, V116, P18
  • [2] Deep Learning-Based Approach for Sign Language Gesture Recognition With Efficient Hand Gesture Representation
    Al-Hammadi, Muneer
    Muhammad, Ghulam
    Abdul, Wadood
    Alsulaiman, Mansour
    Bencherif, Mohammed A.
    Alrayes, Tareq S.
    Mathkour, Hassan
    Mekhtiche, Mohamed Amine
    [J]. IEEE ACCESS, 2020, 8 (08): : 192527 - 192542
  • [3] Al-Shaheen A., 2022, Int. J. Multidiscip. Stud. Innov. Technol, V6, P61, DOI [10.36287/ijmsit.6.1.61, DOI 10.36287/IJMSIT.6.1.61]
  • [4] Inter-database validation of a deep learning approach for automatic sleep scoring
    Alvarez-Estevez, Diego
    Rijsman, Roselyne M.
    [J]. PLOS ONE, 2021, 16 (08):
  • [5] DeepArSLR: A Novel Signer-Independent Deep Learning Framework for Isolated Arabic Sign Language Gestures Recognition
    Aly, Saleh
    Aly, Walaa
    [J]. IEEE ACCESS, 2020, 8 : 83199 - 83212
  • [6] Arora Sandhya, 2018, International Journal of Business Intelligence and Data Mining, V13, P163
  • [7] Bantupalli K, 2018, IEEE INT CONF BIG DA, P4896, DOI 10.1109/BigData.2018.8622141
  • [8] Hybrid Deep Learning Models for Sentiment Analysis
    Dang, Cach N.
    Moreno-Garcia, Maria N.
    De la Prieta, Fernando
    [J]. COMPLEXITY, 2021, 2021
  • [9] Sign and Human Action Detection Using Deep Learning
    Dhulipala, Shivanarayna
    Adedoyin, Festus Fatai
    Bruno, Alessandro
    [J]. JOURNAL OF IMAGING, 2022, 8 (07)
  • [10] Forster J., 2014, P 9 INT C LANG RES E