Deep Learning in Sign Language Recognition: A Hybrid Approach for the Recognition of Static and Dynamic Signs

被引:8
|
作者
Buttar, Ahmed Mateen [1 ]
Ahmad, Usama [1 ]
Gumaei, Abdu H. [2 ]
Assiri, Adel [3 ]
Akbar, Muhammad Azeem [4 ]
Alkhamees, Bader Fahad [5 ]
机构
[1] Univ Agr Faisalabad, Dept Comp Sci, Faisalabad 38000, Pakistan
[2] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Al Kharj 11942, Saudi Arabia
[3] King Khalid Univ, Coll Business, Management Informat Syst Dept, Abha 61421, Saudi Arabia
[4] LUT Univ, Software Engn Dept, Lahti 15210, Finland
[5] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
关键词
You Only Look Once (YOLO); Long Short-Term Memory (LSTM); deep learning; confusion matrix; convolutional neural network (CNN); MediaPipe holistic;
D O I
10.3390/math11173729
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
A speech impairment limits a person's capacity for oral and auditory communication. A great improvement in communication between the deaf and the general public would be represented by a real-time sign language detector. This work proposes a deep learning-based algorithm that can identify words from a person's gestures and detect them. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty is in obtaining an appropriate model that addresses the challenges of continuous signs that are independent of the signer. Different signers' speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. We create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in sign language in most cases using a deep learning approach. In the second deep learning approach, we create a sign language detector for static signs using YOLOv6. This application is very helpful for sign language users and learners to practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92% accuracy for different continuous signs, and the YOLOv6 model achieves 96% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs according to sign language and shows remarkable accuracy in real time.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Moroccan Sign Language Video Recognition with Deep Learning
    Boukdir, Abdelbasset
    Benaddy, Mohamed
    El Meslouhi, Othmane
    Kardouchi, Mustapha
    Akhloufi, Moulay
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 415 - 422
  • [22] Review of Sign Language Recognition Based on Deep Learning
    Zhang Shujun
    Zhang Qun
    Li Hui
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (04) : 1021 - 1032
  • [23] RECOGNITION OF SIGN LANGUAGE GESTURES USING DEEP LEARNING
    Manoj, R.
    Karthick, R. E.
    Priyadharshini, Indira R.
    Renuka, G.
    Monica
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 508 - 516
  • [24] Deep Learning Methods for Indian Sign Language Recognition
    Likhar, Pratik
    Bhagat, Neel Kamal
    Rathna, G. N.
    2020 IEEE 10TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2020,
  • [25] Isolated Sign Language Recognition Using Deep Learning
    Das, Sukanya
    Yadav, Sumit Kumar
    Samanta, Debasis
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 343 - 356
  • [26] A Deep Learning Approach for Analyzing Video and Skeletal Features in Sign Language Recognition
    Konstantinidis, Dimitrios
    Dimitropoulos, Kosmas
    Daras, Petros
    2018 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST), 2018, : 24 - 29
  • [27] A sensing data and deep learning-based sign language recognition approach
    Hao, Wei
    Hou, Chen
    Zhang, Zhihao
    Zhai, Xueyu
    Wang, Li
    Lv, Guanghao
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118
  • [28] A hybrid approach for Bangla sign language recognition using deep transfer learning model with random forest classifier
    Das, Sunanda
    Imtiaz, Md. Samir
    Neom, Nieb Hasan
    Siddique, Nazmul
    Wang, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [29] Deep Learning Approach for US Traffic Sign Recognition
    Nuakoh, Emmanuel B.
    Roy, Kaushik
    Yuan, Xiaohong
    Esterline, Albert
    ICDLT 2019: 2019 3RD INTERNATIONAL CONFERENCE ON DEEP LEARNING TECHNOLOGIES, 2019, : 47 - 50
  • [30] Sign Language Recognition with Multimodal Sensors and Deep Learning Methods
    Lu, Chenghong
    Kozakai, Misaki
    Jing, Lei
    ELECTRONICS, 2023, 12 (23)