Automatic Indian sign language recognition using MediaPipe holistic and LSTM network

被引:1
|
作者
Khartheesvar, G. [1 ]
Kumar, Mohit [1 ]
Yadav, Arun Kumar [1 ]
Yadav, Divakar [2 ]
机构
[1] NIT Hamirpur, Dept Comp Sci & Engn, Hamirpur, HP, India
[2] Indira Gandhi Natl Open Univ, Sch Comp & Informat Sci, New Delhi, India
关键词
Sign language recognition; Indian sign language; LSTM; MediaPipe holistic; INCLUDE; INCLUDE-50; HAND POSTURE; GESTURES;
D O I
10.1007/s11042-023-17361-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sign language is a reliable medium of communication that deaf and mute individuals use to express themselves and connect with society. However, communication between a person who knows sign language and someone unfamiliar with sign language can be difficult. Indian Sign Language is slowly gaining popularity but faces similar problems. In this paper, a method for recognizing isolated words in Indian Sign Language (ISL) using MediaPipe holistic pipeline for feature extraction and Long-Short Term Memory (LSTM) network is proposed. The proposed method is evaluated on a large-scale ISL video dataset, INCLUDE, along with its smaller subset, INCLUDE-50. The proposed method achieves 94.8% and 87.4% accuracy on INCLUDE-50 and INCLUDE, respectively. Furthermore, a macro averaged F1-score of 93.5% and 86.6% is obtained on the INCLUDE-50 and INCLUDE, respectively. Both these results outperform the current state of the art performance on the respective datasets.
引用
收藏
页码:58329 / 58348
页数:20
相关论文
共 50 条
  • [1] Continuous Sign Language Recognition System Using Deep Learning with MediaPipe Holistic
    Srivastava, Sharvani
    Singh, Sudhakar
    Pooja, Shiv
    Prakash, Shiv
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 137 (03) : 1455 - 1468
  • [2] An integrated mediapipe-optimized GRU model for Indian sign language recognition
    Barathi Subramanian
    Bekhzod Olimov
    Shraddha M. Naik
    Sangchul Kim
    Kil-Houm Park
    Jeonghong Kim
    Scientific Reports, 12
  • [3] Interpreting Sign Language Recognition using Transformers and MediaPipe Landmarks
    Luna-Jimenez, Cristina
    Gil-Martin, Manuel
    Kleinlein, Ricardo
    San-Segundo, Ruben
    Fernandez-Martinez, Fernando
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 373 - 377
  • [4] An integrated mediapipe-optimized GRU model for Indian sign language recognition
    Subramanian, Barathi
    Olimov, Bekhzod
    Naik, Shraddha M.
    Kim, Sangchul
    Park, Kil-Houm
    Kim, Jeonghong
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Automatic Indian Sign Language Recognition System
    Dixit, Karishma
    Jalal, Anand Singh
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 883 - 887
  • [6] Sign Language Recognition of Selected Filipino Phrases Using LSTM Neural Network
    Montefalcon, Myron Darrel
    Padilla, Jay Rhald
    Rodriguez, Ramon
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 4, 2023, 465 : 633 - 641
  • [7] Automatic Recognition of Indian Sign Language using Image Processing and Computer Vision
    Kuanr, Gaurahari
    Chaudhury, Saurabh
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 1469 - 1474
  • [8] MediaPipe's Landmarks with RNN for Dynamic Sign Language Recognition
    Samaan, Gerges H.
    Wadie, Abanoub R.
    Attia, Abanoub K.
    Asaad, Abanoub M.
    Kamel, Andrew E.
    Slim, Salwa O.
    Abdallah, Mohamed S.
    Cho, Young-Im
    ELECTRONICS, 2022, 11 (19)
  • [9] Integrated Mediapipe with a CNN Model for Arabic Sign Language Recognition
    AL Moustafa, Ahmad M. J.
    Rahim, Mohd Shafry Mohd
    Bouallegue, Belgacem
    Khattab, Mahmoud M.
    Soliman, Amr Mohmed
    Tharwat, Gamal
    Ahmed, Abdelmoty M.
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2023, 2023
  • [10] Indian sign language recognition system using network deconvolution and spatial transformer network
    Ghorai, Anudyuti
    Nandi, Utpal
    Changdar, Chiranjit
    Si, Tapas
    Singh, Moirangthem Marjit
    Mondal, Jyotsna Kumar
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 20889 - 20907