A Spatio-Temporal Framework for Dynamic Indian Sign Language Recognition

被引:2
作者
Sharma, Sakshi [1 ]
Singh, Sukhwinder [1 ]
机构
[1] Punjab Engn Coll, ECE Dept, Chandigarh, India
关键词
Convolutional neural network; Deep learning; Long short-term memory; Indian sign language; Sign language recognition system;
D O I
10.1007/s11277-023-10730-8
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
A sign language recognition system is a boon to the signer community as it eases the flow of information between the signer and non-signer communities. However, extracting timely detail from the video data is still a challenging task. In this paper, a deep learning based model consisting of trainable CNN and trainable stacked 2 bidirectional long short term memory (S2B-LSTM) has been proposed and tested to recognise the dynamic gestures of Indian sign language (ISL). The CNN architecture has been used as feature extractor to extract the spatial features from the input video data, whereas the temporal relation between the consecutive frames of input video is extracted using S2B-LSTM. This model has been trained and tested on self-developed dataset consisting of 360 videos of ISL dynamic gestures. The CNN-S2B-LSTM model outperforms the existing techniques of sign language recognition with best recognition accuracy of 97.6%.
引用
收藏
页码:2527 / 2541
页数:15
相关论文
共 27 条
  • [1] Abhishek KS, 2016, IEEE C ELEC DEVICES, P334, DOI 10.1109/EDSSC.2016.7785276
  • [2] Ahmed W, 2016, PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), P120, DOI 10.1109/INFOSCI.2016.7845312
  • [3] Video-based signer-independent Arabic sign language recognition using hidden Markov models
    AL-Rousan, M.
    Assaleh, K.
    Tala'a, A.
    [J]. APPLIED SOFT COMPUTING, 2009, 9 (03) : 990 - 999
  • [4] Feature extraction in Brazilian Sign Language Recognition based on phonological structure and using RGB-D sensors
    Almeida, Silvia Grasiella Moreira
    Guimaraes, Frederico Gadelha
    Ramirez, Jaime Arturo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (16) : 7259 - 7271
  • [5] A Signer Independent Sign Language Recognition with Co-articulation Elimination from Live Videos: An Indian Scenario
    Athira, P. K.
    Sruthi, C. J.
    Lijiya, A.
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (03) : 771 - 781
  • [6] Bhuyan M. K., 2005, TENCON IEEE REG NOV, P1, DOI 10.1109/TENCON.2005.300947
  • [7] Darwish Saad M, 2017, P 9 INT C COMPUTER A, DOI [10.1145/3057039.3057040, DOI 10.1145/3057039.3057040]
  • [8] Multimodal Learning for Sign Language Recognition
    Ferreira, Pedro M.
    Cardoso, Jaime S.
    Rebelo, Ana
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 313 - 321
  • [9] APPROXIMATION OF DYNAMICAL-SYSTEMS BY CONTINUOUS-TIME RECURRENT NEURAL NETWORKS
    FUNAHASHI, K
    NAKAMURA, Y
    [J]. NEURAL NETWORKS, 1993, 6 (06) : 801 - 806
  • [10] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]