Pakistan sign language recognition: leveraging deep learning models with limited dataset

Cited by: 0
Authors
Hafiz Muhammad Hamza
Aamir Wali
Affiliation
[1] National University of Computer and Emerging Sciences, FAST School of Computing
Source
Machine Vision and Applications | 2023 / Vol. 34
Keywords
Sign language recognition; Pakistan Sign Language; PSL data dictionary; C3D; Data augmentation;
DOI
Not available
Abstract
Sign language is the predominant form of communication for a large group in society. Sign languages are visual in nature, which makes them very different from spoken languages. Unfortunately, very few hearing people can understand sign language, making communication with the hearing-impaired extremely difficult. Research in the field of sign language recognition can help reduce the barrier between deaf and hearing people. A lot of work has been done on sign language recognition for numerous languages, such as American Sign Language and Chinese Sign Language. Unfortunately, very little to no work has been done for Pakistan Sign Language, and existing contributions to Pakistan Sign Language recognition are limited to static images instead of gestures. Furthermore, the dataset available for this language is very small in terms of the number of examples per word, which makes it very difficult to train deep networks that require a considerable amount of training data. Data augmentation techniques help the network generalize better by providing more variety in the training data. In this paper, a pipeline for a Pakistan Sign Language recognition system is proposed that incorporates an augmentation unit. To validate the effectiveness of the proposed pipeline, three deep learning models, C3D, I3D, and TSM, are used. Results show that translation and rotation are the two best augmentation techniques for the Pakistan Sign Language dataset. The models trained using the augmentation-supported pipeline outperform other methods that used only the original data. The most suitable model is C3D, which not only produced an accuracy of 93.33% but also has a lower training time than the other models.
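
The abstract names translation and rotation as the two most effective augmentations. The following is a minimal sketch of how such an augmentation step might look for a video clip, not the authors' implementation. The function names (`translate_clip`, `rotate_clip`, `augment_clip`), the clip layout (frames, height, width, optional channels), and the shift and angle ranges are all assumptions made for illustration; the record does not give these details. The same transform is applied to every frame so that the gesture's motion stays consistent across the clip.

```python
import numpy as np


def translate_clip(clip, dx, dy):
    """Shift every frame by (dx, dy) pixels; exposed borders are zero-filled."""
    _, h, w = clip.shape[:3]
    out = np.zeros_like(clip)
    # Overlapping source/destination windows, clamped so large shifts yield empty slices.
    src_x = slice(max(0, -dx), max(0, min(w, w - dx)))
    dst_x = slice(max(0, dx), max(0, min(w, w + dx)))
    src_y = slice(max(0, -dy), max(0, min(h, h - dy)))
    dst_y = slice(max(0, dy), max(0, min(h, h + dy)))
    out[:, dst_y, dst_x] = clip[:, src_y, src_x]
    return out


def rotate_clip(clip, angle_deg):
    """Rotate every frame about its centre by angle_deg (nearest-neighbour sampling)."""
    _, h, w = clip.shape[:3]
    theta = np.deg2rad(angle_deg)
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.mgrid[0:h, 0:w]
    x0, y0 = xs - cx, ys - cy
    # Inverse mapping: for each output pixel, look up the source pixel it comes from.
    src_x = np.rint(np.cos(theta) * x0 + np.sin(theta) * y0 + cx).astype(int)
    src_y = np.rint(-np.sin(theta) * x0 + np.cos(theta) * y0 + cy).astype(int)
    valid = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.zeros_like(clip)
    # Pixels whose source falls outside the frame stay zero.
    out[:, valid] = clip[:, src_y[valid], src_x[valid]]
    return out


def augment_clip(clip, rng, max_shift=0.1, max_angle=15.0):
    """Apply one random translation and rotation to the whole clip.

    max_shift (fraction of frame size) and max_angle (degrees) are
    illustrative defaults, not values reported in the paper.
    """
    _, h, w = clip.shape[:3]
    max_dx, max_dy = int(max_shift * w), int(max_shift * h)
    dx = int(rng.integers(-max_dx, max_dx + 1))
    dy = int(rng.integers(-max_dy, max_dy + 1))
    angle = float(rng.uniform(-max_angle, max_angle))
    return rotate_clip(translate_clip(clip, dx, dy), angle)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical clip: 16 frames of 112x112 RGB.
    clip = rng.random((16, 112, 112, 3)).astype(np.float32)
    augmented = [augment_clip(clip, rng) for _ in range(4)]
    print(len(augmented), augmented[0].shape)
```

Calling `augment_clip` several times per original clip yields additional training examples for each word, which is the general idea behind augmenting a dataset with few examples per class. A production pipeline would more likely use interpolated sampling from an image library than the nearest-neighbour lookup used here for brevity.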