Pakistan sign language recognition: leveraging deep learning models with limited dataset

被引:0
作者
Hafiz Muhammad Hamza
Aamir Wali
机构
[1] National University of Computer and Emerging Science,FAST School of Computing
来源
Machine Vision and Applications | 2023年 / 34卷
关键词
Sign language recognition; Pakistan Sign Language; PSL data dictionary; C3D; Data augmentation;
D O I
暂无
中图分类号
学科分类号
摘要
Sign language is the predominant form of communication among a large group of society. The nature of sign languages is visual. This makes them very different from spoken languages. Unfortunately, very few able people can understand sign language making communication with the hearing-impaired extremely difficult. Research in the field of sign language recognition can help reduce the barrier between deaf and able people. A lot of work has been done on sign language recognition for numerous languages such as American sign language and Chinese sign language. Unfortunately, very little to no work has been done for Pakistan Sign Language. Any contribution in Pakistan Sign Language recognition is limited to static images instead of gestures. Furthermore, the dataset available for this language is very small in terms of the number of examples per word which makes it very difficult to train deep networks that require a considerable amount of training data. Data Augmentation techniques help the network generalize better by providing more variety in the training data. In this paper, a pipeline for the Pakistan Sign Language recognition system is proposed that incorporates an augmentation unit. To validate the effectiveness of the proposed pipeline, three deep learning models, C3D, I3D, and TSM are used. Results show that translation and rotation are the two best augmentation techniques for the Pakistan Sign Language dataset. The models trained using our data-augment-supported pipeline outperform other methods that only used the original data. The most suitable model is C3D which not only produced an accuracy of 93.33% but also has a low training time as compared to other models.
引用
收藏
相关论文
共 50 条
[21]   Efhamni: A Deep Learning-Based Saudi Sign Language Recognition Application [J].
Al Khuzayem, Lama ;
Shafi, Suha ;
Aljahdali, Safia ;
Alkhamesie, Rawan ;
Alzamzami, Ohoud .
SENSORS, 2024, 24 (10)
[22]   Deep Learning-Based Sign Language Recognition System for Cognitive Development [J].
Jebali, Maher ;
Dakhli, Abdesselem ;
Bakari, Wided .
COGNITIVE COMPUTATION, 2023, 15 (06) :2189-2201
[23]   A Comprehensive Study on Deep Learning-Based Methods for Sign Language Recognition [J].
Adaloglou, Nikolas ;
Chatzis, Theocharis ;
Papastratis, Ilias ;
Stergioulas, Andreas ;
Papadopoulos, Georgios Th. ;
Zacharopoulou, Vassia ;
Xydopoulos, George J. ;
Atzakas, Klimnis ;
Papazachariou, Dimitris ;
Daras, Petros .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1750-1762
[24]   Deep Learning-Based Sign Language Recognition System for Cognitive Development [J].
Maher Jebali ;
Abdesselem Dakhli ;
Wided Bakari .
Cognitive Computation, 2023, 15 :2189-2201
[25]   A sensing data and deep learning-based sign language recognition approach [J].
Hao, Wei ;
Hou, Chen ;
Zhang, Zhihao ;
Zhai, Xueyu ;
Wang, Li ;
Lv, Guanghao .
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118
[26]   Active Class Selection for Dataset Acquisition in Sign Language Recognition [J].
Bicego, Manuele ;
Vazquez-Enriquez, Manuel ;
Alba-Castro, Jose L. .
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 :304-315
[27]   Towards Indian Sign Language Sentence Recognition using INSIGNVID: Indian Sign Language Video Dataset [J].
Mistree, Kinjal ;
Thakor, Devendra ;
Bhatt, Brijesh .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) :697-707
[28]   Pakistan Sign Language Recognition Using Statistical Template Matching [J].
Alvi, Aleem Khalid ;
Bin Azhar, M. Yousuf ;
Usman, Mehmood ;
Mumtaz, Suleman ;
Rafiq, Sameer ;
Rehman, Razi Ur ;
Ahmed, Israr .
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 3, 2005, 3 :52-55
[29]   Continuous Arabic Sign Language Recognition Models [J].
Algethami, Nahlah ;
Farhud, Raghad ;
Alghamdi, Manal ;
Almutairi, Huda ;
Sorani, Maha ;
Aleisa, Noura .
SENSORS, 2025, 25 (09)
[30]   UAlpha40: A comprehensive dataset of Urdu alphabet for Pakistan sign language [J].
Javaid, Sameena ;
Sajid, Shahood ;
Baloch, Yusra Khan .
DATA IN BRIEF, 2025, 59