Developing an Offline and Real-Time Indian Sign Language Recognition System with Machine Learning and Deep Learning

被引:0
作者
Priya K. [1 ]
Sandesh B.J. [1 ]
机构
[1] Computer Science and Engineering Department, PES University, Bangalore
关键词
CNN; Deep learning; Gabor filters; HSV; Indian sign language; Machine learning; Skin mask; YOLO-NAS-S;
D O I
10.1007/s42979-023-02482-w
中图分类号
学科分类号
摘要
Sign language is a powerful form of communication for humans, and advancements in computer vision systems are driving significant progress in sign language recognition. In the context of Indian sign language (ISL), early research focused on differentiating a limited set of distinct hand signs, often relying on specialized hardware such as sensors and gloves, also most of the works were experimented on the dataset captured under controlled environments. This research aims to enhance communication for the speech and hearing impaired community by recognizing static images of ISL digits and alphabets in both offline and real-time scenarios. To achieve this, two publicly available datasets were used, containing a total of 42,000 sign images and 36,000 static signs, respectively. The dataset1 consists of sign images that were taken under controlled environments, whereas the dataset2 consists of sign images that were taken in different environments with varying backgrounds and lighting conditions. Dataset1 was experimented with and without using preprocessing techniques, while dataset2 underwent similar testing. We employed both machine learning and deep learning with CNN to categorize the ISL alphabets and numbers. In the machine learning approach, image preprocessing techniques such as HSV conversion, skin mask generation, and skin portion extraction and Gabor filtering were used to segment the region of interest, which was then fed to five ML models for sign prediction. In contrast, the DL approach used CNN model. In addition, probability ensemble testing was performed on both datasets to compare the accuracies. Real-time recognition was also conducted using a custom dataset, employing the YOLO-NAS-S model. This study contributes to the advancement of ISL recognition by conducting a comparative analysis of ML algorithms and CNNs, examining their performance with and without preprocessing techniques. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2024.
引用
收藏
相关论文
共 38 条
[1]  
Deora D., Bajaj N., Indian sign language recognition, 2012 1st International Conference on Emerging Technology Trends in Electronics, Communication and Networking, IEEE 2012-978–1-4673-1627-9/12.
[2]  
Nair A.V., Bindu V., A review on indian sign language recognition, Int J Comput Appl, 73, 22, pp. 33-38, (2013)
[3]  
Badenas J., Miguel Sanchiz J., Filiberto P., Motion-based segmentation and region tracking in image sequences, Pattern Recognit, 34, pp. 661-670, (2001)
[4]  
Liao P.-S., Chen T.-S., Chung P.-C., A fast algorithm for multilevel thresholding, J Inf Sci Eng, 17, pp. 713-727, (2001)
[5]  
McIvor A., Zang B., Klette R., The background subtraction problem for video surveillance systems. In: International workshop robot vision 2001, Auckland, New Zealand, February 2001, Springer Lecture Notes in Computer Science, 1998, pp. 176-183
[6]  
Sultana A., Rajapushpa T., Vision based gesture recognition for alphabetical hand gestures using the SVM classifier, Int J Comput Sci Eng Technol, 3, 7, pp. 218-223, (2012)
[7]  
Nanivadekar P.A., Kulkarni V., Indian sign language recognition: Database creation, hand tracking and segmentation, International Conference on Circuits, Systems, Communication and Information Technology Applications, (2014)
[8]  
Raheja J.L., Mishra A., Chaudhary A., Indian Sign language recognition using SVM, Pattern Recognit Image Anal, 26, 2, pp. 434-441, (2016)
[9]  
Loke P., Paranjpe J., Bhabal S., Kanere K., Indian sign language converter system using an android app, International Conference on Electronics, Communication and Aerospace Technology, 2017 IEEE, 978-105090-5686-6/17.
[10]  
Beena M.V., Agnisarman Namboodiri M.N., ASL numerals recognition from depth maps using artificial neural networks, Middle-East J Sci Res, 25, 7, pp. 1407-1413, (2017)