Real-time sign language detection: Empowering the disabled community

被引:0
作者
Kumar, Sumit [1 ]
Rani, Ruchi [2 ]
Chaudhari, Ulka [2 ]
机构
[1] Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Pune Campus, Pune 412115, Maharashtra, India
[2] Dr Vishwanath Karad MIT World Peace Univ Pune, Sch Comp Engn & Technol, Dept Comp Engn & Technol, Pune 411038, Maharashtra, India
关键词
Sign Language (SL); Disabled; Transfer learning; Convolutional neural networks (CNNs); VGG16; model; Pre-trained models; Classification;
D O I
10.1016/j.mex.2024.102901
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interaction and communication for normal human beings are easier than for a person with disabilities like speaking and hearing who may face communication problems with other people. Sign Language helps reduce this communication gap between a normal and disabled person. The prior solutions proposed using several deep learning techniques, such as Convolutional Neural Networks, Support Vector Machines, and K-Nearest Neighbors, have either demonstrated low accuracy or have not been implemented as real-time working systems. This system addresses both issues effectively. This work extends the difficulties faced while classifying the characters in Indian Sign Language(ISL). It can identify a total of 23 hand poses of the ISL. The system uses a pre-trained VGG16 Convolution Neural Network(CNN) with an attention mechanism. The model's training uses the Adam optimizer and cross-entropy loss function. The results demonstrate the effectiveness of transfer learning for ISL classification, achieving an accuracy of 97.5 % with VGG16 and 99.8 % with VGG16 plus attention mechanism. center dot Enabling quick and accurate sign language recognition with the help of trained model VGG16 with an attention mechanism. center dot The system does not require any external gloves or sensors, which helps to eliminate the need for physical sensors while simplifying the process with reduced costs. center dot Real-time processing makes the system more helpful for people with speaking and hearing disabilities, making it easier for them to communicate with other humans.
引用
收藏
页数:10
相关论文
共 22 条
[1]   Automated sign language detection and classification using reptile search algorithm with hybrid deep learning [J].
Alsolai, Hadeel ;
Alsolai, Leen ;
Al-Wesabi, Fahd N. ;
Othman, Mahmoud ;
Rizwanullah, Mohammed ;
Abdelmageed, Amgad Atta .
HELIYON, 2024, 10 (01)
[2]   Reviewing 25 years of continuous sign language recognition research: Advances, challenges, and prospects [J].
Alyami, Sarah ;
Luqman, Hamzah ;
Hammoudeh, Mohammad .
INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (05)
[3]   Enhancing sign language recognition using CNN and SIFT: A case study on Pakistan sign language [J].
Arooj, Sadia ;
Altaf, Saud ;
Ahmad, Shafiq ;
Mahmoud, Haitham ;
Mohamed, Adamali Shah Noor .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (02)
[4]   Cross-lingual few-shot sign language recognition [J].
Bilge, Yunus Can ;
Ikizler-Cinbis, Nazli ;
Cinbis, Ramazan Gokberk .
PATTERN RECOGNITION, 2024, 151
[5]  
Bora Jyotishman, 2023, Procedia Computer Science, P1384, DOI 10.1016/j.procs.2023.01.117
[6]   VGG16-based intelligent image analysis in the pathological diagnosis of IgA nephropathy [J].
Chen, Ying ;
Chen, Yinyin ;
Fu, Shuangshuang ;
Yin, Wei ;
Liu, Kanghan ;
Qian, Shuyi .
JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2023, 16 (03)
[7]   Explainable federated learning for privacy-preserving bangla sign language detection [J].
Diba, Bidita Sarkar ;
Plabon, Jayonto Dutta ;
Rahman, M. D. Mahmudur ;
Mistry, Durjoy ;
Saha, Aloke Kumar ;
Mridha, M. F. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 134
[8]  
Koyineni Sai Niketh, 2024, Procedia Computer Science, V233, P269, DOI 10.1016/j.procs.2024.03.216
[9]   A two-stream sign language recognition network based on keyframe extraction method [J].
Liu, Tianyu ;
Tao, Tangfei ;
Zhao, Yizhe ;
Zhu, Jieli .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 253
[10]   A signer-independent sign language recognition method for the single-frequency dataset [J].
Liu, Tianyu ;
Tao, Tangfei ;
Zhao, Yizhe ;
Li, Min ;
Zhu, Jieli .
NEUROCOMPUTING, 2024, 582