An Approach to Sri Lankan Sign Language Recognition Using Deep Learning with MediaPipe

被引:1
作者
Herath, Randika Jeewantha [1 ]
Ishanka, Piumi [1 ]
机构
[1] Sabaragamuwa Univ Sri Lanka, Dept Comp & Informat Syst, Belihuloya, Sri Lanka
来源
DIGITAL TECHNOLOGIES AND APPLICATIONS, ICDTA 2022, VOL 1 | 2022年 / 454卷
关键词
Computer vision; Gesture recognition; Deep learning; Sri Lankan sign language;
D O I
10.1007/978-3-031-01942-5_45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Currently, there are millions of people around the world with speech and hearing impairments, and according to the latest census, there are nearly 70,000 people who use Sri Lankan Sign Language. Sign language is a visual language, and it is the main medium of communication in their daily conversations. But they face obstacles when communicating with people who do not know sign language. There are communication barriers in different contexts, such as in work environments, knowledge exchange, and message sharing. Therefore, technology should play a major role in helping people with these hearing and speech impairments to improve their quality of life. This research aims to suggest models that use Google MediaPipe Hand Pose landmarks to identify Sri Lankan Sign Language. Moreover, this article compares vision-based approaches with convolutional neural network (CNN) and recurrent neural network (RNN). We also considered activation functions (such as ReLU, Linear, and Softmax), loss functions (mean squared error (MSE) and Categorical_crossentropy), and optimizations (Adam and Stochastic Gradient Descent (SGD)). The result showed that most algorithms built with Long Short-Term Memory (LSTM), CNN, and CNN-LSTM achieved an accuracy greater than 95%, both with the training dataset and the test dataset. In particular, models with MSE as the loss function and Adam as the optimizer showed higher accuracy.
引用
收藏
页码:449 / 459
页数:11
相关论文
共 32 条
[1]  
Ashok S., 2014, J. Theor. Appl. Inf. Technol., V67
[2]  
Bach Duy Khuat, 2021, ICSCA 2021: 2021 10th International Conference on Software and Computer Applications, P162, DOI 10.1145/3457784.3457810
[3]   Arabic Sign Language Recognition System Using 2D Hands and Body Skeleton Data [J].
Bencherif, Mohamed A. ;
Algabri, Mohammed ;
Mekhtiche, Mohamed A. ;
Faisal, Mohammed ;
Alsulaiman, Mansour ;
Mathkour, Hassan ;
Al-Hammadi, Muneer ;
Ghaleb, Hamid .
IEEE ACCESS, 2021, 9 :59612-59627
[4]   Learning Three-dimensional Skeleton Data from Sign Language Video [J].
Brock, Heike ;
Law, Felix ;
Nakadai, Kazuhiro ;
Nagashima, Yuji .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (03)
[5]  
Chaikaew Anusorn, 2021, 2021 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunication Engineering, P128, DOI 10.1109/ECTIDAMTNCON51128.2021.9425711
[6]   Backhand-View-Based Continuous-Signed-Letter Recognition Using a Rewound Video Sequence and the Previous Signed-Letter Information [J].
Chophuk, Ponlawat ;
Chamnongthai, Kosin .
IEEE ACCESS, 2021, 9 :40187-40197
[7]   Generating of Sign System for Bahasa Indonesia (SIBI) Root Word Gestures Using Deep Temporal Sigmoid Belief Network [J].
Darmana, Igm Surya A. ;
Rakun, Erdefi .
ICCAI '19 - PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, :221-225
[8]  
Deepak S., 2015, 2015 ANN IEEE INDIA, DOI [10.1109/INDICON.2015.7443381, DOI 10.1109/INDICON.2015.7443381]
[9]   CNN and Traditional Classifiers Performance for Sign Language Recognition [J].
Fayyaz, Sobia ;
Ayaz, Yasar .
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING (ICMLSC 2019), 2019, :192-196
[10]  
Fernando P., 2016, GSTF J. Comput. (JoC), V5, P1, DOI [DOI 10.7603/S40601-016-0009-8, 10.7603/s40601-016-0009-8]