Real-Time Air-Writing Recognition for Arabic Letters Using Deep Learning

Cited by: 0
Authors
Qedear, Aseel [1]
AlMatrafy, Aldanh [1]
Al-Sowat, Athary [1]
Saigh, Abrar [1]
Alayed, Asmaa [2]
Affiliations
[1] Umm Al Qura Univ, Coll Comp, Dept Comp Sci & Artificial Intelligence, Mecca 21955, Saudi Arabia
[2] Umm Al Qura Univ, Coll Comp, Dept Software Engn, Mecca 21955, Saudi Arabia
Keywords
deep learning; Arabic air-writing recognition; mid-air; Arabic alphabet; hand gestures; fingertips; writing; Arabic language
DOI
10.3390/s24186098
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Learning to write the Arabic alphabet is crucial for Arab children's cognitive development, enhancing their memory and retention skills. However, the lack of Arabic-language educational applications may hamper the effectiveness of their learning experience. To bridge this gap, SamAbjd was developed: an interactive web application that leverages deep learning techniques, including air-writing recognition, to teach Arabic letters. SamAbjd was tailored to user needs through extensive surveys conducted with mothers and teachers, and a comprehensive literature review was performed to identify effective teaching methods and models. The development process involved gathering data from three publicly available datasets, culminating in a collection of 31,349 annotated images of handwritten Arabic letters. To enhance the dataset's quality, data preprocessing techniques were applied, such as image denoising, grayscale conversion, and data augmentation. Two architectures, a convolutional neural network (CNN) and VGG16 (from the Visual Geometry Group), were evaluated for their effectiveness in recognizing air-written Arabic characters. Among the CNN models tested, the standout performer was a seven-layer model without dropout, which achieved a testing accuracy of 96.40%. This model also achieved a precision of 96.44% and an F1-score of 96.43%, indicating a good fit without overfitting. The web application, built with Flask in the PyCharm IDE, offers a robust and user-friendly interface. By incorporating deep learning techniques and user feedback, the web application meets educational needs effectively.
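As a rough illustration of how multi-class precision and F1-score figures like those reported above can be computed, the following is a minimal sketch in plain Python. It is not the authors' evaluation code: the function name `macro_precision_f1`, the letter labels, and the choice of macro averaging (an unweighted mean over classes) are all assumptions made for illustration, since the abstract does not state which averaging scheme was used.

```python
from collections import Counter


def macro_precision_f1(y_true, y_pred):
    """Return (macro precision, macro F1) for two equal-length label lists.

    Hypothetical helper: macro averaging is an assumption, not something
    the abstract specifies.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        raise ValueError("at least one sample is required")

    # Count true positives, false positives, and false negatives per class.
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1
            fn[truth] += 1

    precisions, f1_scores = [], []
    for label in labels:
        predicted = tp[label] + fp[label]  # samples predicted as this class
        actual = tp[label] + fn[label]     # samples truly in this class
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        f1_scores.append(f1)

    # Macro average: every class contributes equally, regardless of size.
    return sum(precisions) / len(labels), sum(f1_scores) / len(labels)


if __name__ == "__main__":
    # Toy labels for two letters; illustrative data only, not from the paper.
    truth = ["alif", "alif", "ba", "ba"]
    preds = ["alif", "ba", "ba", "ba"]
    precision, f1 = macro_precision_f1(truth, preds)
    print(f"macro precision={precision:.4f}, macro F1={f1:.4f}")
```

Macro averaging treats each letter class equally; a weighted average would instead scale each class by its number of test samples, and the two can differ when classes are imbalanced.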
Pages: 25