A new Arabic handwritten character recognition deep learning system (AHCR-DLS)

Cited by: 0
Authors
Hossam Magdy Balaha
Hesham Arafat Ali
Mohamed Saraya
Mahmoud Badawy
Affiliations
[1] Mansoura University, Computers and Systems Engineering Department, Faculty of Engineering
[2] Taibah University, Department of Computer Science and Informatics
Source
Neural Computing and Applications | 2021 / Volume 33
Keywords
Arabic handwritten character recognition; Classification; Convolutional neural network; Data augmentation; Deep learning; Optical character recognition; Optimizers
DOI
Not available
Abstract
Optical character recognition (OCR) of English text, whether printed or handwritten, may be considered one of the most important research topics. Although excellent results have been reached for English text, comparable research on Arabic text is lacking, owing to the nature of the Arabic alphabet and the multiple forms that a single letter can take. Arabic handwritten character recognition (AHCR) systems face several challenges, from finding a suitable public dataset of Arabic handwritten text, through segmentation and feature extraction, to recognition and classification. The paper has two objectives. First, it presents a large and complex Arabic handwritten character dataset (HMBD) for the training, testing, and validation phases, and discusses its collection, preparation, cleaning, and preprocessing. Second, it introduces a deep learning (DL) system with two convolutional neural network (CNN) architectures, named HMB1 and HMB2, that applies optimization, regularization, and dropout techniques. This system can serve as a baseline for future research on handwritten Arabic text. Several performance metrics were calculated, including accuracy, recall, precision, and F1. Sixteen experiments were run on the described system using HMBD and two other datasets, CMATER and AIA9k. The experimental results were recorded and compared to study the effects of weight initializers, optimizers, data augmentation, and regularization on overfitting and accuracy. The He Uniform weight initializer and the AdaDelta optimizer yielded the highest accuracies, and data augmentation improved accuracy. HMB1 reached a testing accuracy of 98.4% on HMBD with 865,840 records using augmentation. The CMATER and AIA9k datasets were used to validate generalization; with data augmentation, the best testing accuracies were 100% and 99.0%, respectively.
A cross-over validation between the described architectures and a previous state-of-the-art architecture and dataset was performed in two phases. First, the previous control architecture could not generalize to the dataset presented in this study. Second, the architectures described in this study generalized to the control dataset, with higher accuracies (97.3% and 96.8% for HMB1 and HMB2, respectively) than the accuracy reported in the selected control study.
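The abstract names accuracy, recall, precision, and F1 as the reported metrics but does not say how they were aggregated across character classes. As a minimal sketch, assuming macro averaging (an unweighted mean over classes, which is an assumption and not confirmed by the record), the four metrics could be computed as follows. The function name `macro_metrics` and the toy labels are hypothetical and used for illustration only.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Macro averaging is an assumption made for this sketch; the source
    does not state which averaging scheme the paper used.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1      # correct prediction for class t
        else:
            fp[p] += 1      # p was predicted but is wrong
            fn[t] += 1      # t was the true class but was missed

    def safe_div(a, b):
        # Avoid division by zero for classes that never occur.
        return a / b if b else 0.0

    precisions = [safe_div(tp[c], tp[c] + fp[c]) for c in labels]
    recalls = [safe_div(tp[c], tp[c] + fn[c]) for c in labels]
    f1s = [safe_div(2 * p * r, p + r) for p, r in zip(precisions, recalls)]

    n = len(labels)
    return {
        "accuracy": safe_div(sum(tp.values()), len(y_true)),
        "precision": safe_div(sum(precisions), n),
        "recall": safe_div(sum(recalls), n),
        "f1": safe_div(sum(f1s), n),
    }


# Toy example with two hypothetical character classes.
y_true = ["alif", "alif", "ba", "ba"]
y_pred = ["alif", "ba", "ba", "ba"]
metrics = macro_metrics(y_true, y_pred)
```

With many character classes of unequal size, macro averaging weights every class equally, so a rare letter form that is often misread lowers the score as much as a common one would.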
Pages: 6325-6367
Page count: 42