New MDLSTM-based designs with data augmentation for offline Arabic handwriting recognition

被引:0
作者
Rania Maalej
Monji Kherallah
机构
[1] University of Sfax,National School of Engineers of Sfax
[2] University of Sfax,Faculty of Sciences
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Data augmentation; MDLSTM; Dropout; ReLU; Maxout; Offline arabic handwriting recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Although deep learning techniques have achieved promising results in several fields including healthcare, monitoring, and smart cities, their application in handwriting recognition shows limited results, especially for the offline Arabic handwritten script. Therefore, there is a need to enhance existing deep learning architectures. Moreover, the Multi-Dimensional Long Short-Term Memory (MDLSTM) leverages the LSTM model by replacing the single recurring connection with as many connections as there are spatiotemporal dimensions in the data. These connections allow the network to create a flexible internal context representation that is robust to local distortions. In this context, MDLSTM based architecture has been explored and three new architectures with Dropout, ReLU, and Maxout are proposed for offline Arabic handwriting recognition. Moreover, data augmentation approach has been applied to validate the proposed models. Indeed, a new dataset is developed by some morphological operations applied to the existing dataset “IFN/ENIT”. The experimental results show that our models outperform existing ones and the best accuracy of 92.59% was recorded with the MDLSTM-CTC-Maxout model trained with the original IFN/ENIT dataset. Moreover, data augmentation improves the MDLSTM-CTC-Maxout proposed model’s accuracy to reach 93.46%.
引用
收藏
页码:10243 / 10260
页数:17
相关论文
共 24 条
[1]  
Cai M(2016)Maxout neurons for deep convolutional and LSTM neural networks in speech recognition Speech Commun 77 53-64
[2]  
Liu J(2019)Boosting of deep convolutional architectures for Arabic handwriting recognition Int J Multimed Data Eng Manag 10 26-45
[3]  
Elleuch M(2016)A New design based-SVM of the CNN classifier architecture with dropout for offline Arabic handwritten recognition Procedia Comput Sci 80 1712-1723
[4]  
Kherallah M(2020)Comparative study on Deep Convolution Neural Networks DCNN-based offline arabic handwriting recognition IEEE Access 8 95465-95482
[5]  
Elleuch M(1982)The meaning and use of the area under a receiver operating characteristic (ROC) curve Radiology 143 29-36
[6]  
Maalej R(1997)Long short-term memory Neural Comput 9 1735-1780
[7]  
Kherallah M(2019)Bayesian versus convolutional networks for Arabic handwriting recognition Arab J Sci Eng 44 9301-9319
[8]  
Ghanim TM(2006)Offline Arabic handwriting recognition: a survey IEEE Trans Pattern Anal Mach Intell 28 712-724
[9]  
Khalil MI(2020)Improving the DBLSTM for on-line Arabic handwriting recognition Multimed Tools Appl 32 1860007-undefined
[10]  
Abbas HM(2018)Recognition of cursive Arabic handwritten text using embedded training based on hidden Markov models Int J Pattern Recognit Artif Intell undefined undefined-undefined