Evaluation of deep learning approaches for optical character recognition in Urdu language

被引:0
作者
Riaz, Mehek [1 ]
Monir, Syed Muhammad Ghazanfar [2 ]
Hasan, Rija [2 ]
机构
[1] Karachi Inst Econ & Technol, Coll Comp & Informat Sci, Karachi, Pakistan
[2] Mohammad Ali Jinnah Univ, Dept Elect & Comp Engn, Karachi, Pakistan
关键词
Convolutional Approaches for OCR; Discriminative Analysis Densenet121; Inception V3; Optical Character Recognition; Resnet50; Support Vector Machine (SVM); Vggnet16; OCR; HISTOGRAM; FEATURES;
D O I
10.22581/muet1982.2204.15
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the evolving technological era, the optical character recognition systems have substantial execution, considering the widespread use of daily hand-written human transaction. Optical Character Recognition (OCR) is an implementation of Computer Vision that digitizes numerous hand dealt documents for further analysis and formatting. OCR is achieved by various ways such as discriminative analysis and deep learning. This paper focuses on evaluating deep learning models on a hand-written compiled dataset of Urdu digits. The evaluation is performed for deep convolutional learning algorithms; VGGNet16, InceptionV3, ResNet50, and DenseNet121. The convolutional models are pre-trained on the ImageNet. The model weights of fully connected layers have been evaluated, reducing the training time of the convolutional layers. The testing accuracy for the compiled dataset is observed as, ResNet50 with 96%, InceptionV3 with 95%, VGGNet16 with 95% and DenseNet121 with 94%.
引用
收藏
页码:146 / 156
页数:11
相关论文
共 41 条
[1]   Medical Handwritten Prescription Recognition Using CRNN [J].
Achkar, Roger ;
Ghayad, Khodor ;
Haidar, Rayan ;
Saleh, Sawsan ;
Al Hajj, Rana .
PROCEEDING OF THE 2019 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (IEEE CITS 2019), 2019, :211-215
[2]  
Adriano J. E. M., 2019, IOP Conference Series: Materials Science and Engineering
[3]  
Alrehali Bodour, 2020, 2020 6th Conference on Data Science and Machine Learning Applications (CDMA), P37, DOI 10.1109/CDMA47397.2020.00012
[4]  
Amalia A., 2018, 3 INT C INFORM COMPU
[5]  
[Anonymous], 2017, arXiv
[6]  
Bartz C, 2017, Arxiv, DOI arXiv:1707.08831
[7]   A multi-feature selection approach for gender identification of handwriting based on kernel mutual information [J].
Bi, Ning ;
Suen, Ching Y. ;
Nobile, Nicola ;
Tan, Jun .
PATTERN RECOGNITION LETTERS, 2019, 121 :123-132
[8]   Handwritten Urdu character recognition using one-dimensional BLSTM classifier [J].
Bin Ahmed, Saad ;
Naz, Saeeda ;
Swati, Salahuddin ;
Razzak, Muhammad Imran .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (04) :1143-1151
[9]  
Boukharouba Abdelhak, 2017, Applied Computing and Informatics, V13, P19, DOI 10.1016/j.aci.2015.05.001
[10]  
Chang M. C., 2017, 16 IEEE INT C MACHIN