An Enhanced Offline Printed Arabic OCR Model Based on Bio-Inspired Fuzzy Classifier

被引:6
作者
Darwish, Saad Mohamed [1 ]
Elzoghaly, Khaled Osama [1 ]
机构
[1] Alexandria Univ, Inst Grad Studies & Res, Alexandria 21526, Egypt
关键词
Arabic OCR; fuzzy classification; feature selection; GA; SCRIPT RECOGNITION;
D O I
10.1109/ACCESS.2020.3004286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the recent few years, there was a concentrated search on Arabic Optical Character Recognition (OCR), especially the recognition of scanned, offline, machine-printed documents. However, Arabic OCR consequences are dissatisfying and are still a developed research area. Finding the best feature extraction techniques and selecting an appropriate classification algorithm lead to supreme recognition accuracy and low computational overhead. This paper presents a new Arabic OCR model by integrating both of Genetic Algorithm (GA) and the Fuzzy K-Nearest Neighbor classifier (F-KNN) in a unified framework to enhance the identification accuracy. GA is utilized as a feature selection algorithm that has better convergence and spread of solutions with candid variation preservation mechanism. The F-KNN algorithm is more appropriate to classify ambiguous or uncertain data objects in the sense that every object belongs to all classes with different degrees of membership. The suggested model semantically fuses bio-inspired based feature vectors with fuzzy KNN classifier to build accurate membership function for each class. Experimental results compared to other approaches revealed the effectiveness of the suggested model and demonstrated that the feature selection approach increased the identification accuracy process.
引用
收藏
页码:117770 / 117781
页数:12
相关论文
共 42 条
[1]  
Abandah G., 2010, ENG SCI J, V37, P1
[2]  
Abed M. A., 2010, SSRN ELECT J, V8, P1
[3]  
Abu Doush I, 2018, INT CONF COMP SCI, P150, DOI 10.1109/CSIT.2018.8486162
[4]   A Deep Learning based Arabic Script Recognition System: Benchmark on KHAT [J].
Ahmad, Riaz ;
Naz, Saeeda ;
Afzal, Muhammad ;
Rashid, Sheikh ;
Liwicki, Marcus ;
Dengel, Andreas .
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (03) :299-305
[5]  
Al Tameemi A. M., 2011, Information Technology Journal, V10, P1754, DOI 10.3923/itj.2011.1754.1760
[6]  
Alghamdi M, 2018, INT J ADV COMPUT SC, V9, P415
[7]   Optical Character Recognition for Quranic Image Similarity Matching [J].
Alotaibi, Faiz ;
Abdullah, Muhamad Taufik ;
Abdullah, Rusli Bin Hj ;
Rahmat, Rahmita Wirza Binti O. K. ;
Hashem, Ibrahim Abaker Targio ;
Sangaiah, Arun Kumar .
IEEE ACCESS, 2018, 6 :554-562
[8]  
Amirfakhrian M., 2013, INT J MATH MODEL COM, V3, P109
[9]  
[Anonymous], J THEOR APPL INF TEC
[10]  
[Anonymous], 2015, Int. J. Signal Process, DOI DOI 10.14257/IJSIP.2015.8.2.37