Handwritten Arabic Optical Character Recognition Approach Based on Hybrid Whale Optimization Algorithm With Neighborhood Rough Set

被引:21
作者
Sahlol, Ahmed Talat [1 ]
Abd Elaziz, Mohamed [2 ]
Al-Qaness, Mohammed A. A. [3 ]
Kim, Sunghwan [4 ]
机构
[1] Damietta Univ, Fac Specif Educ, Comp Teacher Preparat Dept, Dumyat 34511, Egypt
[2] Zagazig Univ, Fac Sci, Dept Math, Zagazig 44519, Egypt
[3] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[4] Univ Ulsan, Sch Elect Engn, Ulsan 44610, South Korea
基金
新加坡国家研究基金会;
关键词
Machine learning approach; feature selection; optimization; Arabic handwritten character recognition; whale optimization; neighborhood rough set; optical character recognition (OCR); FEATURE-SELECTION;
D O I
10.1109/ACCESS.2020.2970438
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accomplishing high recognition performance is considered one of the most important tasks for handwritten Arabic character recognition systems. In general, Optical Character Recognition (OCR) systems are constructed from four phases: pre-processing, feature extraction, feature selection, and classification. Recent literature focused on the selection of appropriate features as a key point towards building a successful and sufficient character recognition system. In this paper, we propose a hybrid machine learning approach that utilizes neighborhood rough sets with a binary whale optimization algorithm to select the most appropriate features for the recognition of handwritten Arabic characters. To validate the proposed approach, we used the CENPARMI dataset, which is a well-known dataset for machine learning experiments involving handwritten Arabic characters. The results show clear advantages of the proposed approach in terms of recognition accuracy, memory footprint, and processor time than those without the features of the proposed method. When comparing the results of the proposed method with other recent state-of-the-art optimization algorithms, the proposed approach outperformed all others in all experiments. Moreover, the proposed approach shows the highest recognition rate with the smallest consumption time compared to deep neural networks such as VGGnet, Resnet, Nasnet, Mobilenet, Inception, and Xception. The proposed approach was also compared with recently published works using the same dataset, which further confirmed the outstanding classification accuracy and time consumption of this approach. The misclassified failure cases were studied and analyzed, which showed that they would likely be confusing for even Arabic natives because the correct interpretation of the characters required the context of their appearance.
引用
收藏
页码:23011 / 23021
页数:11
相关论文
共 61 条
  • [1] An improved social spider optimization algorithm based on rough sets for solving minimum number attribute reduction problem
    Abd El Aziz, Mohamed
    Hassanien, Aboul Ella
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 30 (08) : 2441 - 2452
  • [2] Abd El Aziz M, 2018, STUD COMPUT INTELL, V730, P23, DOI 10.1007/978-3-319-63754-9_2
  • [3] Modified cuckoo search algorithm with rough sets for feature selection
    Abd El Aziz, Mohamed
    Hassanien, Aboul Ella
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (04) : 925 - 934
  • [4] Whale Optimization Algorithm and Moth-Flame Optimization for multilevel thresholding image segmentation
    Abd El Aziz, Mohamed
    Ewees, Ahmed A.
    Hassanien, Aboul Ella
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 83 : 242 - 256
  • [5] Adjei O, 2001, JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, P980, DOI 10.1109/NAFIPS.2001.944738
  • [6] Recognition of On-line Arabic Handwritten Characters Using Structural Features
    Al-Taani, Ahmad T.
    Al-Haj, Saeed
    [J]. JOURNAL OF PATTERN RECOGNITION RESEARCH, 2010, 5 (01): : 23 - 37
  • [7] Alamri H, 2008, P 11 INT C FRONT HAN, P664
  • [8] Anaraki JR, 2013, 2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), P301, DOI 10.1109/IKT.2013.6620083
  • [9] [Anonymous], ADV INTELLIGENT SYST
  • [10] [Anonymous], THESIS