Primitive Printed Arabic Optical Character Recognition using Statistical Features

被引:0
|
作者
Dahi, Mohamed [1 ]
Semary, Noura A. [1 ]
Hadhoud, Mohiy M. [1 ]
机构
[1] Menoufia Univ, Fac Comp & Informat, Dept Informat Technol, Shibin Al Kawm, Egypt
来源
2015 IEEE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INFORMATION SYSTEMS (ICICIS) | 2015年
关键词
AOCR;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the several forms of different Arabic font types, Arabic character recognition is still a challenge. Most literature works consider only one font per text what results in low recognition accuracy. This paper tends to enhance the accuracy of AOCR (Arabic Optical Character Recognition) by considering an automatic Optical Font Recognition (OFR) stage before going ahead with the traditional OCR stages. This has been achieved using SIFT (Scale Invariant Feature Transform) descriptors. First, a comparative study of four most recent algorithms of primitive OCR has been performed to evaluate the different features and classifiers utilized in their systems. Accordingly, a combining of statistical features have been proposed as well as selecting Random Forest Tree classifier for classification stage. The combination of the features are used to train the classifiers. As a result, each recognized text font is directed to a specific classifier tree. The proposed system was tested on a generated Primitive Arabic Characters Noise Free dataset (PAC-NF) containing 30000 samples. Experimental results achieved a promising character recognition accuracy of 99.8-100%.
引用
收藏
页码:567 / 571
页数:5
相关论文
共 50 条
  • [1] A Comparative Study of Different Approaches of Primitive Printed Arabic Optical Character Recognition
    Dahi, Mohamed
    Semary, Noura A.
    Hadhoud, Mohiy M.
    2015 11TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2015, : 105 - 110
  • [2] Optical Character Recognition of Arabic Printed Text
    Taha, Safwa
    Babiker, Yusra
    Abbas, Mohamed
    2012 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2012,
  • [3] Optical character recognition of arabic printed text
    Electrical and Electronics Engineering Department, University of Khartoum, Sudan
    SCOReD - IEEE Stud. Conf. Res. Dev., (235-240):
  • [4] Printed Arabic Character Recognition using Local Energy and Structural Features
    Zaafouri, Ahmed
    Sayadi, Mounir
    Fnaiech, Farhat
    2012 2ND INTERNATIONAL CONFERENCE ON COMMUNICATIONS, COMPUTING AND CONTROL APPLICATIONS (CCCA), 2012,
  • [5] Printed Arabic Optical Character Recognition using Support vector machine
    Yamina, Ouled Jaafri
    El Mamoun, Mamouni
    Kaddour, Sadouni
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MATHEMATICS AND INFORMATION TECHNOLOGY (ICMIT), 2017, : 134 - 140
  • [6] A Novel Approach to Printed Arabic Optical Character Recognition
    Al Ghamdi, Mansoor A.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 2219 - 2237
  • [7] A Novel Approach to Printed Arabic Optical Character Recognition
    Mansoor A. Al Ghamdi
    Arabian Journal for Science and Engineering, 2022, 47 : 2219 - 2237
  • [8] Printed Arabic character recognition using HMM
    Abbas H. Hassin
    Xiang-Long Tang
    Jia-Feng Liu
    Wei Zhao
    Journal of Computer Science and Technology, 2004, 19 : 538 - 543
  • [9] Printed Arabic character recognition using HMM
    Hassin, AH
    Tang, XL
    Liu, JF
    Zhao, W
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2004, 19 (04) : 538 - 543
  • [10] New statistical method for machine-printed Arabic character recognition
    Wang, H
    Ding, XQ
    Jin, JM
    Halmurat
    DOCUMENT RECOGNITION AND RETRIEVAL XII, 2005, 5676 : 127 - 135