Optical character recognition of handwritten Arabic using hidden Markov models

Cited by: 1
Authors
Aulama, Mohannad M. [1]
Natsheh, Asem M. [1]
Abandah, Gheith A. [1]
Olama, Mohammed M. [2]
Affiliations
[1] Univ Jordan, Dept Comp Engn, Amman 11942, Jordan
[2] CSED, Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
Source
OPTICAL PATTERN RECOGNITION XXII | 2011 / Vol. 8055
Keywords
Character recognition; OCR; Arabic OCR; hidden Markov models (HMMs); Viterbi algorithm
DOI
10.1117/12.884087
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The problem of optical character recognition (OCR) of handwritten Arabic has not yet received a satisfactory solution. In this paper, an Arabic OCR algorithm is developed based on hidden Markov models (HMMs) combined with the Viterbi algorithm, which results in improved and more robust recognition of characters at the sub-word level. Integrating HMMs represents another step in the overall OCR trends currently being researched in the literature. The proposed approach exploits the structure of characters in the Arabic language, in addition to their extracted features, to achieve improved recognition rates. Useful statistical information about the Arabic language is first extracted and then used to estimate the probabilistic parameters of the mathematical HMM. A new custom implementation of the HMM is developed in this study, where the transition matrix is built from a large collected corpus and the emission matrix is built from the results obtained via the extracted character features. The recognition process is triggered using the Viterbi algorithm, which selects the most probable sequence of sub-words. The model was implemented to recognize the sub-word unit of Arabic text, so that the recognition rate is no longer tied to the worst recognition rate of any single character but instead draws on the overall structure of the Arabic language. Numerical results show a potentially large recognition improvement from using the proposed algorithms.
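The pipeline the abstract describes (transition probabilities estimated from a text corpus, emission scores taken from a per-character classifier, Viterbi decoding over a sub-word) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the add-one smoothing, the Latin placeholder alphabet, the toy corpus, and the emission values are all assumptions made for illustration.

```python
import math


def estimate_transitions(corpus, alphabet, smoothing=1.0):
    """Estimate start and transition probabilities from character bigram counts.

    Add-one style smoothing (an assumption here) keeps every probability
    strictly positive so that logarithms are always defined.
    """
    idx = {ch: i for i, ch in enumerate(alphabet)}
    n = len(alphabet)
    start = [smoothing] * n
    trans = [[smoothing] * n for _ in range(n)]
    for sub_word in corpus:
        if not sub_word:
            continue
        start[idx[sub_word[0]]] += 1
        for a, b in zip(sub_word, sub_word[1:]):
            trans[idx[a]][idx[b]] += 1
    start_total = sum(start)
    start = [c / start_total for c in start]
    trans = [[c / sum(row) for c in row] for row in trans]
    return start, trans


def viterbi(start, trans, emission):
    """Return the most probable character-index sequence.

    emission[t][s] is the (strictly positive) classifier score for
    character s at position t of the sub-word.
    """
    n = len(start)
    score = [math.log(start[s]) + math.log(emission[0][s]) for s in range(n)]
    back = []  # back[k][j]: best predecessor of state j at position k + 1
    for obs in emission[1:]:
        prev = score
        pointers, score = [], []
        for j in range(n):
            best = max(range(n), key=lambda i: prev[i] + math.log(trans[i][j]))
            pointers.append(best)
            score.append(prev[best] + math.log(trans[best][j]) + math.log(obs[j]))
        back.append(pointers)
    # Trace back from the best final state.
    state = max(range(n), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]


# Hypothetical toy data: Latin letters stand in for Arabic characters.
alphabet = ["a", "b", "c"]
corpus = ["ab"] * 8 + ["ac"]
# Made-up classifier scores for a two-character sub-word; at the second
# position the classifier alone slightly prefers "c" over "b".
emission = [[0.6, 0.2, 0.2],
            [0.1, 0.4, 0.5]]

start, trans = estimate_transitions(corpus, alphabet)
decoded = "".join(alphabet[s] for s in viterbi(start, trans, emission))
greedy = "".join(alphabet[max(range(len(alphabet)), key=lambda s: row[s])]
                 for row in emission)
print(decoded, greedy)
```

In this toy setup the per-character choice and the decoded sequence differ at the second position, because the corpus statistics make "b" far more likely than "c" after "a". This is the sense in which sub-word decoding relies on language structure rather than on each character's individual score.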
Pages: 12