Sub-word Based Offline Handwritten Farsi Word Recognition Using Recurrent Neural Network

被引:11
作者
Ghadikolaie, Mohammad Fazel Younessy [1 ]
Kabir, Ehsanolah [2 ]
Razzazi, Farbod [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Elect & Comp Engn, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Elect & Comp Engn, Tehran, Iran
关键词
OCR; Handwritten recognition; Sub-word; PAW; Recurrent Neural Network; Farsi; Persian; Arabic; SEGMENTATION;
D O I
10.4218/etrij.16.0115.0542
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a segmentation-based method for offline Farsi handwritten word recognition. Although most segmentation-based systems suffer from segmentation errors within the first stages of recognition, using the inherent features of the Farsi writing script, we have segmented the words into sub-words. Instead of using a single complex classifier with many (N) output classes, we have created N simple recurrent neural network classifiers, each having only true/false outputs with the ability to recognize sub-words. Through the extraction of the number of sub-words in each word, and labeling the position of each sub-word (beginning/middle/end), many of the sub-word classifiers can be pruned, and a few remaining sub-word classifiers can be evaluated during the sub-word recognition stage. The candidate subwords are then joined together and the closest word from the lexicon is chosen. The proposed method was evaluated using the Iranshahr database, which consists of 17,000 samples of Iranian handwritten city names. The results show the high recognition accuracy of the proposed method.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 30 条
[1]  
Al-Hajj R, 2007, PROC INT CONF DOC, P959
[2]  
[Anonymous], 2009, Ethnologue: languages of the world
[3]  
[Anonymous], 2009, PEARSON ED INDIA
[4]   Handwritten Arabic word recognition: A review of common approaches [J].
Assma, O. H. ;
Khalifa, Othman O. ;
Hassan, Aisha .
2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, :801-805
[5]  
Bayesteh E., 2011, 7 IEEE IR C MACH VIS, P1, DOI DOI 10.1109/IRANIANMVIP.2011.6121550
[6]   Persian/arabic handwritten word recognition using M-band packet wavelet transform [J].
Broumandnia, A. ;
Shanbehzadeh, J. ;
Varnoosfaderani, M. Rezakhah .
IMAGE AND VISION COMPUTING, 2008, 26 (06) :829-842
[7]   VARIABLE DURATION HIDDEN MARKOV MODEL AND MORPHOLOGICAL SEGMENTATION FOR HANDWRITTEN WORD RECOGNITION [J].
CHEN, MY ;
KUNDU, A ;
SRIHARI, SN .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (12) :1675-1688
[8]   Handwritten Farsi (Arabic) word recognition: a holistic approach using discrete HMM [J].
Dehghan, M ;
Faez, K ;
Ahmadi, M ;
Shridhar, M .
PATTERN RECOGNITION, 2001, 34 (05) :1057-1065
[9]  
Dinges L., 2011, International Journal of Signal Processing, Image Processing, and Pattern recognition, V4, P131
[10]  
E-Hajj R, 2005, PROC INT CONF DOC, P893