Automatic Handwritten Indian Scripts Identification

Cited by: 29
Authors
Pardeshi, Rajmohan [1 ]
Chaudhuri, B. B. [2 ]
Hangarge, Mallikarjun [1 ]
Santosh, K. C. [3 ]
Affiliations
[1] Karnatak Arts Sci & Commerce Coll, Dept Comp Sci, Bidar, India
[2] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, India
[3] NIH, US Natl Lib Med NLM, Bethesda, MD 20894 USA
Source
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2014
Keywords
The Radon transform; wavelet transform; discrete cosine transform; statistical filters; Indian script identification; TEXTURE;
DOI
10.1109/ICFHR.2014.69
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Since OCR engines are usually script-dependent, automatic text recognition in multi-script documents requires a pre-processing module that identifies the scripts. Motivated by this, we present a word-level handwritten Indian script identification technique. Words are first segmented by morphological dilation followed by connected component labelling. We then employ the Radon transform, discrete wavelet transform, statistical filters and discrete cosine transform to extract directional multi-resolution spatial features. We tested the features using linear discriminant analysis, support vector machine and K-nearest neighbour classifiers over 11 major Indian scripts (including Roman) in bi-script and tri-script scenarios. In our tests, we achieved maximum accuracies of 98% and 96% for the bi-script and tri-script scenarios, respectively.
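The word segmentation step named in the abstract (morphological dilation, then connected component labelling) can be illustrated with a minimal, dependency-free sketch. The abstract does not give the structuring element or any parameter values, so the flat horizontal element, the `reach` parameter and all function names below are assumptions made for illustration only, not the authors' implementation.

```python
from collections import deque


def dilate_horizontally(image, reach):
    """Binary dilation with a flat 1 x (2*reach + 1) structuring element.

    `reach` is a hypothetical parameter: how many pixels each ink pixel
    is smeared to the left and right so strokes of one word merge.
    """
    height, width = len(image), len(image[0])
    out = [[0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            if image[r][c]:
                for cc in range(max(0, c - reach), min(width, c + reach + 1)):
                    out[r][cc] = 1
    return out


def label_components(image):
    """4-connected component labelling.

    Returns one inclusive bounding box (top, left, bottom, right) per
    component, in raster-scan order of each component's first pixel.
    """
    height, width = len(image), len(image[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r0 in range(height):
        for c0 in range(width):
            if not image[r0][c0] or seen[r0][c0]:
                continue
            # Breadth-first flood fill from an unvisited ink pixel.
            queue = deque([(r0, c0)])
            seen[r0][c0] = True
            top, left, bottom, right = r0, c0, r0, c0
            while queue:
                r, c = queue.popleft()
                top, bottom = min(top, r), max(bottom, r)
                left, right = min(left, c), max(right, c)
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < height and 0 <= nc < width
                            and image[nr][nc] and not seen[nr][nc]):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            boxes.append((top, left, bottom, right))
    return boxes


def segment_words(image, reach=3):
    """Dilate first, then treat each connected component as one word."""
    return label_components(dilate_horizontally(image, reach))


# Toy page: two "words", each made of two strokes 2 pixels apart,
# with a wide gap between the words.
page = [[0] * 30 for _ in range(5)]
for r in range(1, 4):
    for c in (2, 3, 6, 7, 20, 21, 24, 25):
        page[r][c] = 1

word_boxes = segment_words(page, reach=3)
```

On the toy page, labelling the raw image would yield four separate strokes, while dilating first merges the strokes of each word into one component. Note that the boxes describe the dilated blobs, so in practice they would be used to crop the original, undilated image before feature extraction.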
Pages: 375-380
Page count: 6
Related papers
19 in total
[1]  
Abe S., 2005, ADV PTRN RECOGNIT
[2]  
Bhardwaj A., 2009, SCRIPT IDENTIFICATIO, V7247
[3]   Texture for script identification [J].
Busch, A ;
Boles, WW ;
Sridharan, S .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (11) :1720-1732
[4]   Script Recognition-A Review [J].
Ghosh, Debashis ;
Dube, Tulika ;
Shivaprasad, Adamane P. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (12) :2142-2161
[5]   Composite Script Identification and Orientation Detection for Indian Text Images [J].
Ghosh, Shamita ;
Chaudhuri, Bidyut B. .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :294-298
[6]  
Hangarge M., 2012, INT C EM TRENDS EL C, V1, P215
[7]   Directional Discrete Cosine Transform for Handwritten Script Identification [J].
Hangarge, Mallikarjun ;
Santosh, K. C. ;
Pardeshi, Rajmohan .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :344-348
[8]   Automatic script identification from document images using cluster-based templates [J].
Hochberg, J ;
Kelly, P ;
Thomas, T ;
Kerns, L .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (02) :176-181
[9]  
Huanfeng Ma, 2003, Proceedings of the SPIE - The International Society for Optical Engineering, V5296, P124, DOI 10.1117/12.530538
[10]   A THEORY FOR MULTIRESOLUTION SIGNAL DECOMPOSITION - THE WAVELET REPRESENTATION [J].
MALLAT, SG .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (07) :674-693