Comparative Analysis of Gabor and Discriminating Feature Extraction Techniques for Script Identification

Cited by: 0
Authors
Rani, Rajneesh [1]
Dhir, Renu [1]
Lehal, G. S. [2]
Affiliations
[1] NIT Jalandhar, Dept CSE, Jalandhar, Punjab, India
[2] Punjabi Univ, Dept CSE, Patiala, Punjab, India
Source
INFORMATION SYSTEMS FOR INDIAN LANGUAGES | 2011 / Vol. 139
Keywords
Script Identification; Gabor Features; Discriminating Features; Support Vector Machines; kNN
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Considerable success has been achieved in developing monolingual OCR systems for Indian scripts. But in a country like India, where many languages and scripts coexist, a single document often contains words from more than one script. A script identification system is therefore required to select the appropriate OCR. This paper presents a comparative analysis of two feature extraction techniques for word-level script identification. Discriminating features and Gabor-filter-based features are computed for Punjabi words and English numerals. The extracted features are classified with kNN and SVM classifiers to identify the script, and the resulting recognition rates are compared. With an appropriate value of k, an appropriate kernel function, and an appropriate combination of feature extraction and classification scheme, a significant drop in error rate is observed.
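The pipeline named in the abstract (Gabor-filter features fed to a nearest-neighbour classifier) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the kernel size, sigma, wavelength, and four orientations, the use of mean absolute response as the feature, the toy stripe images, and the labels `script_a` and `script_b`. The SVM branch and the discriminating features are not shown.

```python
import math

def gabor_kernel(size, sigma, theta, wavelength):
    """Real part of a Gabor kernel as a size x size nested list."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the sinusoid varies along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr * xr + yr * yr) / (2 * sigma * sigma))
            row.append(envelope * math.cos(2 * math.pi * xr / wavelength))
        kernel.append(row)
    return kernel

def gabor_features(image, size=5, sigma=2.0, wavelength=4.0):
    """One feature per orientation: mean absolute response over the valid region."""
    thetas = (0.0, math.pi / 4, math.pi / 2, 3 * math.pi / 4)
    features = []
    for theta in thetas:
        k = gabor_kernel(size, sigma, theta, wavelength)
        total, count = 0.0, 0
        for i in range(len(image) - size + 1):
            for j in range(len(image[0]) - size + 1):
                response = sum(k[a][b] * image[i + a][j + b]
                               for a in range(size) for b in range(size))
                total += abs(response)
                count += 1
        features.append(total / count)
    return features

def knn_predict(train, query, k=1):
    """train is a list of (features, label); majority vote among the k nearest."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = {}
    for _, label in nearest:
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)

# Toy 8x8 "word images" (hypothetical stand-ins for real script samples).
vertical = [[1 if j % 4 < 2 else 0 for j in range(8)] for _ in range(8)]
horizontal = [[1 if i % 4 < 2 else 0 for _ in range(8)] for i in range(8)]
query = [[1 if j % 4 in (1, 2) else 0 for j in range(8)] for _ in range(8)]

train = [(gabor_features(vertical), "script_a"),
         (gabor_features(horizontal), "script_b")]
predicted = knn_predict(train, gabor_features(query), k=1)
```

In this sketch the choice of k only matters once the training set holds several samples per class; with real data one would typically compare error rates across several values of k, which is the kind of tuning the abstract refers to.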
Pages: 174 / +
Number of pages: 2