A statistical-topological feature combination for recognition of handwritten numerals

被引:78
作者
Das, Nibaran [1 ]
Reddy, Jagan Mohan [1 ]
Sarkar, Ram [1 ]
Basu, Subhadip [1 ]
Kundu, Mahantapas [1 ]
Nasipuri, Mita [1 ]
Basu, Dipak Kumar [1 ]
机构
[1] Univ Jadavpur, Dept Comp Sci & Engn, Kolkata 700032, India
关键词
PCA; MPCA; Feature combination; SVM; Character recognition; Statistical; Topological; CHARACTER-RECOGNITION; SYSTEM;
D O I
10.1016/j.asoc.2012.03.039
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Principal Component Analysis (PCA) and Modular PCA (MPCA) are well known statistical methods for recognition of facial images. But only PCA/MPCA is found to be insufficient to achieve high classification accuracy required for handwritten character recognition application. This is due to the shortcomings of those methods to represent certain local morphometric information present in the character patterns. On the other hand Quad-tree based hierarchically derived Longest-Run (QTLR) features, a type of popularly used topological features for character recognition, miss some global statistical information of the characters. In this paper, we have introduced a new combination of PCA/MPCA and QTLR features for OCR of handwritten numerals. The performance of the designed feature-combination is evaluated on handwritten numerals of five popular scripts of Indian sub-continent, viz., Arabic, Bangla, Devanagari, Latin and Telugu with Support Vector Machine (SVM) based classifier. From the results it has been observed that MPCA + QTLR feature combination outperforms PCA + QTLR feature combination and most other conventional features available in the literature. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2486 / 2495
页数:10
相关论文
共 35 条
[1]   A Novel Domain-Specific Feature Extraction Scheme For Arabic Handwritten Digits Recognition [J].
Abdelazeem, Sherif .
EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, :247-252
[2]  
Basu S., 2005, P 2 IND INT C ART IN, P407
[3]   A novel framework for automatic sorting of postal documents with multi-script address blocks [J].
Basu, Subhadip ;
Das, Nibaran ;
Sarkar, Ram ;
Kundu, Mahantapas ;
Nasipuri, Mita ;
Basu, Dipak Kumar .
PATTERN RECOGNITION, 2010, 43 (10) :3507-3521
[4]  
Basu S, 2009, LECT NOTES COMPUT SC, V5909, P381, DOI 10.1007/978-3-642-11164-8_62
[5]   Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals [J].
Bhattacharya, Ujjwal ;
Chaudhuri, B. B. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (03) :444-457
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Handwriting recognition research: Twenty years of achievement ... and beyond Discussion [J].
Cheriet, Mohamed ;
El Yacoubi, Mounim ;
Fujisawa, Hiromichi ;
Lopresti, Daniel ;
Lorette, Guy .
PATTERN RECOGNITION, 2009, 42 (12) :3131-3135
[8]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[9]  
Correia SEN, 2002, INT C PATT RECOG, P127, DOI 10.1109/ICPR.2002.1047811
[10]  
Cortes C., 1995, Machine Learning, V297, P273, DOI [DOI 10.1007/BF00994018, DOI 10.1023/A:1022627411411]