A Novel Shape-Based Character Segmentation Method for Devanagari Script

被引:3
作者
Jindal, Khushneet [1 ]
Kumar, Rajiv [1 ]
机构
[1] Thapar Univ, Comp Sci & Engn, Patiala, Punjab, India
关键词
Image segmentation; Image processing; Character recognition; RECOGNITION; KNOWLEDGE; SYSTEM; FUZZY;
D O I
10.1007/s13369-017-2420-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents a new algorithm to extract shape-oriented feature vectors using pixel intensities from offline printed Devanagari script documents. Almost, all the characters of the script contain Shirorekha (header line) on the upper portion, which makes segmentation a difficult and complex problem. The problem gets more challenging when images are in multiple gray levels, skewed and noisy. A new fast and effective algorithm is designed using gradient structural information, and its performance is evaluated on a challenging dataset containing 80 printed documents consisting of around 87,000 characters. Experimental results show that the proposed algorithm has 98.56% accuracy, which is 02.66% higher than that reported in literature. Also, the proposed algorithm is time efficient and less complex in comparison with the existing methods.
引用
收藏
页码:3221 / 3228
页数:8
相关论文
共 15 条
[1]   Integrating knowledge sources in Devanagari text recognition system [J].
Bansal, V ;
Sinha, RMK .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (04) :500-505
[2]   Support vector machines versus multi-layer perceptrons for efficient off-line signature recognition [J].
Frias-Martinez, E. ;
Sanchez, A. ;
Velez, J. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2006, 19 (06) :693-704
[3]   Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy, multifactorial analysis [J].
Garain, U ;
Chaudhuri, BB .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2002, 32 (04) :449-459
[4]   Performance improvement of character recognition in industrial applications using prior knowledge for more reliable segmentation [J].
Grafmueller, Martin ;
Beyerer, Juergen .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (17) :6955-6963
[5]   Offline Recognition of Devanagari Script: A Survey [J].
Jayadevan, R. ;
Kolhe, Satish R. ;
Patil, Pradeep M. ;
Pal, Umapada .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (06) :782-796
[6]   A Note on "Data mining based noise diagnosis and fuzzy filter design for image processing" [J].
Jindal, Khushneet ;
Kumar, Rajiv .
COMPUTERS & ELECTRICAL ENGINEERING, 2016, 49 :50-51
[7]   Devanagari OCR using a recognition driven segmentation framework and stochastic language models [J].
Kompalli, Suryaprakash ;
Setlur, Srirangaraj ;
Govindaraju, Venu .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2009, 12 (02) :123-138
[8]  
Ma H., 2003, ACM T ASIAN LANG INF, V2, P193, DOI [10.1145/979872.979875, DOI 10.1145/979872.979875]
[9]   An Intelligent System for Vehicle Access Control using RFID and ALPR Technologies [J].
Mohandes, M. ;
Deriche, M. ;
Ahmadi, H. ;
Kousa, M. ;
Balghonaim, A. .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2016, 41 (09) :3521-3530
[10]   An approach to divide pre-detected Devanagari words from the scene images into characters [J].
Murthy, O. V. Ramana ;
Roy, Sujoy ;
Narang, Vipin ;
Hanmandlu, M. ;
Gupta, Shorya .
SIGNAL IMAGE AND VIDEO PROCESSING, 2013, 7 (06) :1071-1082