A Novel Shape-Based Character Segmentation Method for Devanagari Script

被引:3
作者
Jindal, Khushneet [1 ]
Kumar, Rajiv [1 ]
机构
[1] Thapar Univ, Comp Sci & Engn, Patiala, Punjab, India
关键词
Image segmentation; Image processing; Character recognition; RECOGNITION; KNOWLEDGE; SYSTEM; FUZZY;
D O I
10.1007/s13369-017-2420-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents a new algorithm to extract shape-oriented feature vectors using pixel intensities from offline printed Devanagari script documents. Almost, all the characters of the script contain Shirorekha (header line) on the upper portion, which makes segmentation a difficult and complex problem. The problem gets more challenging when images are in multiple gray levels, skewed and noisy. A new fast and effective algorithm is designed using gradient structural information, and its performance is evaluated on a challenging dataset containing 80 printed documents consisting of around 87,000 characters. Experimental results show that the proposed algorithm has 98.56% accuracy, which is 02.66% higher than that reported in literature. Also, the proposed algorithm is time efficient and less complex in comparison with the existing methods.
引用
收藏
页码:3221 / 3228
页数:8
相关论文
共 15 条
  • [1] Integrating knowledge sources in Devanagari text recognition system
    Bansal, V
    Sinha, RMK
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (04): : 500 - 505
  • [2] Support vector machines versus multi-layer perceptrons for efficient off-line signature recognition
    Frias-Martinez, E.
    Sanchez, A.
    Velez, J.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2006, 19 (06) : 693 - 704
  • [3] Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy, multifactorial analysis
    Garain, U
    Chaudhuri, BB
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2002, 32 (04): : 449 - 459
  • [4] Performance improvement of character recognition in industrial applications using prior knowledge for more reliable segmentation
    Grafmueller, Martin
    Beyerer, Juergen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (17) : 6955 - 6963
  • [5] Offline Recognition of Devanagari Script: A Survey
    Jayadevan, R.
    Kolhe, Satish R.
    Patil, Pradeep M.
    Pal, Umapada
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (06): : 782 - 796
  • [6] A Note on "Data mining based noise diagnosis and fuzzy filter design for image processing"
    Jindal, Khushneet
    Kumar, Rajiv
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2016, 49 : 50 - 51
  • [7] Devanagari OCR using a recognition driven segmentation framework and stochastic language models
    Kompalli, Suryaprakash
    Setlur, Srirangaraj
    Govindaraju, Venu
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2009, 12 (02) : 123 - 138
  • [8] Ma H., 2003, ACM T ASIAN LANG INF, V2, P193, DOI [10.1145/979872.979875, DOI 10.1145/979872.979875]
  • [9] An Intelligent System for Vehicle Access Control using RFID and ALPR Technologies
    Mohandes, M.
    Deriche, M.
    Ahmadi, H.
    Kousa, M.
    Balghonaim, A.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2016, 41 (09) : 3521 - 3530
  • [10] An approach to divide pre-detected Devanagari words from the scene images into characters
    Murthy, O. V. Ramana
    Roy, Sujoy
    Narang, Vipin
    Hanmandlu, M.
    Gupta, Shorya
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2013, 7 (06) : 1071 - 1082