Recognition of Bangla compound characters using structural decomposition

被引:23
作者
Bag, Soumen [1 ]
Harit, Gaurav [2 ]
Bhowmick, Partha [3 ]
机构
[1] Int Inst Informat Technol Bhubaneswar, Dept Comp Sci & Engn, Bhubaneswar 751003, Orissa, India
[2] Indian Inst Technol, Ctr Informat & Commun Technol, Jodhpur 342011, Rajasthan, India
[3] Indian Inst Technol, Dept Comp Sci & Engn, Kharagpur 721302, W Bengal, India
关键词
Compound character recognition; Decomposition rules; Printed and handwritten Bangla character; Topological feature; Template matching; AUTOMATIC RECOGNITION; HANDWRITTEN BANGLA; OCR SYSTEM; SEGMENTATION; EXTRACTION;
D O I
10.1016/j.patcog.2013.08.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a novel character recognition method for Bangla compound characters. Accurate recognition of compound characters is a difficult problem due to their complex shapes. Our strategy is to decompose a compound character into skeletal segments. The compound character is then recognized by extracting the convex shape primitives and using a template matching scheme. The novelty of our approach lies in the formulation of appropriate rules of character decomposition for segmenting the character skeleton into stroke segments and then grouping them for extraction of meaningful shape components. Our technique is applicable to both printed and handwritten characters. The proposed method performs well for complex-shaped compound characters, which were confusing to the existing methods. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1187 / 1201
页数:15
相关论文
共 52 条
[1]  
Amin A, 1997, PROC INT CONF DOC, P596, DOI 10.1109/ICDAR.1997.620572
[2]  
[Anonymous], JOINT WORKSH MULT OC
[3]  
[Anonymous], INT C IM INF PROC
[4]  
Antani S., 1999, P INT C DOC AN REC, P218
[5]   A font and size-independent OCR system for printed Kannada documents using support vector machines [J].
Ashwin, TV ;
Sastry, PS .
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1) :35-58
[6]   An improved contour-based thinning method for character images [J].
Bag, Soumen ;
Harit, Gaurav .
PATTERN RECOGNITION LETTERS, 2011, 32 (14) :1836-1842
[7]  
Bahrampour A, 2009, LECT NOTES COMPUT SC, V5856, P321, DOI 10.1007/978-3-642-10268-4_38
[8]   Integrating knowledge sources in Devanagari text recognition system [J].
Bansal, V ;
Sinha, RMK .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (04) :500-505
[9]   A hierarchical approach to recognition of handwritten Bangla characters [J].
Basu, Subhadip ;
Das, Nibaran ;
Sarkar, Ram ;
Kundu, Mahantapas ;
Nasipuri, Mita ;
Basu, Dipak Kumar .
PATTERN RECOGNITION, 2009, 42 (07) :1467-1484
[10]   Fast polygonal approximation of digital curves using relaxed straightness properties [J].
Bhowmick, Partha ;
Bhattacharya, Bhargab B. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (09) :1590-1602