Segmentation of touching characters in printed Devnagari and Bangla scripts using fuzzy, multifactorial analysis

Cited by: 62
Authors
Garain, U [1]
Chaudhuri, BB [1]
Affiliations
[1] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, W Bengal, India
Source
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS | 2002 / Vol. 32 / No. 4
Keywords
fuzzy decision making; Indian script optical character recognition (OCR); multifactorial analysis; touching characters
DOI
10.1109/TSMCC.2002.807272
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One important reason for poor recognition rates in optical character recognition (OCR) systems is error in character segmentation. The presence of touching characters in scanned documents is a major obstacle to designing an effective character segmentation procedure. In this paper, a new technique is presented for the identification and segmentation of touching characters. The technique is based on fuzzy multifactorial analysis. A predictive algorithm is developed for effectively selecting possible cut columns for segmenting the touching characters. The proposed method has been applied to printed documents in Devnagari and Bangla, the two most popular scripts of the Indian subcontinent. The results obtained from a test set of considerable size show that a reasonable improvement in recognition rate can be achieved with a modest increase in computation.
Pages: 449-459
Number of pages: 11