Text line and word segmentation of handwritten documents

Cited by: 128
Authors
Louloudis, G. [1]
Gatos, B. [2]
Pratikakis, I. [2]
Halatsis, C. [1]
Affiliations
[1] Univ Athens, Dept Informat & Telecommun, GR-10679 Athens, Greece
[2] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, Computat Intelligence Lab, Athens 15310, Greece
Keywords
Handwritten document image analysis; Hough transform; Text line segmentation; Word segmentation; Gaussian mixture modeling; EXTRACTION; RECOGNITION;
DOI
10.1016/j.patcog.2008.12.016
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, we present a methodology for segmenting handwritten documents into their distinct entities, namely text lines and words. Text line segmentation is achieved by applying the Hough transform to a subset of the connected components of the document image. A post-processing step corrects possible false alarms, detects text lines that the Hough transform failed to create, and finally efficiently separates vertically connected characters using a novel method based on skeletonization. Word segmentation is addressed as a two-class problem. The distances between adjacent overlapped components in a text line are calculated using a combination of two distance metrics, and each distance is categorized as either an inter-word or an intra-word distance in a Gaussian mixture modeling framework. The performance of the proposed methodology is assessed with a consistent and concrete evaluation methodology that uses suitable performance measures to compare the text line and word segmentation results against the corresponding ground truth annotation. The efficiency of the proposed methodology is demonstrated by experiments on two datasets: (a) the test set of the ICDAR2007 handwriting segmentation competition and (b) a set of historical handwritten documents. © 2009 Elsevier Ltd. All rights reserved.
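The two-class treatment of word segmentation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each gap between adjacent components has already been reduced to a single scalar distance (the paper combines two distance metrics, which is not reproduced here), and the names `normal_pdf`, `fit_two_gaussians`, and `classify_gaps` are hypothetical. A two-component one-dimensional Gaussian mixture is fitted with EM, and each gap is assigned to the component with the larger posterior.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_two_gaussians(gaps, iters=100):
    """Fit a two-component 1-D Gaussian mixture to gap distances with EM.

    Assumption: gaps are plain scalar distances (e.g. in pixels).
    """
    lo, hi = min(gaps), max(gaps)
    means = [lo, hi]  # initialise one component at each extreme
    var0 = max(((hi - lo) / 4.0) ** 2, 1e-6)
    variances = [var0, var0]
    weights = [0.5, 0.5]
    for _ in range(iters):
        # E-step: responsibility of each component for each gap
        resp = []
        for g in gaps:
            p = [weights[k] * normal_pdf(g, means[k], variances[k]) for k in range(2)]
            total = sum(p) or 1e-300  # guard against underflow to zero
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weight, mean, and variance of each component
        for k in range(2):
            nk = sum(r[k] for r in resp)
            if nk < 1e-12:
                continue  # component is empty; keep its previous parameters
            means[k] = sum(r[k] * g for r, g in zip(resp, gaps)) / nk
            var = sum(r[k] * (g - means[k]) ** 2 for r, g in zip(resp, gaps)) / nk
            variances[k] = max(var, 1e-6)  # floor keeps the density finite
            weights[k] = nk / len(gaps)
    return weights, means, variances


def classify_gaps(gaps):
    """Label each gap "inter" (between words) or "intra" (within a word)."""
    weights, means, variances = fit_two_gaussians(gaps)
    inter = 0 if means[0] > means[1] else 1  # larger mean = inter-word gaps
    labels = []
    for g in gaps:
        p = [weights[k] * normal_pdf(g, means[k], variances[k]) for k in range(2)]
        labels.append("inter" if p[inter] >= p[1 - inter] else "intra")
    return labels


if __name__ == "__main__":
    # Hypothetical gap distances along one text line, in pixels.
    example_gaps = [2, 3, 2.5, 3.5, 20, 2, 3, 22, 2.5, 19]
    print(classify_gaps(example_gaps))
```

In this sketch the component with the larger mean is taken to model inter-word gaps, so a word boundary would be placed wherever a gap is labelled "inter". Fitting the mixture per document rather than with a fixed threshold is one way to adapt to a writer's spacing; whether the paper fits it per line, per page, or per dataset is not stated in the abstract.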
Pages: 3169-3183
Page count: 15