Script-independent text line segmentation in freestyle handwritten documents

被引:120
作者
Li, Yi [1 ]
Zheng, Yefeng [2 ]
Doermann, David
Jaeger, Stefan [1 ,3 ]
机构
[1] Univ Maryland, Inst Adv Comp Studies, Language & Media Proc Lab, College Pk, MD 20742 USA
[2] Siemens Corp Res, Princeton, NJ 08540 USA
[3] Partner Inst Computat Biol, Grp Syst Bioinformat, CAS MPG, Shanghai 200031, Peoples R China
关键词
handwritten text line segmentation; document image analysis; density estimation; level set methods;
D O I
10.1109/TPAMI.2007.70792
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line segmentation in freestyle handwritten documents remains an open document analysis problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine-printed or hand-printed documents. In this paper, we propose a novel approach based on density estimation and a state-of-the-art image segmentation technique, the level set method. From an input document image, we estimate a probability map where each element represents the probability of the underlying pixel belonging to a text line. The level set method is then exploited to determine the boundary of neighboring text lines by evolving an initial estimate. Unlike connected component-based methods ([1] and [2], for example), the proposed algorithm does not use any script-specific knowledge. Extensive quantitative experiments on freestyle handwritten documents with diverse scripts such as Arabic, Chinese, Korean, and Hindi demonstrate that our algorithm consistently outperforms previous methods [1], [2], [3]. Further experiments show that the proposed algorithm is robust to scale change, rotation, and noise.
引用
收藏
页码:1313 / 1329
页数:17
相关论文
共 31 条
[1]  
[Anonymous], NATIONS NATL
[2]  
[Anonymous], THESIS U CALIFORNIA
[3]  
[Anonymous], 2000, Pattern Classification
[4]  
Baird H. S., 1993, Proceedings. Second Annual Symposium on Document Analysis and Information Retrieval, P1
[5]  
Cannon M., 1999, INT J DOC ANAL RECOG, V2, P80, DOI DOI 10.1007/S100320050039
[6]  
Doermann D, 2000, INT C PATT RECOG, P167, DOI 10.1109/ICPR.2000.902888
[7]  
JAEGER S, 2006, P SOC PHOTO-OPT INS, V13, P63
[8]   Document representation and its application to page decomposition [J].
Jain, AK ;
Yu, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (03) :294-308
[9]   NONLINEAR GLOBAL AND LOCAL DOCUMENT DEGRADATION MODELS [J].
KANUNGO, T ;
HARALICK, RM ;
PHILLIPS, I .
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1994, 5 (03) :220-230
[10]  
Li Y, 2006, IEEE COMMUN LETT, V10, P40, DOI [10.1109/LCOMM.2006.1576563, 10.1109/LCOMM.2006.01007]