On foreground - background separation in low quality document images

Cited by: 13
Authors
Garain, Utpal
Paquet, Thierry
Heutte, Laurent
Affiliations
[1] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, India
[2] Univ Rouen, Lab PSI, CNRS, FRE 2645,UFR Sci, F-76821 Mont St Aignan, France
Keywords
document image analysis; historical documents; color image; binarization; foreground segmentation
DOI
10.1007/s10032-005-0007-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper addresses the separation of foreground and background in low-quality document images that suffer from various degradations, including scanning noise, aging effects, and uneven background or foreground. The proposed algorithm adapts well to uneven illumination and to local changes or nonuniformity in background and foreground colors. The approach is designed primarily for, but not restricted to, color documents, and it also works well in the gray-scale domain. The test set contains samples, in color as well as in gray scale, of old historical documents, including manuscripts of high importance. It consists of one hundred images selected from different sources, including image databases scanned from the working notebooks of famous writers who wrote with quill or pencil, producing very low contrast between foreground and background. The foreground extraction method is evaluated by computing the accuracy of extracting handwritten lines and words from the test images. This evaluation shows that the proposed method extracts lines and words with accuracies of about 84% and 93%, respectively. In addition to this quantitative evaluation, a qualitative evaluation compares the proposed method with one popular technique for foreground/background separation in document images.
Pages: 47-63
Page count: 17