On foreground-background separation in low quality document images

Authors
Utpal Garain
Thierry Paquet
Laurent Heutte
Affiliations
[1] Indian Statistical Institute, Computer Vision & Pattern Recognition Unit
[2] University of Rouen, Laboratoire PSI
Source
International Journal of Document Analysis and Recognition (IJDAR) | 2006, Vol. 8
Keywords
Document image analysis; Historical documents; Color image; Binarization; Foreground segmentation
Abstract
This paper addresses the separation of foreground from background in low-quality document images that suffer from degradations such as scanning noise, aging effects, and uneven background or foreground. The proposed algorithm adapts very well to uneven illumination and to local changes or nonuniformity in background and foreground colors. The approach is designed primarily for color documents, though not restricted to them, and it also works well in the gray-scale domain. The test set comprises one hundred images, in color as well as gray scale, of old historical documents, including manuscripts of high importance. The images were selected from different sources, including image databases scanned from the working notebooks of famous writers who wrote with quill or pencil, which produces very low contrast between foreground and background. The foreground extraction method is evaluated by computing the accuracy of extracting handwritten lines and words from the test images. This evaluation shows that the proposed method extracts lines and words with accuracies of about 84% and 93%, respectively. In addition to this quantitative evaluation, a qualitative evaluation compares the proposed method with one popular technique for foreground/background separation in document images.
Pages: 47-63
Number of pages: 16