Binarization of degraded document image based on feature space partitioning and classification

被引:0
作者
Morteza Valizadeh
Ehsanollah Kabir
机构
[1] Tarbiat Modares University,Department of Electrical Engineering
来源
International Journal on Document Analysis and Recognition (IJDAR) | 2012年 / 15卷
关键词
Degraded document; Binarization; Mode association clustering; Structural contrast; Feature space partitioning;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose a new algorithm for the binarization of degraded document images. We map the image into a 2D feature space in which the text and background pixels are separable, and then we partition this feature space into small regions. These regions are labeled as text or background using the result of a basic binarization algorithm applied on the original image. Finally, each pixel of the image is classified as either text or background based on the label of its corresponding region in the feature space. Our algorithm splits the feature space into text and background regions without using any training dataset. In addition, this algorithm does not need any parameter setting by the user and is appropriate for various types of degraded document images. The proposed algorithm demonstrated superior performance against six well-known algorithms on three datasets.
引用
收藏
页码:57 / 69
页数:12
相关论文
共 41 条
  • [1] Otsu N.(1979)A threshold selection method from grey level histogram IEEE Trans. Syst. Man Cybernet. 9 62-66
  • [2] Kapur J.N.(1985)A new method for gray level picture thresholding using the entropy of the histogram Comput. Vis. Graph. Image Process. 29 273-285
  • [3] Sahoo P.K.(1979)Histogram modification for threshold selection IEEE Trans. Syst. Man Cybernet. 9 38-52
  • [4] Wong A.K.C.(2004)Iterative multimodel subimage binarization for handwritten character segmentation IEEE Trans. Image Process. 13 1223-1230
  • [5] Weszka J.S.(1997)Document image binarization based on texture features IEEE Trans. Pattern Anal. Mach. Intell. 19 540-544
  • [6] Rosenfield A.(1983)Imager segmentation for optical character recognition and other applications requiring character image extraction IBM J. Res. Dev. 27 400-411
  • [7] Dawoud A.(1991)Gray level thresholding in badly illuminated images IEEE Trans. Pattern Anal. Mach. Intell. 13 813-819
  • [8] Kamel M.S.(1993)Extraction of binary character/graphics images from grayscale document images Graph. Models Image Process. 55 203-217
  • [9] Liu Y.(2005)Decompose algorithm for thresholding degraded historical document images IEE Proc. Vis. Image Signal Process. 152 702-714
  • [10] Srihari S.N.(2000)An adaptive logical method for binarization of degraded document images Pattern Recognit. 33 787-807