A new thresholding algorithm for document images based on the perception of objects by distance

Cited by: 26
Authors
Mesquita, R. G. [1 ]
Mello, C. A. B. [1 ]
Almeida, L. H. E. V. [1 ]
Affiliation
[1] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
Keywords
Document image processing; binarization; visual perception; document image analysis; image pixel classification; VISUAL-ACUITY; STRATEGIES; ENTROPY;
DOI
10.3233/ICA-130453
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work proposes a new method to enhance and binarize document images affected by several kinds of degradation. The method is based on the idea that the absolute difference between a document image and its background effectively emphasizes the text and attenuates degraded regions. The background generation is inspired by the human visual system and by the perception of objects by distance. Snellen's visual acuity notation is used to define how far an image must be from an observer so that the details of the characters are no longer perceived and only the background remains. A scheme that combines the k-means clustering algorithm and Otsu's thresholding method is also used to perform the binarization. The proposed method was tested on two datasets of document images (DIBCO 2011 and a dataset of real historical document images) with very satisfactory results.
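The core idea in the abstract (estimate a background, take the absolute difference, then threshold) can be illustrated with a minimal sketch. It is not the authors' implementation: the block-average `estimate_background` is an assumed stand-in for the Snellen-based distance step, the k-means stage is omitted, and all function names and the `block` parameter are hypothetical.

```python
import numpy as np

def estimate_background(gray, block=8):
    """Assumed stand-in for 'viewing from a distance': average over
    block x block tiles so fine character detail is lost."""
    h, w = gray.shape
    pad_h, pad_w = (-h) % block, (-w) % block
    padded = np.pad(gray.astype(np.float64),
                    ((0, pad_h), (0, pad_w)), mode="edge")
    H, W = padded.shape
    coarse = padded.reshape(H // block, block,
                            W // block, block).mean(axis=(1, 3))
    # Blow the coarse image back up to the original size.
    smooth = np.repeat(np.repeat(coarse, block, axis=0), block, axis=1)
    return smooth[:h, :w]

def otsu_threshold(values, bins=256):
    """Otsu's method: pick the histogram split that maximizes the
    between-class variance. Returns the upper edge of the last bin
    assigned to the low class."""
    hist, edges = np.histogram(values, bins=bins)
    hist = hist.astype(np.float64)
    centers = (edges[:-1] + edges[1:]) / 2.0
    w0 = np.cumsum(hist)                 # pixels at or below each bin
    w1 = w0[-1] - w0                     # pixels above each bin
    cum = np.cumsum(hist * centers)
    valid = (w0 > 0) & (w1 > 0)
    m0 = cum / np.where(w0 > 0, w0, 1.0)
    m1 = (cum[-1] - cum) / np.where(w1 > 0, w1, 1.0)
    between = np.where(valid, w0 * w1 * (m0 - m1) ** 2, 0.0)
    k = int(np.argmax(between))
    return edges[k + 1]

def binarize(gray, block=8):
    """Return a boolean mask that is True where text is detected."""
    background = estimate_background(gray, block)
    diff = np.abs(gray.astype(np.float64) - background)
    return diff >= otsu_threshold(diff.ravel())
```

For example, calling `binarize(page, block=8)` on a grayscale NumPy array `page` returns a mask of the same shape. In this sketch the tile size plays the role that viewing distance plays in the abstract: larger tiles remove more character detail from the background estimate.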
Pages: 133-146
Page count: 14