Robust Binarization of Degraded Document Images Using Heuristics

被引:0
|
作者
Parker, Jon [1 ]
Frieder, Ophir [1 ]
Frieder, Gideon [1 ]
机构
[1] Georgetown Univ, Dept Comp Sci, Washington, DC 20057 USA
来源
关键词
readability enhancement; historic document processing; document degradation;
D O I
10.1117/12.2042581
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Historically significant documents are often discovered with defects that make them difficult to read and analyze. This fact is particularly troublesome if the defects prevent software from performing an automated analysis. Image enhancement methods are used to remove or minimize document defects, improve software performance, and generally make images more legible. We describe an automated, image enhancement method that is input page independent and requires no training data. The approach applies to color or greyscale images with hand written script, typewritten text, images, and mixtures thereof. We evaluated the image enhancement method against the test images provided by the 2011 Document Image Binarization Contest (DIBCO). Our method outperforms all 2011 DIBCO entrants in terms of average F1 measure - doing so with a significantly lower variance than top contest entrants. The capability of the proposed method is also illustrated using select images from a collection of historic documents stored at Yad Vashem Holocaust Memorial in Israel.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] An adaptive logical method for binarization of degraded document images
    Yang, YB
    Yan, H
    PATTERN RECOGNITION, 2000, 33 (05) : 787 - 807
  • [22] Binarization of Degraded Document Images with Generalized Gaussian Distribution
    Krupinski, Robert
    Lech, Piotr
    Teclaw, Mateusz
    Okarma, Krzysztof
    COMPUTATIONAL SCIENCE - ICCS 2019, PT V, 2019, 11540 : 177 - 190
  • [23] An adaptive water flow model for binarization of degraded document images
    Valizadeh, Morteza
    Kabir, Ehsanollah
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2013, 16 (02) : 165 - 176
  • [24] Quality evaluation of degraded document images for binarization result prediction
    Rabeux, V.
    Journet, N.
    Vialard, A.
    Domenger, J. P.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (02) : 125 - 137
  • [25] Quality evaluation of degraded document images for binarization result prediction
    V. Rabeux
    N. Journet
    A. Vialard
    J. P. Domenger
    International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 125 - 137
  • [26] A Global-to-Local Approach to Binarization of Degraded Document Images
    Biswas, Barun
    Bhattacharya, Ujjwal
    Chaudhuri, Bidyut B.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3008 - 3013
  • [27] A Contrast Independent Algorithm for Adaptive Binarization of Degraded Document Images
    Valizadeh, M.
    Komeili, M.
    Armanfard, N.
    Kabir, E.
    2009 14TH INTERNATIONAL COMPUTER CONFERENCE, 2009, : 127 - 132
  • [28] An adaptive water flow model for binarization of degraded document images
    Morteza Valizadeh
    Ehsanollah Kabir
    International Journal on Document Analysis and Recognition (IJDAR), 2013, 16 : 165 - 176
  • [29] Robust Binarization of Stereo and Monocular Document Images Using Percentile Filter
    Afzal, Muhammad Zeshan
    Kraemer, Martin
    Bukhari, Syed Saqib
    Yousefi, Mohammad Reza
    Shafait, Faisal
    Breuel, Thomas M.
    CAMERA-BASED DOCUMENT ANALYSIS AND RECOGNITION, CBDAR 2013, 2014, 8357 : 139 - 149
  • [30] Ternary Entropy-based Binarization of Degraded Document Images Using Morphological Operators
    Le, T. Hoang Ngan
    Bui, Tien D.
    Suen, Ching Y.
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 114 - 118