Robust Binarization of Degraded Document Images Using Heuristics

Authors
Parker, Jon [1]
Frieder, Ophir [1]
Frieder, Gideon [1]
Affiliations
[1] Georgetown Univ, Dept Comp Sci, Washington, DC 20057 USA
Keywords
readability enhancement; historic document processing; document degradation
DOI
10.1117/12.2042581
Chinese Library Classification
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Historically significant documents are often discovered with defects that make them difficult to read and analyze. This is particularly troublesome when the defects prevent software from performing an automated analysis. Image enhancement methods are used to remove or minimize document defects, improve software performance, and generally make images more legible. We describe an automated image enhancement method that is input-page independent and requires no training data. The approach applies to color or greyscale images containing handwritten script, typewritten text, images, and mixtures thereof. We evaluated the method against the test images provided by the 2011 Document Image Binarization Contest (DIBCO). Our method outperforms all 2011 DIBCO entrants in terms of average F1 measure, and does so with a significantly lower variance than the top contest entrants. The capability of the proposed method is also illustrated using select images from a collection of historic documents stored at the Yad Vashem Holocaust Memorial in Israel.
Pages: 12