Robust Binarization of Degraded Document Images Using Heuristics

Authors
Parker, Jon [1]
Frieder, Ophir [1]
Frieder, Gideon [1]
Affiliation
[1] Georgetown Univ, Dept Comp Sci, Washington, DC 20057 USA
Keywords
readability enhancement; historic document processing; document degradation
DOI
10.1117/12.2042581
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Historically significant documents are often discovered with defects that make them difficult to read and analyze. This is particularly troublesome when the defects prevent software from performing an automated analysis. Image enhancement methods are used to remove or minimize document defects, improve software performance, and generally make images more legible. We describe an automated image enhancement method that is independent of the input page and requires no training data. The approach applies to color or greyscale images containing handwritten script, typewritten text, images, and mixtures thereof. We evaluated the image enhancement method against the test images provided by the 2011 Document Image Binarization Contest (DIBCO). Our method outperforms all 2011 DIBCO entrants in terms of average F1 measure, and does so with significantly lower variance than the top contest entrants. The capability of the proposed method is also illustrated using select images from a collection of historic documents stored at the Yad Vashem Holocaust Memorial in Israel.
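The abstract reports results as an average F1 measure and its variance across test images. As a minimal sketch of how such a score can be computed for one binarized image against its ground truth, the following uses the standard pixel-level definition of precision and recall. It is not the authors' evaluation code: the function name `f1_measure`, the convention that 1 marks a text pixel, and the toy 2x3 images are all assumptions made for illustration.

```python
import statistics


def f1_measure(predicted, truth):
    """Pixel-level F1 for two equally sized 2D lists of 0/1 values.

    Assumption for this sketch: 1 marks a text (foreground) pixel.
    """
    tp = fp = fn = 0
    for pred_row, true_row in zip(predicted, truth):
        for p, t in zip(pred_row, true_row):
            if p and t:
                tp += 1      # text pixel correctly detected
            elif p and not t:
                fp += 1      # background pixel marked as text
            elif t and not p:
                fn += 1      # text pixel missed
    if tp == 0:
        return 0.0           # no correct text pixels: F1 is defined as 0 here
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy images (hypothetical data, not taken from DIBCO).
truth = [[1, 1, 0],
         [0, 1, 0]]
predicted = [[1, 0, 0],
             [0, 1, 1]]

score = f1_measure(predicted, truth)

# Average and variance over a set of per-image scores.
per_image = [score, f1_measure(truth, truth)]
mean_f1 = statistics.mean(per_image)
var_f1 = statistics.pvariance(per_image)
print(score, mean_f1, var_f1)
```

Reporting the variance alongside the mean, as the abstract does, shows whether a method is consistent across pages or does well only on some of them.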
Pages: 12
Related papers
  • [1] Robust Document Image Binarization Technique for Degraded Document Images
    Su, Bolan
    Lu, Shijian
    Tan, Chew Lim
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (04) : 1408 - 1417
  • [2] Adaptive Thresholding to Robust Image Binarization for Degraded Document Images
    Ingle, Prashant Devidas
    Kaur, Parminder
    2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 189 - 193
  • [3] Broken and degraded document images binarization
    Chen, Yiping
    Wang, Liansheng
    NEUROCOMPUTING, 2017, 237 : 272 - 280
  • [4] Hybrid Binarization Technique for Degraded Document Images
    Ranganatha, D.
    Holi, Ganga
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 893 - 898
  • [5] Modified Sauvola binarization for degraded document images
    Kaur, Amandeep
    Rani, Usha
    Josan, Gurpreet Singh
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 92
  • [6] A Multistage Binarization Technique for the Degraded Document Images
    Mousa, Usama W. A.
    Abd El Munim, Hossam E.
    Khalil, Mahmoud I.
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 332 - 337
  • [7] Efficient Binarization of Historical and Degraded Document Images
    Gatos, B.
    Pratikakis, I.
    Perantonis, S. J.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 447 - 454
  • [8] Binarization Techniques for Degraded Document Images - A Review
    Jyotsna
    Chauhan, Shivani
    Sharma, Ekta
    Doegar, Amit
    2016 5TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2016, : 163 - 166
  • [9] Parallel nonparametric binarization for degraded document images
    Chen, Xin
    Lin, Liang
    Gao, Yuefang
    NEUROCOMPUTING, 2016, 189 : 43 - 52
  • [10] A novel binarization system for degraded document images
    Xi, Yan
    Chen, Youbin
    Liao, Qingmin
    Leung Winghong
    Fung Shunming
    Deng Jiangwen
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 287 - +