Document Image Binarization Using "Multi-Scale" Predefined Filters

被引:0
作者
Saabni, Raid M. [1 ,2 ]
机构
[1] Tel Aviv Yaffo Acad Coll, Dept Comp Sci, Tel Aviv, Israel
[2] Triangle Res & Dev, IL-30075 Kafr Qarea, Israel
来源
NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017) | 2018年 / 10615卷
关键词
Binarization; Ada-Boosting; Multi-scale" filters; Document Image Analysis; FRAMEWORK;
D O I
10.1117/12.2303604
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Reading text or searching for key words within a historical document is a very challenging task. one of the first steps of the complete task is binarization, where we separate foreground such as text, figures and drawings from the background. Successful results of this important step in many cases can determine next steps to success or failure, therefore it is very vital to the success of the complete task of reading and analyzing the content of a document image. Generally, historical documents images are of poor quality due to their storage condition and degradation over time, which mostly cause to varying contrasts, stains, dirt and seeping ink from reverse side. In this paper, we use banks of anisotropic predefined filters in different scales and orientations to develop a binarization method for degraded documents and manuscripts. Using the fact, that handwritten strokes may follow different scales and orientations, we use predefined sets of filter banks having various scales, weights, and orientations to seek a compact set of filters and weights in order to generate different layers of foregrounds and background. Results of convolving these filters on the gray level image locally, weighted and accumulated to enhance the original image. Based on the different layers, seeds of components in the gray level image and a learning process, we present an improved binarization algorithm to separate the background from layers of foreground. Different layers of foreground which may be caused by seeping ink, degradation or other factors are also separated from the real foreground in a second phase. Promising experimental results were obtained on the DIBCO2011, DIBCO2013 and H-DIBCO2016 data sets and a collection of images taken from real historical documents.
引用
收藏
页数:10
相关论文
共 21 条
  • [1] [Anonymous], 1985, INTRO DIGITAL IMAGE
  • [2] Input sensitive thresholding for ancient Hebrew manuscript
    Bar-Yosef, I
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (08) : 1168 - 1173
  • [4] Chaki N, 2014, COMPREHENSIVE SURVEY, P5
  • [5] A learning framework for the optimization and automation of document binarization methods
    Cheriet, Mohamed
    Moghaddam, Reza Farrahi
    Hedjam, Rachid
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (03) : 269 - 280
  • [6] Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
  • [7] THE DESIGN AND USE OF STEERABLE FILTERS
    FREEMAN, WT
    ADELSON, EH
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1991, 13 (09) : 891 - 906
  • [8] A decision-theoretic generalization of on-line learning and an application to boosting
    Freund, Y
    Schapire, RE
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) : 119 - 139
  • [9] Adaptive degraded document image binarization
    Gatos, B
    Pratikakis, I
    Perantonis, SJ
    [J]. PATTERN RECOGNITION, 2006, 39 (03) : 317 - 327
  • [10] Gatos Basilis, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1375, DOI 10.1109/ICDAR.2009.246