A new efficient binarization method: application to degraded historical document images

被引:15
作者
Hadjadj, Zineb [1 ,3 ]
Cheriet, Mohamed [2 ]
Meziane, Abdelkrim [3 ]
Cherfa, Yazid [1 ]
机构
[1] Blida Univ, Elect Dept, Blida, Algeria
[2] Ecole Technol Super, Montreal, PQ, Canada
[3] Res Ctr Sci & Tech Informat Cerist, Algiers, Algeria
关键词
Document image; Binarization; Active contours; Image contrast; Average thresholding;
D O I
10.1007/s11760-017-1070-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Binarization is an important step in reading text documents automatically through optical character recognition. Old document images often suffer from degradations that make their binarization a challenging task. In this paper, a new binarization technique for degraded document images is presented. The proposed technique is based on active contours evolving according to intrinsic geometric measures of the document image. The image contrast that is defined by the local image maximum and minimum is used to automatically generate the initialization map of our active contour model; an average thresholding is also used to produce the final delineation and binarization. The proposed implementation benefits from the level set framework, which allows the simultaneous application of a large variety of forces at the stroke-background interface. Our binarization method involves the combination of those forces in a specific way. The efficiency of the proposed method is shown on both recent and historical document images of the Document Image Binarization Contest (DIBCO) datasets that include different types of degradations. The results are compared to a number of known techniques from the literature.
引用
收藏
页码:1155 / 1162
页数:8
相关论文
共 22 条
  • [1] Bersen J., 1986, Eighth International Conference on Pattern Recognition. Proceedings (Cat. No.86CH2342-4), P1251
  • [2] Bukhari Syed Saqib, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P61, DOI 10.1109/ICDAR.2009.204
  • [3] A double-threshold image binarization method based on edge detector
    Chen, Qiang
    Sun, Quan-sen
    Heng, Pheng Ann
    Xia, De-shen
    [J]. PATTERN RECOGNITION, 2008, 41 (04) : 1254 - 1267
  • [4] Adaptive degraded document image binarization
    Gatos, B
    Pratikakis, I
    Perantonis, SJ
    [J]. PATTERN RECOGNITION, 2006, 39 (03) : 317 - 327
  • [5] Gatos Basilis, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1375, DOI 10.1109/ICDAR.2009.246
  • [6] An Active Contour Based Method for Image Binarization: Application to degraded historical document images
    Hadjadj, Zineb
    Meziane, Abdelkrim
    Cheriet, Mohamed
    Cherfa, Yazid
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 655 - 660
  • [7] ISauvola: Improved Sauvola's Algorithm for Document Image Binarization
    Hadjadj, Zineb
    Meziane, Abdelkrim
    Cherfa, Yazid
    Cheriet, Mohamed
    Setitra, Insaf
    [J]. IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), 2016, 9730 : 737 - 745
  • [8] A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images
    Hedjam, Rachid
    Moghaddam, Reza Farrahi
    Cheriet, Mohamed
    [J]. PATTERN RECOGNITION, 2011, 44 (09) : 2184 - 2196
  • [9] Document binarization with automatic parameter tuning
    Howe, Nicholas R.
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2013, 16 (03) : 247 - 258
  • [10] MINIMUM ERROR THRESHOLDING
    KITTLER, J
    ILLINGWORTH, J
    [J]. PATTERN RECOGNITION, 1986, 19 (01) : 41 - 47