BINARIZATION AND MULTITHRESHOLDING OF DOCUMENT IMAGES USING CONNECTIVITY

被引:73
|
作者
OGORMAN, L
机构
[1] AT and T Bell Labs, Murray Hill
来源
关键词
D O I
10.1006/cgip.1994.1044
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Thresholding is a common image processing operation applied to gray-scale images to obtain binary or multilevel images. Traditionally, one of two approaches is used: global or locally adaptive processing. However, each of these approaches has a disadvantage: the global approach neglects local information, and the locally adaptive approach neglects global information. A thresholding method is described here that is global in approach, but uses a measure of local information, namely connectivity. Thresholds are found at the intensity levels that best preserve the connectivity of regions within the image. Thus, this method has advantages of both global and locally adaptive approaches. This method is applied here to document images. Experimental comparisons against other thresholding methods show that the connectivity-preserving method yields much improved results. On binary images, this method has been shown to improve subsequent OCR recognition rates from about 958 to 97.5%. More importantly, the new method has been shown to reduce the number of binarization failures ( where text is so poorly binarized as to be totally unrecognizable by a commercial OCR system) from 33%: to 68 on difficult images. For multilevel document images, as well, the results shown similar improvement. (C) 1994 Academic Press, Inc.
引用
收藏
页码:494 / 506
页数:13
相关论文
共 50 条
  • [11] Assessing Binarization Techniques for Document Images
    Lins, Rafael Dueire
    de Almeida, Marcos Martins
    Bernardino, Rodrigo Barros
    Jesus, Darlisson
    Oliveira, Jose Mario
    PROCEEDINGS OF THE 2017 ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 17), 2017, : 183 - 192
  • [12] Deep semantic binarization for document images
    Ajoy Mondal
    Chetan Reddy
    C. V. Jawahar
    Multimedia Tools and Applications, 2023, 82 : 6531 - 6555
  • [13] Adaptive binarization of historical document images
    Kavallieratou, Ergina
    Stathis, Stamatatos
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 742 - +
  • [14] Broken and degraded document images binarization
    Chen, Yiping
    Wang, Liansheng
    NEUROCOMPUTING, 2017, 237 : 272 - 280
  • [15] Optimum binarization of technical document images
    Valverde, JS
    Grigat, RR
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 985 - 988
  • [16] Robust Binarization of Stereo and Monocular Document Images Using Percentile Filter
    Afzal, Muhammad Zeshan
    Kraemer, Martin
    Bukhari, Syed Saqib
    Yousefi, Mohammad Reza
    Shafait, Faisal
    Breuel, Thomas M.
    CAMERA-BASED DOCUMENT ANALYSIS AND RECOGNITION, CBDAR 2013, 2014, 8357 : 139 - 149
  • [17] Robust Document Image Binarization Technique for Degraded Document Images
    Su, Bolan
    Lu, Shijian
    Tan, Chew Lim
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (04) : 1408 - 1417
  • [18] Hybrid Binarization Technique for Degraded Document Images
    Ranganatha, D.
    Holi, Ganga
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 893 - 898
  • [19] Modified Sauvola binarization for degraded document images
    Kaur, Amandeep
    Rani, Usha
    Josan, Gurpreet Singh
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 92
  • [20] Binarization of the Noisy Document Images: A New Approach
    Malakar, Samir
    Mohanta, Dheeraj
    Sarkar, Ram
    Das, Nibaran
    Nasipuri, Mita
    Basu, D. K.
    COMPUTER NETWORKS AND INTELLIGENT COMPUTING, 2011, 157 : 511 - +