BINARIZATION AND MULTITHRESHOLDING OF DOCUMENT IMAGES USING CONNECTIVITY

被引:73
|
作者
OGORMAN, L
机构
[1] AT and T Bell Labs, Murray Hill
来源
关键词
D O I
10.1006/cgip.1994.1044
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Thresholding is a common image processing operation applied to gray-scale images to obtain binary or multilevel images. Traditionally, one of two approaches is used: global or locally adaptive processing. However, each of these approaches has a disadvantage: the global approach neglects local information, and the locally adaptive approach neglects global information. A thresholding method is described here that is global in approach, but uses a measure of local information, namely connectivity. Thresholds are found at the intensity levels that best preserve the connectivity of regions within the image. Thus, this method has advantages of both global and locally adaptive approaches. This method is applied here to document images. Experimental comparisons against other thresholding methods show that the connectivity-preserving method yields much improved results. On binary images, this method has been shown to improve subsequent OCR recognition rates from about 958 to 97.5%. More importantly, the new method has been shown to reduce the number of binarization failures ( where text is so poorly binarized as to be totally unrecognizable by a commercial OCR system) from 33%: to 68 on difficult images. For multilevel document images, as well, the results shown similar improvement. (C) 1994 Academic Press, Inc.
引用
收藏
页码:494 / 506
页数:13
相关论文
共 50 条
  • [1] Binarization of MultiSpectral Document Images
    Hollaus, Fabian
    Diem, Markus
    Sablatnig, Robert
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 109 - 120
  • [2] Binarization of Colored Document Images using Spectral Clustering
    Elgbbas, Enas M.
    Khalil, Mahmoud I.
    Abbas, Hazem
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 411 - 416
  • [3] Binarization of document images using image dependent model
    Dawoud, A
    Kamel, M
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 49 - 53
  • [4] Robust Binarization of Degraded Document Images Using Heuristics
    Parker, Jon
    Frieder, Ophir
    Frieder, Gideon
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [5] Global Binarization of Document Images Using a Neural Network
    Khashman, Adnan
    Sekeroglu, Boran
    SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 665 - 672
  • [7] Binarization of Document Images: A Comprehensive Review
    Mustafa, Wan Azani
    Kader, Mohamed Mydin M. Abdul
    1ST INTERNATIONAL CONFERENCE ON GREEN AND SUSTAINABLE COMPUTING (ICOGES) 2017, 2018, 1019
  • [8] Deep semantic binarization for document images
    Mondal, Ajoy
    Reddy, Chetan
    Jawahar, C., V
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) : 6531 - 6555
  • [9] Binarization of document images with complex background
    Zhang Chong-yang
    Yang Jing-yu
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [10] EVALUATION OF BINARIZATION METHODS FOR DOCUMENT IMAGES
    TRIER, OD
    TAXT, T
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (03) : 312 - 315