BINARIZATION AND MULTITHRESHOLDING OF DOCUMENT IMAGES USING CONNECTIVITY

被引:73
|
作者
OGORMAN, L
机构
[1] AT and T Bell Labs, Murray Hill
来源
关键词
D O I
10.1006/cgip.1994.1044
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Thresholding is a common image processing operation applied to gray-scale images to obtain binary or multilevel images. Traditionally, one of two approaches is used: global or locally adaptive processing. However, each of these approaches has a disadvantage: the global approach neglects local information, and the locally adaptive approach neglects global information. A thresholding method is described here that is global in approach, but uses a measure of local information, namely connectivity. Thresholds are found at the intensity levels that best preserve the connectivity of regions within the image. Thus, this method has advantages of both global and locally adaptive approaches. This method is applied here to document images. Experimental comparisons against other thresholding methods show that the connectivity-preserving method yields much improved results. On binary images, this method has been shown to improve subsequent OCR recognition rates from about 958 to 97.5%. More importantly, the new method has been shown to reduce the number of binarization failures ( where text is so poorly binarized as to be totally unrecognizable by a commercial OCR system) from 33%: to 68 on difficult images. For multilevel document images, as well, the results shown similar improvement. (C) 1994 Academic Press, Inc.
引用
收藏
页码:494 / 506
页数:13
相关论文
共 50 条
  • [21] Binarization of Document Images with Various Object Sizes
    Hadjadj, Zineb
    Meziane, Abdelkrim
    2017 1ST INTERNATIONAL WORKSHOP ON ARABIC SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2017, : 21 - 25
  • [22] A binarization algorithm specialized on document images and photos
    Kavallieratou, E
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 463 - 467
  • [23] Binarization Techniques for Degraded Document Images - A Review
    Jyotsna
    Chauhan, Shivani
    Sharma, Ekta
    Doegar, Amit
    2016 5TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2016, : 163 - 166
  • [24] Parallel nonparametric binarization for degraded document images
    Chen, Xin
    Lin, Liang
    Gao, Yuefang
    NEUROCOMPUTING, 2016, 189 : 43 - 52
  • [25] A Multistage Binarization Technique for the Degraded Document Images
    Mousa, Usama W. A.
    Abd El Munim, Hossam E.
    Khalil, Mahmoud I.
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 332 - 337
  • [26] Binarization and Segmentation of Kannada Handwritten Document Images
    Vinod, H. C.
    Niranjan, S. K.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT 2018), 2018, : 488 - 493
  • [27] Adaptive, quadratic preprocessing of document images for binarization
    Mo, S
    Mathews, VJ
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (07) : 992 - 999
  • [28] Efficient Binarization of Historical and Degraded Document Images
    Gatos, B.
    Pratikakis, I.
    Perantonis, S. J.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 447 - 454
  • [29] A combined approach for the binarization of handwritten document images
    Ntirogiannis, K.
    Gatos, B.
    Pratikakis, I.
    PATTERN RECOGNITION LETTERS, 2014, 35 : 3 - 15
  • [30] A novel binarization system for degraded document images
    Xi, Yan
    Chen, Youbin
    Liao, Qingmin
    Leung Winghong
    Fung Shunming
    Deng Jiangwen
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 287 - +