A binarization method with learning-built rules for document images produced by cameras

被引:72
作者
Chou, Chien-Hsing [2 ]
Lin, Wen-Hsiung [1 ]
Chang, Fu [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan
关键词
Document image binarization; Global threshold; Image processing; Local threshold; Multi-label problem; Non-uniform brightness; Support vector machine; THRESHOLDING TECHNIQUES; PERFORMANCE; EXTRACTION; ALGORITHM;
D O I
10.1016/j.patcog.2009.10.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel binarization method for document images produced by cameras. Such images often have varying degrees of brightness and require more careful treatment than merely applying a statistical method to obtain a threshold value. To resolve the problem, the proposed method divides an image into several regions and decides how to binarize each region. The decision rules are derived from a learning process that takes training images as input. Tests on images produced under normal and inadequate illumination conditions show that our method yields better visual quality and better OCR performance than three global binarization methods and four locally adaptive binarization methods. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1518 / 1530
页数:13
相关论文
共 45 条
  • [1] AUTOMATIC THRESHOLDING OF GRAY-LEVEL PICTURES USING TWO-DIMENSIONAL ENTROPY
    ABUTALEB, AS
    [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1989, 47 (01): : 22 - 32
  • [2] Bernsen J., 1986, ICPR 86, P1251
  • [3] BOTTOU L, 1994, INT C PATT RECOG, P77, DOI 10.1109/ICPR.1994.576879
  • [4] AUTOMATIC BOUNDARY DETECTION OF LEFT VENTRICLE FROM CINEANGIOGRAMS
    CHOW, CK
    KANEKO, T
    [J]. COMPUTERS AND BIOMEDICAL RESEARCH, 1972, 5 (04): : 388 - &
  • [5] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [6] Eikvil L., 1991, Proceedings of the 1st International Conference on Document Analaysis and Recognition, P435
  • [7] AN ANALYSIS OF HISTOGRAM-BASED THRESHOLDING ALGORITHMS
    GLASBEY, CA
    [J]. CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1993, 55 (06): : 532 - 537
  • [8] MULTILEVEL THRESHOLDING USING EDGE MATCHING
    HERTZ, L
    SCHAFER, RW
    [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1988, 44 (03): : 279 - 295
  • [9] A comparison of methods for multiclass support vector machines
    Hsu, CW
    Lin, CJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02): : 415 - 425
  • [10] IMAGE THRESHOLDING BY MINIMIZING THE MEASURES OF FUZZINESS
    HUANG, LK
    WANG, MJJ
    [J]. PATTERN RECOGNITION, 1995, 28 (01) : 41 - 51