A binarization method with learning-built rules for document images produced by cameras

被引:72
作者
Chou, Chien-Hsing [2 ]
Lin, Wen-Hsiung [1 ]
Chang, Fu [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan
关键词
Document image binarization; Global threshold; Image processing; Local threshold; Multi-label problem; Non-uniform brightness; Support vector machine; THRESHOLDING TECHNIQUES; PERFORMANCE; EXTRACTION; ALGORITHM;
D O I
10.1016/j.patcog.2009.10.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel binarization method for document images produced by cameras. Such images often have varying degrees of brightness and require more careful treatment than merely applying a statistical method to obtain a threshold value. To resolve the problem, the proposed method divides an image into several regions and decides how to binarize each region. The decision rules are derived from a learning process that takes training images as input. Tests on images produced under normal and inadequate illumination conditions show that our method yields better visual quality and better OCR performance than three global binarization methods and four locally adaptive binarization methods. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1518 / 1530
页数:13
相关论文
共 45 条
[1]   AUTOMATIC THRESHOLDING OF GRAY-LEVEL PICTURES USING TWO-DIMENSIONAL ENTROPY [J].
ABUTALEB, AS .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1989, 47 (01) :22-32
[2]  
Bernsen J., 1986, ICPR 86, P1251
[3]  
BOTTOU L, 1994, INT C PATT RECOG, P77, DOI 10.1109/ICPR.1994.576879
[4]   AUTOMATIC BOUNDARY DETECTION OF LEFT VENTRICLE FROM CINEANGIOGRAMS [J].
CHOW, CK ;
KANEKO, T .
COMPUTERS AND BIOMEDICAL RESEARCH, 1972, 5 (04) :388-&
[5]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[6]  
Eikvil L., 1991, Proceedings of the 1st International Conference on Document Analaysis and Recognition, P435
[7]   AN ANALYSIS OF HISTOGRAM-BASED THRESHOLDING ALGORITHMS [J].
GLASBEY, CA .
CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1993, 55 (06) :532-537
[8]   MULTILEVEL THRESHOLDING USING EDGE MATCHING [J].
HERTZ, L ;
SCHAFER, RW .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1988, 44 (03) :279-295
[9]   A comparison of methods for multiclass support vector machines [J].
Hsu, CW ;
Lin, CJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02) :415-425
[10]   IMAGE THRESHOLDING BY MINIMIZING THE MEASURES OF FUZZINESS [J].
HUANG, LK ;
WANG, MJJ .
PATTERN RECOGNITION, 1995, 28 (01) :41-51