Optimal combination of document binarization techniques using a self-organizing map neural network

被引:41
作者
Badekas, E. [1 ]
Papamarkos, N. [1 ]
机构
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Image Proc & Multimedia Lab, GR-67100 Xanthi, Greece
关键词
binarization; thresholding; document processing;
D O I
10.1016/j.engappai.2006.04.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an integrated system for the binarization of normal and degraded printed documents for the purpose of visualization and recognition of text characters. In degraded documents, where considerable background noise or variation in contrast and illumination exists, there are many pixels that cannot be easily classified as foreground or background pixels. For this reason, it is necessary to perform document binarization by combining and taking into account the results of a set of binarization techniques, especially for document pixels that have high vagueness. The proposed binarization technique takes advantage of the benefits of a set of selected binarization algorithms by combining their results using a Kohonen self-organizing map neural network. Specifically, in the first stage the best parameter values for each independent binarization technique are estimated. In the second stage and in order to take advantage of the binarization information given by the independent techniques, the neural network is fed by the binarization results obtained by those techniques using their estimated best parameter values. This procedure is adaptive because the estimation of the best parameter values depends on the content of images. The proposed binarization technique is extensively tested with a variety of degraded document images. Several experimental and comparative results, exhibiting the performance of the proposed technique, are presented. (C) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:11 / 24
页数:14
相关论文
共 32 条
[1]  
[Anonymous], P ICDAR FRANC
[2]  
BADEKAS E, 2003, 3 INT S IM SIGN PROC, V2, P909
[3]  
Bernsen J., PROC INT CONF PATT R, P1251
[4]  
CHI Z, 1996, ALGORITHMS APPL IMAG
[5]   AUTOMATIC BOUNDARY DETECTION OF LEFT VENTRICLE FROM CINEANGIOGRAMS [J].
CHOW, CK ;
KANEKO, T .
COMPUTERS AND BIOMEDICAL RESEARCH, 1972, 5 (04) :388-&
[6]  
GORMAN LO, 1994, GRAPH MODEL IM PROC, V56, P494
[7]  
Haykin S., 1994, Neural networks: a comprehensive foundation
[8]  
KAMEL M, 1993, CVGIP-GRAPH MODEL IM, V55, P203, DOI 10.1006/cgip.1993.1015
[9]   A NEW METHOD FOR GRAY-LEVEL PICTURE THRESHOLDING USING THE ENTROPY OF THE HISTOGRAM [J].
KAPUR, JN ;
SAHOO, PK ;
WONG, AKC .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 29 (03) :273-285
[10]   MINIMUM ERROR THRESHOLDING [J].
KITTLER, J ;
ILLINGWORTH, J .
PATTERN RECOGNITION, 1986, 19 (01) :41-47