Optimal combination of document binarization techniques using a self-organizing map neural network

Cited by: 41
Authors
Badekas, E. [1]
Papamarkos, N. [1]
Affiliations
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Image Proc & Multimedia Lab, GR-67100 Xanthi, Greece
Keywords
binarization; thresholding; document processing
DOI
10.1016/j.engappai.2006.04.003
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes an integrated system for the binarization of normal and degraded printed documents for the purpose of visualization and recognition of text characters. In degraded documents, where considerable background noise or variation in contrast and illumination exists, there are many pixels that cannot be easily classified as foreground or background pixels. For this reason, it is necessary to perform document binarization by combining and taking into account the results of a set of binarization techniques, especially for document pixels that have high vagueness. The proposed binarization technique takes advantage of the benefits of a set of selected binarization algorithms by combining their results using a Kohonen self-organizing map neural network. Specifically, in the first stage the best parameter values for each independent binarization technique are estimated. In the second stage, in order to take advantage of the binarization information given by the independent techniques, the neural network is fed with the binarization results obtained by those techniques using their estimated best parameter values. This procedure is adaptive because the estimation of the best parameter values depends on the content of the images. The proposed binarization technique is extensively tested with a variety of degraded document images. Several experimental and comparative results, exhibiting the performance of the proposed technique, are presented. © 2006 Elsevier Ltd. All rights reserved.
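The second stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record gives no network size, learning schedule, or list of binarization techniques, so everything below is an assumption made for illustration. The sketch assumes each technique has already produced a binary image (1 = foreground, 0 = background) with its best parameters, builds one feature vector per pixel from those outputs, and trains a map with only two output units. With two units and no neighborhood function, the Kohonen update reduces to plain competitive learning. The names `train_two_unit_som`, `winner`, and `combine_binarizations` are hypothetical.

```python
import random


def winner(weights, x):
    """Return the index of the unit whose weight vector is closest to x."""
    dists = [sum((w - xi) ** 2 for w, xi in zip(unit, x)) for unit in weights]
    return dists.index(min(dists))


def train_two_unit_som(samples, epochs=20, lr0=0.5, seed=0):
    """Train two competing units on per-pixel feature vectors.

    Assumption: two output units and no neighborhood function, so this is
    the simplest special case of a Kohonen map, not the paper's network.
    """
    k = len(samples[0])
    # Deterministic start: one unit near "all background", one near "all foreground".
    weights = [[0.25] * k, [0.75] * k]
    rng = random.Random(seed)
    order = list(range(len(samples)))
    for epoch in range(epochs):
        lr = lr0 * (1.0 - epoch / epochs)  # linearly decaying learning rate
        rng.shuffle(order)
        for i in order:
            x = samples[i]
            win = winner(weights, x)
            # Move only the winning unit toward the sample.
            weights[win] = [w + lr * (xi - w) for w, xi in zip(weights[win], x)]
    return weights


def combine_binarizations(results, **kwargs):
    """Combine equally sized binary images (lists of rows of 0/1) into one."""
    rows, cols = len(results[0]), len(results[0][0])
    # One feature vector per pixel: the output of every technique at that pixel.
    samples = [[float(r[y][x]) for r in results]
               for y in range(rows) for x in range(cols)]
    weights = train_two_unit_som(samples, **kwargs)
    # The unit with the larger weight sum is taken to represent foreground.
    fg = 0 if sum(weights[0]) > sum(weights[1]) else 1
    return [[1 if winner(weights, samples[y * cols + x]) == fg else 0
             for x in range(cols)]
            for y in range(rows)]


if __name__ == "__main__":
    # Five hypothetical technique outputs for a 2x3 image; two pixels are disputed.
    outputs = [
        [[1, 1, 0], [0, 0, 1]],
        [[1, 1, 0], [0, 1, 1]],
        [[1, 1, 0], [0, 0, 1]],
        [[1, 1, 0], [0, 0, 1]],
        [[1, 0, 0], [0, 0, 1]],
    ]
    print(combine_binarizations(outputs))
```

The first stage (estimating the best parameter values for each technique) is deliberately left out, since the abstract does not say how that estimation is performed.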
Pages: 11-24
Number of pages: 14