Separating text and background in degraded document images - A comparison of global thresholding techniques for multi-stage thresholding

被引:53
作者
Leedham, G [1 ]
Varma, S [1 ]
Patankar, A [1 ]
Govindaraju, V [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 839798, Singapore
来源
EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS | 2002年
关键词
D O I
10.1109/IWFHR.2002.1030917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Before any processing of the textual content of a document image can be performed the text must be separated from the background of the image. Several thresholding algorithms have previously been proposed and are widely used in document processing. None have been shown effective at thresholding difficult documents where the background and foreground are non-uniform. In this paper we investigate the use of three global thresholding algorithms (Otsu's, Kapur's entropy and Solihin's quadratic integral ratio (QIR)) as the first stage in a multi-stage thresholding algorithm for use in degraded document images. It is concluded that Otsu's and Kapur's algorithms do not work well for difficult documents as they tend to over-threshold the image, thus losing much of the useful information. The QIR algorithm is more accurate in separating the foreground and background in these images, leaving a range of undecided, fuzzy, pixels for later processing in a subsequent stage.
引用
收藏
页码:244 / 249
页数:2
相关论文
共 7 条
[1]   AUTOMATIC THRESHOLDING OF GRAY-LEVEL PICTURES USING TWO-DIMENSIONAL ENTROPY [J].
ABUTALEB, AS .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1989, 47 (01) :22-32
[2]   A NEW METHOD FOR GRAY-LEVEL PICTURE THRESHOLDING USING THE ENTROPY OF THE HISTOGRAM [J].
KAPUR, JN ;
SAHOO, PK ;
WONG, AKC .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 29 (03) :273-285
[3]   MINIMUM ERROR THRESHOLDING [J].
KITTLER, J ;
ILLINGWORTH, J .
PATTERN RECOGNITION, 1986, 19 (01) :41-47
[4]   THRESHOLD SELECTION METHOD FROM GRAY-LEVEL HISTOGRAMS [J].
OTSU, N .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (01) :62-66
[5]   Integral ratio: A new class of global thresholding techniques for handwriting images [J].
Solihin, Y ;
Leedham, CG .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (08) :761-768
[6]   GOAL-DIRECTED EVALUATION OF BINARIZATION METHODS [J].
TRIER, OD ;
JAIN, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (12) :1191-1201
[7]   An adaptive logical method for binarization of degraded document images [J].
Yang, YB ;
Yan, H .
PATTERN RECOGNITION, 2000, 33 (05) :787-807