Document image segmentation using wavelet scale-space features

Cited by: 41
Authors
Acharyya, M [1]
Kundu, MK [1]
Affiliations
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700108, India
Keywords
document segmentation; M-band wavelet; texture segmentation
DOI
10.1109/TCSVT.2002.806812
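Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract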
In this paper, an efficient and computationally fast method for segmenting the text and graphics parts of document images based on textural cues is presented. We assume that the graphics part has different textural properties than the nongraphics (text) part. The segmentation method uses the notion of multiscale wavelet analysis and statistical pattern recognition. We have used M-band wavelets, which decompose an image into M x M bandpass channels. Various combinations of these channels represent the image at different scales and orientations in the frequency plane. The objective is to transform the edges between textures into detectable discontinuities and create the feature maps which give a measure of the local energy around each pixel at different scales. From these feature maps, a scale-space signature is derived, which is the vector of features at different scales taken at each single pixel in an image. We achieve segmentation by simple analysis of the scale-space signature with traditional k-means clustering. We do not assume any a priori information regarding the font size, scanning resolution, type of layout, etc. of the document in our segmentation scheme.
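The pipeline outlined in the abstract (band-pass channels, local energy maps, a per-pixel scale-space signature, then k-means) can be illustrated with a minimal sketch. This is not the authors' implementation: differences of box-blurred images stand in for the M-band wavelet filter bank as an assumption, and the names `box_blur`, `scale_space_signature`, `kmeans`, and `segment`, along with the chosen radii, are hypothetical.

```python
import numpy as np

def box_blur(img, radius):
    """Mean filter over a (2*radius+1) square window, with edge padding."""
    img = img.astype(float)
    if radius == 0:
        return img
    k = 2 * radius + 1
    h, w = img.shape
    padded = np.pad(img, radius, mode="edge")
    # Integral image with a leading zero row/column for window sums.
    ii = np.pad(np.cumsum(np.cumsum(padded, axis=0), axis=1), ((1, 0), (1, 0)))
    s = ii[k:k + h, k:k + w] - ii[:h, k:k + w] - ii[k:k + h, :w] + ii[:h, :w]
    return s / (k * k)

def scale_space_signature(img, radii=(1, 2, 4)):
    """Stack local-energy maps of crude band-pass channels, one per scale.

    Assumption: the difference of successive blurs replaces the paper's
    M-band wavelet channels.
    """
    prev = img.astype(float)
    feats = []
    for r in radii:
        cur = box_blur(img, r)
        band = prev - cur                      # detail lost between two scales
        feats.append(box_blur(band ** 2, r))   # local energy around each pixel
        prev = cur
    return np.stack(feats, axis=-1)            # shape (H, W, number of scales)

def kmeans(X, k=2, iters=20):
    """Plain Lloyd's k-means with a deterministic initialisation."""
    order = np.argsort(np.linalg.norm(X, axis=1))
    centers = X[order[np.linspace(0, len(X) - 1, k).astype(int)]].copy()
    for _ in range(iters):
        d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels

def segment(img, k=2):
    """Cluster the per-pixel signatures into k regions; returns an (H, W) label map."""
    sig = scale_space_signature(img)
    h, w, d = sig.shape
    return kmeans(sig.reshape(-1, d), k).reshape(h, w)

# Synthetic page: fine texture on the left, flat region on the right.
img = np.full((64, 64), 0.5)
yy, xx = np.mgrid[0:64, 0:32]
img[:, :32] = (yy + xx) % 2
labels = segment(img)
```

As in the abstract, the sketch takes no font size, resolution, or layout as input; only the number of clusters is fixed in advance.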
Pages: 1117-1127
Page count: 11