Color, complex document segmentation and compression

被引:0
作者
Fung, HT
Parker, KJ
机构
来源
DOCUMENT RECOGNITION IV | 1997年 / 3027卷
关键词
document; color documents; complex documents; segmentation; and compression;
D O I
10.1117/12.270071
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We propose a novel segmentation algorithm called SMART (Segmentation by subjecting Macroblocks of Active Regions to the binarizability Test) for color, complex documents. It decomposes a document image into ''binarizable'' and ''non-binarizable'' components. The segmentation procedure includes color transformation, halftone texture suppression, subdivision of the image into 8x8 blocks, classification of the 8x8 blocks as ''active'' or ''inactive,'' formation of macroblocks from the active blocks, and classification of the macroblocks as binarizable or non-binarizable. The classification processes involve the DCT coefficients and a histogram analysis. SMART is compared to three well-known segmentation algorithms: CRLA,(1) RXYC,(2) and SPACE.(3) SMART can handle image components of various shapes, multiple backgrounds of different gray levels, different relative grayness of text to its background, tilted image components, and text of different gray levels. To compress the segmented image, we apply JPEG(4) to the non-binarizable macroblocks and the Group 4 coding scheme(5) to the binary image representing the binarizable macroblocks and to the bitmap, storing the configuration of all macroblocks. Data about the representative gray values, the color information, and other descriptors of the binarizable macroblocks and the background regions are also sent to allow image reconstruction. The gain in using our compression algorithm over using JPEG for the whole image is significant. This gain increases as the proportion of the size of the binarizable macroblocks and the background regions to the image size increases. Psychovisual experiments also show that the subjects prefer the reconstructed images from our compression algorithm to those from the bitrate-matching JPEG images. In a series of test images, this document segmentation and compression system enables compression ratios two times to six times improved over standard methods.
引用
收藏
页码:180 / 191
页数:2
相关论文
empty
未找到相关数据