Multithresholding of mixed-type documents

Cited by: 14
Authors
Strouthopoulos, C [1 ]
Papamarkos, N [1 ]
Affiliations
[1] Democritus Univ Thrace, Dept Elect & Comp Engn, Elect Circuits Anal Lab, GR-67100 Xanthi, Greece
Keywords
page layout analysis; multithreshold selection; document segmentation; neural networks
DOI
10.1016/S0952-1976(00)00004-X
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Mixed-type documents include text, drawings and graphics regions. A technique that reduces the number of gray levels in accordance with the type of each document region could be important for many document applications, such as storage, transmission and recognition. To solve this problem, this paper proposes a new method, called the document multithresholding technique. The method is based on a page layout analysis (PLA) technique and on a neural-network multilevel threshold-selection approach. The proposed technique is applicable to any mixed-type document and achieves document multithresholding by taking advantage of the types of the document blocks. Thus, in the final document, different block types are stored with an appropriate and limited number of gray-level values. The proposed method includes two main steps. First, a PLA technique is applied, which classifies the document blocks into text, line-drawing and graphics regions. In the second stage, a new neural-network multithresholding technique is applied to each of the document blocks. In text and line-drawing blocks, only one threshold is determined, whereas in graphics blocks the optimal number of thresholds is determined first. The performance of the method has been extensively tested on a variety of documents. Several examples illustrate the strength and the effectiveness of the proposed methodology. (C) 2000 Elsevier Science Ltd. All rights reserved.
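The second stage described in the abstract (one threshold for text and line-drawing blocks, several for graphics blocks) can be illustrated with a minimal Python sketch. This is not the authors' method: Otsu's between-class-variance criterion stands in for the neural-network threshold selection, equal-population quantiles stand in for the graphics-block thresholds, and the number of graphics thresholds is a fixed hypothetical parameter rather than an optimum determined per block. The block type is assumed to come from a prior PLA step that is not shown, and all function and parameter names are illustrative.

```python
from typing import List, Sequence, Tuple


def otsu_threshold(pixels: Sequence[int]) -> int:
    """Return the gray level t that maximizes between-class variance.

    Stand-in (assumption) for the paper's neural-network threshold selection.
    Pixels are 8-bit gray levels; class 0 is p <= t, class 1 is p > t.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with level <= t
    sum0 = 0  # sum of gray levels of those pixels
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def quantile_thresholds(pixels: Sequence[int], n_thresholds: int) -> List[int]:
    """Equal-population thresholds; a simple placeholder (assumption)."""
    ordered = sorted(pixels)
    n = len(ordered)
    picks = {ordered[(i * n) // (n_thresholds + 1)]
             for i in range(1, n_thresholds + 1)}
    return sorted(picks)


def multithreshold_block(pixels: Sequence[int], block_type: str,
                         n_thresholds: int = 3) -> Tuple[List[int], List[int]]:
    """Quantize one block according to its (assumed given) PLA block type."""
    if block_type in ("text", "line-drawing"):
        thresholds = [otsu_threshold(pixels)]  # a single threshold
    elif block_type == "graphics":
        # Hypothetical fixed count; the paper determines the optimal number.
        thresholds = quantile_thresholds(pixels, n_thresholds)
    else:
        raise ValueError(f"unknown block type: {block_type}")
    # Label of a pixel = number of thresholds strictly below it.
    labels = [sum(1 for t in thresholds if p > t) for p in pixels]
    return thresholds, labels


text_thresholds, text_labels = multithreshold_block(
    [10, 10, 12, 200, 205, 210], "text")
gfx_thresholds, gfx_labels = multithreshold_block(
    list(range(0, 256, 32)), "graphics", n_thresholds=3)
```

Under these assumptions a text block collapses to two gray levels, while a graphics block keeps `n_thresholds + 1` levels, which mirrors the abstract's point that each block type is stored with a limited number of gray-level values suited to it.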
Pages: 323-343
Page count: 21