Page segmentation using the description of the background

Cited by: 51
Authors
Antonacopoulos, A [1]
Affiliation
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZF, Merseyside, England
DOI
10.1006/cviu.1998.0691
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There is an ever-increasing number of publications that do not have the "traditional" layout in which printed regions are rectangular. Text paragraphs and graphic areas may be of any shape, individually rotated, and in any arrangement. Previous document analysis techniques are not well suited to such complex layouts. This paper introduces a new method for the segmentation of images of document pages having both traditional and complex layouts. The underlying idea is to efficiently produce a flexible description (by means of tiles) of the background space that surrounds the printed regions in the page image under all of the above conditions. Using this description of space, the contours of printed regions are identified with significant accuracy. The new approach is fast because there is no need for skew detection and correction, and only a few simple operations are performed on the description of the background (not on the pixel-based data). © 1998 Academic Press.
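The abstract does not spell out how the tiles are built, so the following is only an illustrative sketch of the general idea of describing background space with rectangular tiles, not the paper's algorithm. It assumes a binary page image given as rows of 0 (background) and 1 (ink); the names `white_runs` and `background_tiles` are hypothetical. Each row is reduced to its white runs, and runs with the same horizontal extent on consecutive rows are stacked into one tile.

```python
from typing import Dict, List, Tuple

Tile = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive bounds


def white_runs(row: List[int]) -> List[Tuple[int, int]]:
    """Return (start, end) column spans of consecutive background (0) pixels."""
    runs = []
    start = None
    for x, pixel in enumerate(row):
        if pixel == 0 and start is None:
            start = x
        elif pixel != 0 and start is not None:
            runs.append((start, x - 1))
            start = None
    if start is not None:
        runs.append((start, len(row) - 1))
    return runs


def background_tiles(image: List[List[int]]) -> List[Tile]:
    """Cover the background with rectangles by stacking identical white runs.

    Assumption: a tile grows downward only while the same (left, right)
    span reappears on the next row. This is a simplification for illustration.
    """
    tiles: List[Tile] = []
    open_tiles: Dict[Tuple[int, int], int] = {}  # span -> top row of growing tile
    for y, row in enumerate(image):
        current = set(white_runs(row))
        # Close tiles whose span does not continue on this row.
        for span in list(open_tiles):
            if span not in current:
                top = open_tiles.pop(span)
                tiles.append((top, span[0], y - 1, span[1]))
        # Open a tile for every span that is new on this row.
        for span in current:
            open_tiles.setdefault(span, y)
    # Close whatever is still open at the bottom of the page.
    last = len(image) - 1
    for span, top in open_tiles.items():
        tiles.append((top, span[0], last, span[1]))
    return sorted(tiles)


# Toy page: two small printed regions surrounded by background.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
tiles = background_tiles(page)
```

Later steps would then operate on the short tile list rather than on individual pixels, which is the efficiency argument the abstract makes. Handling rotated or non-rectangular regions, as the paper claims to do, would need a more flexible tile definition than this sketch uses.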
Pages: 350-369
Number of pages: 20