Page segmentation using minimum homogeneity algorithm and adaptive mathematical morphology

被引:0
作者
Tuan Anh Tran
In Seop Na
Soo Hyung Kim
机构
[1] Chonnam National University,School of Electronic and Computer Engineering
来源
International Journal on Document Analysis and Recognition (IJDAR) | 2016年 / 19卷
关键词
Page segmentation; Document layout analysis; Homogeneity structure; OCR; Mathematical morphology; Recursive filter;
D O I
暂无
中图分类号
学科分类号
摘要
Document layout analysis or page segmentation is the task of decomposing document images into many different regions such as texts, images, separators, and tables. It is still a challenging problem due to the variety of document layouts. In this paper, we propose a novel hybrid method, which includes three main stages to deal with this problem. In the first stage, the text and non-text elements are classified by using minimum homogeneity algorithm. This method is the combination of connected component analysis and multilevel homogeneity structure. Then, in the second stage, a new homogeneity structure is combined with an adaptive mathematical morphology in the text document to get a set of text regions. Besides, on the non-text document, further classification of non-text elements is applied to get separator regions, table regions, image regions, etc. The final stage, in refinement region and noise detection process, all regions both in the text document and non-text document are refined to eliminate noises and get the geometric layout of each region. The proposed method has been tested with the dataset of ICDAR2009 page segmentation competition and many other databases with different languages. The results of these tests showed that our proposed method achieves a higher accuracy compared to other methods. This proves the effectiveness and superiority of our method.
引用
收藏
页码:191 / 209
页数:18
相关论文
共 50 条
  • [21] Segmentation of Dendritic Cells from Microscopic Images Using Mathematical Morphology
    Braiki, Marwa
    Benzinou, Abdesslam
    Nasreddine, Kamal
    Labidi, Salam
    Hymery, Nolwenn
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 277 - 282
  • [22] Image segmentation and restoration using inverse diffusion equations and mathematical morphology
    Dong, NL
    Jin, G
    Chen, HB
    Ma, JG
    Qi, B
    SAR IMAGE ANALYSIS, MODELING, AND TECHNIQUES V, 2003, 4883 : 213 - 220
  • [23] Improved Document Image Segmentation Algorithm using Multiresolution Morphology
    Bukhari, Syed Saqib
    Shafait, Faisal
    Breuel, Thomas M.
    DOCUMENT RECOGNITION AND RETRIEVAL XVIII, 2011, 7874
  • [24] Page segmentation using texture analysis
    Jain, AK
    Zhong, Y
    PATTERN RECOGNITION, 1996, 29 (05) : 743 - 770
  • [25] Pore feature segmentation based on mathematical morphology
    Qi, Heng-Nian
    Chen, Feng-Nong
    Ma, Ling-Fei
    IECON 2007: 33RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, CONFERENCE PROCEEDINGS, 2007, : 2474 - 2477
  • [26] Segmentation of vessel-like patterns using mathematical morphology and curvature evaluation
    Zana, F
    Klein, JC
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (07) : 1010 - 1019
  • [27] Blood Vessel Segmentation in Angiograms using Fuzzy Inference System and Mathematical Morphology
    Ashoorirad, Masoomeh
    Baghbani, Rasool
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2009, : 272 - 276
  • [28] A web page segmentation algorithm for extracting product information
    Wu, Changjun
    Zeng, Guosun
    Xu, Guorong
    2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 1374 - 1379
  • [29] Adaptive mathematical morphology - A survey of the field
    Curic, Vladimir
    Landstrom, Anders
    Thurley, Matthew J.
    Hendriks, Cris L. Luengo
    PATTERN RECOGNITION LETTERS, 2014, 47 : 18 - 28
  • [30] A Fast Automatic Method of Lung Segmentation in CT Images Using Mathematical Morphology
    Li, W.
    Nie, S. D.
    Cheng, J. J.
    WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2006, VOL 14, PTS 1-6, 2007, 14 : 2419 - +