Page segmentation using minimum homogeneity algorithm and adaptive mathematical morphology

被引:0
作者
Tuan Anh Tran
In Seop Na
Soo Hyung Kim
机构
[1] Chonnam National University,School of Electronic and Computer Engineering
来源
International Journal on Document Analysis and Recognition (IJDAR) | 2016年 / 19卷
关键词
Page segmentation; Document layout analysis; Homogeneity structure; OCR; Mathematical morphology; Recursive filter;
D O I
暂无
中图分类号
学科分类号
摘要
Document layout analysis or page segmentation is the task of decomposing document images into many different regions such as texts, images, separators, and tables. It is still a challenging problem due to the variety of document layouts. In this paper, we propose a novel hybrid method, which includes three main stages to deal with this problem. In the first stage, the text and non-text elements are classified by using minimum homogeneity algorithm. This method is the combination of connected component analysis and multilevel homogeneity structure. Then, in the second stage, a new homogeneity structure is combined with an adaptive mathematical morphology in the text document to get a set of text regions. Besides, on the non-text document, further classification of non-text elements is applied to get separator regions, table regions, image regions, etc. The final stage, in refinement region and noise detection process, all regions both in the text document and non-text document are refined to eliminate noises and get the geometric layout of each region. The proposed method has been tested with the dataset of ICDAR2009 page segmentation competition and many other databases with different languages. The results of these tests showed that our proposed method achieves a higher accuracy compared to other methods. This proves the effectiveness and superiority of our method.
引用
收藏
页码:191 / 209
页数:18
相关论文
共 50 条
[31]   A web page segmentation algorithm for extracting product information [J].
Wu, Changjun ;
Zeng, Guosun ;
Xu, Guorong .
2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, :1374-1379
[32]   A region merging algorithm using mathematical morphology: Application to Macula detection [J].
Zana, F ;
Meunier, I ;
Klein, JC .
MATHEMATICAL MORPHOLOGY AND ITS APPLICATIONS TO IMAGE AND SIGNAL PROCESSING, 1998, 12 :423-430
[33]   Fusion of Structure Adaptive Filtering and Mathematical Morphology for Vessel Segmentation in Fundus Images of Infants with Retinopathy of Prematurity [J].
Nisha, K. L. ;
Sreelekha, G. ;
Savithri, Sathidevi Puthumangalathu ;
Mohanachandran, Poornima ;
Vinekar, Anand .
2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
[34]   A High-Precision Algorithm for Phasor Measurement Using Mathematical Morphology [J].
Wang, Chao ;
Wang, Fangzong .
2010 ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2010,
[35]   Adaptive page segmentation for color technical journals' cover images [J].
Chen, WY ;
Chen, SY .
IMAGE AND VISION COMPUTING, 1998, 16 (12-13) :855-877
[36]   Dynamic Reactive Power Optimization Using Mathematical Morphology and Genetic Algorithm [J].
Zhang, Anan ;
Jiang, Zhenchao ;
Yang, Honggeng .
2008 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-5, 2008, :709-714
[37]   A clustering algorithm based on mathematical morphology [J].
Luo, Huilan ;
Kong, Fansheng ;
Zhang, Kejun ;
He, Lingmin .
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, :6064-+
[38]   Teeth segmentation in digitized dental X-ray films using mathematical morphology [J].
Said, Eyad Haj ;
Nassar, Diaa Eldin M. ;
Fahmy, Gamal ;
Ammar, Hany H. .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2006, 1 (02) :178-189
[39]   Mathematical morphology based lung segmentation using multiscale dense pyramid network architecture [J].
Kumar, Pradeep ;
Raja, Linesh ;
Soni, Pramod Kumar ;
Gaur, Kuntal .
JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2024, 27 (04) :1139-1149
[40]   Segmentation of skull and scalp in 3-D human MRI using mathematical morphology [J].
Dogdas, B ;
Shattuck, DW ;
Leahy, RM .
HUMAN BRAIN MAPPING, 2005, 26 (04) :273-285