Deep features based convolutional neural network model for text and non-text region segmentation from document images

被引:16
作者
Umer, Saiyed
Mondal, Ranjan
Pandey, Hari Mohan [1 ,3 ]
Rout, Ranjeet Kumar [2 ]
机构
[1] Aliah Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Indian Stat Inst, Elect & Commun Sci Unit, Kolkata, India
[3] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
关键词
Complex layout; Document image; Text and Non-text region; Segmentation; Patch-based approach; Deep Learning Method; EXTRACTION;
D O I
10.1016/j.asoc.2021.107917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A deep convolutional neural network model is presented here which uses deep learning features for text and non-text region segmentation from document images. The key objective is to extract text regions from the complex layout document images without any prior knowledge of segmentation. In a real-world scenario, a document or magazine images contain various text information along with non-text regions such as symbols, logos, pictures, and graphics. Extraction of text regions from non-text regions is challenging. To mitigate these issues, an efficient and robust segmentation technique has been proposed in this paper. The implementation of the proposed model is divided into three phases: (a) a method for pre-processing of document images using different patch sizes is employed to handle the situations for variants of text fonts and sizes in mage; (b) a deep convolutional neural network model is proposed to predict the text or non-text or ambiguous region within the image; (c) a method for post-processing of document image is proposed to handle the situation where the image has complex ambiguous regions by utilizing the recursive partitioning of those regions into their proper classes (i.e. text or non-text) and then the system accumulates the responses of those predictive patches with varying resolutions for handling the situation of text fonts variations within the image. Extensive computer simulations have been conducted using a collection of complex layout magazine images from Google sites and the ICDAR 2015 database. Results are collected and compared with state-of-the-art methods. It reveals that the proposed model is robust and more effective as compared to state-of-the-art methods. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
[41]   Automated femur segmentation from computed tomography images using a deep neural network [J].
Bjornsson, P. A. ;
Helgason, B. ;
Palsson, H. ;
Sigurdsson, S. ;
Gudnason, V. ;
Ellingsen, L. M. .
MEDICAL IMAGING 2021: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2021, 11600
[42]   Segmentation of Intra-Retinal Cysts From Optical Coherence Tomography Images Using a Fully Convolutional Neural Network Model [J].
Girish, G. N. ;
Thakur, Bibhash ;
Chowdhury, Sohini Roy ;
Kothari, Abhishek R. ;
Rajan, Jeny .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (01) :296-304
[43]   A Fully-Automatic Segmentation of the Carpal Tunnel from Magnetic Resonance Images Based on the Convolutional Neural Network-Based Approach [J].
Yang, Tai-Hua ;
Yang, Cheng-Wei ;
Sun, Yung-Nien ;
Horng, Ming-Huwi .
JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2021, 41 (05) :610-625
[44]   A Fully-Automatic Segmentation of the Carpal Tunnel from Magnetic Resonance Images Based on the Convolutional Neural Network-Based Approach [J].
Tai-Hua Yang ;
Cheng-Wei Yang ;
Yung-Nien Sun ;
Ming-Huwi Horng .
Journal of Medical and Biological Engineering, 2021, 41 :610-625
[45]   Deep Convolutional Neural Network Based Analysis of Liver Tissues Using Computed Tomography Images [J].
Nisa, Mehrun ;
Buzdar, Saeed Ahmad ;
Khan, Khalil ;
Ahmad, Muhammad Saeed .
SYMMETRY-BASEL, 2022, 14 (02)
[46]   Fig Plant Segmentation from Aerial Images Using a Deep Convolutional Encoder-Decoder Network [J].
Fuentes-Pacheco, Jorge ;
Torres-Olivares, Juan ;
Roman-Rangel, Edgar ;
Cervantes, Salvador ;
Juarez-Lopez, Porfirio ;
Hermosillo-Valadez, Jorge ;
Manuel Rendon-Mancha, Juan .
REMOTE SENSING, 2019, 11 (10)
[47]   Automated Method of Road Extraction from Aerial Images Using a Deep Convolutional Neural Network [J].
Alshaikhli, Tamara ;
Liu, Wen ;
Maruyama, Yoshihisa .
APPLIED SCIENCES-BASEL, 2019, 9 (22)
[48]   RETRACTED ARTICLE: An improved convolutional neural network for abnormality detection and segmentation from human sperm images [J].
L. Prabaharan ;
A. Raghunathan .
Journal of Ambient Intelligence and Humanized Computing, 2021, 12 :3341-3352
[49]   Deep learning on edge: Extracting field boundaries from satellite images with a convolutional neural network [J].
Waldner, Francois ;
Diakogiannis, Foivos, I .
REMOTE SENSING OF ENVIRONMENT, 2020, 245 (245)
[50]   Automatic classification of white blood cells using deep features based convolutional neural network [J].
Meenakshi, A. ;
Ruth, J. Anitha ;
Kanagavalli, V. R. ;
Uma, R. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) :30121-30142