Deep features based convolutional neural network model for text and non-text region segmentation from document images

被引:16
作者
Umer, Saiyed
Mondal, Ranjan
Pandey, Hari Mohan [1 ,3 ]
Rout, Ranjeet Kumar [2 ]
机构
[1] Aliah Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Indian Stat Inst, Elect & Commun Sci Unit, Kolkata, India
[3] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
关键词
Complex layout; Document image; Text and Non-text region; Segmentation; Patch-based approach; Deep Learning Method; EXTRACTION;
D O I
10.1016/j.asoc.2021.107917
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A deep convolutional neural network model is presented here which uses deep learning features for text and non-text region segmentation from document images. The key objective is to extract text regions from the complex layout document images without any prior knowledge of segmentation. In a real-world scenario, a document or magazine images contain various text information along with non-text regions such as symbols, logos, pictures, and graphics. Extraction of text regions from non-text regions is challenging. To mitigate these issues, an efficient and robust segmentation technique has been proposed in this paper. The implementation of the proposed model is divided into three phases: (a) a method for pre-processing of document images using different patch sizes is employed to handle the situations for variants of text fonts and sizes in mage; (b) a deep convolutional neural network model is proposed to predict the text or non-text or ambiguous region within the image; (c) a method for post-processing of document image is proposed to handle the situation where the image has complex ambiguous regions by utilizing the recursive partitioning of those regions into their proper classes (i.e. text or non-text) and then the system accumulates the responses of those predictive patches with varying resolutions for handling the situation of text fonts variations within the image. Extensive computer simulations have been conducted using a collection of complex layout magazine images from Google sites and the ICDAR 2015 database. Results are collected and compared with state-of-the-art methods. It reveals that the proposed model is robust and more effective as compared to state-of-the-art methods. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Deep Learning-Based Segmentation of Peach Diseases Using Convolutional Neural Network
    Yao, Na
    Ni, Fuchuan
    Wu, Minghao
    Wang, Haiyan
    Li, Guoliang
    Sung, Wing-Kin
    [J]. FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [32] Improvement of automatic building region extraction based on deep neural network segmentation
    Hayasaka, Noboru
    Shirazawa, Yuki
    Kanai, Mizuki
    Futagami, Takuya
    [J]. JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2023, 7 (04) : 393 - 408
  • [33] Distance transform based text-line extraction from unconstrained handwritten document images
    Bera, Suman Kumar
    Kundu, Soumyadeep
    Kumar, Neeraj
    Sarkar, Ram
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 186
  • [34] Gabor filter based block energy analysis for text extraction from digital document images
    Raju, S
    Pati, PB
    Ramakrishnan, AG
    [J]. FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2004, : 233 - 243
  • [35] Novel Light Convolutional Neural Network for COVID Detection with Watershed Based Region Growing Segmentation
    Khan, Hassan Ali
    Gong, Xueqing
    Bi, Fenglin
    Ali, Rashid
    [J]. JOURNAL OF IMAGING, 2023, 9 (02)
  • [36] Hybrid convolutional neural network based segmentation of visceral and subcutaneous adipose tissue from abdominal magnetic resonance images
    Devi B.S.
    Misbha D.S.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (10) : 13333 - 13347
  • [37] HIC-net: A deep convolutional neural network model for classification of histopathological breast images
    Ozturk, Saban
    Akdemir, Bayram
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2019, 76 : 299 - 310
  • [38] Writer Code Based Adaptation of Deep Neural Network for Offline Handwritten Chinese Text Recognition
    Wang, Zi-Rui
    Du, Jun
    [J]. PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 548 - 553
  • [39] RETRACTED: Named Entity Recognition of Medical Text Based on the Deep Neural Network (Retracted Article)
    Yang, Tianjiao
    He, Ying
    Yang, Ning
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2022, 2022
  • [40] Microstructural crack segmentation of three-dimensional concrete images based on deep convolutional neural networks
    Dong, Yijia
    Su, Chao
    Qiao, Pizhong
    Sun, Lizhi
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2020, 253