Deep features based convolutional neural network model for text and non-text region segmentation from document images

Cited by: 16
Authors
Umer, Saiyed
Mondal, Ranjan
Pandey, Hari Mohan [1 ,3 ]
Rout, Ranjeet Kumar [2 ]
Affiliations
[1] Aliah Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Indian Stat Inst, Elect & Commun Sci Unit, Kolkata, India
[3] Edge Hill Univ, Dept Comp Sci, Ormskirk, Lancs, England
Keywords
Complex layout; Document image; Text and Non-text region; Segmentation; Patch-based approach; Deep Learning Method; EXTRACTION;
DOI
10.1016/j.asoc.2021.107917
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a deep convolutional neural network model that uses deep learning features to segment text and non-text regions in document images. The key objective is to extract text regions from complex-layout document images without any prior knowledge of the segmentation. In real-world scenarios, document and magazine images contain text alongside non-text regions such as symbols, logos, pictures, and graphics, and separating the text regions from the non-text regions is challenging. To address this, an efficient and robust segmentation technique is proposed. The implementation of the proposed model is divided into three phases: (a) a pre-processing method that uses different patch sizes to handle variations in text fonts and sizes within the image; (b) a deep convolutional neural network model that predicts whether a region within the image is text, non-text, or ambiguous; (c) a post-processing method that handles images with complex ambiguous regions by recursively partitioning those regions into their proper classes (i.e., text or non-text), after which the system accumulates the responses of the predicted patches at varying resolutions to handle font variations within the image. Extensive computer simulations were conducted using a collection of complex-layout magazine images from Google sites and the ICDAR 2015 database. The results were compared with state-of-the-art methods and show that the proposed model is robust and more effective than those methods. (C) 2021 Elsevier B.V. All rights reserved.
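The three phases in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no architecture, patch sizes, or thresholds, so every name below (`toy_classifier`, `segment`, `grid_boxes`, `text_mask`) is hypothetical, the CNN is replaced by a trivial rule on a binary mask, and the majority vote across patch sizes is only one assumed way to accumulate responses.

```python
from typing import Callable, List, Sequence, Tuple

TEXT, NON_TEXT, AMBIGUOUS = "text", "non-text", "ambiguous"

Image = List[List[int]]          # toy input: 1 = text pixel, 0 = non-text pixel
Box = Tuple[int, int, int, int]  # (top, left, height, width)
Classifier = Callable[[Image, Box], str]


def toy_classifier(image: Image, box: Box) -> str:
    """Hypothetical stand-in for the CNN: label a patch by its text-pixel ratio."""
    top, left, h, w = box
    ones = sum(image[r][c] for r in range(top, top + h)
               for c in range(left, left + w))
    ratio = ones / (h * w)
    if ratio >= 0.9:      # assumed threshold, not from the paper
        return TEXT
    if ratio <= 0.1:      # assumed threshold, not from the paper
        return NON_TEXT
    return AMBIGUOUS


def segment(image: Image, box: Box, classify: Classifier) -> List[Tuple[Box, str]]:
    """Phase (c): recursively split ambiguous patches until each part is resolved."""
    label = classify(image, box)
    top, left, h, w = box
    if label != AMBIGUOUS or (h <= 1 and w <= 1):
        return [(box, label)]
    h1, w1 = max(h // 2, 1), max(w // 2, 1)
    parts: List[Tuple[Box, str]] = []
    for t, hh in ((top, h1), (top + h1, h - h1)):
        for l, ww in ((left, w1), (left + w1, w - w1)):
            if hh > 0 and ww > 0:  # skip empty halves of 1-pixel-wide patches
                parts.extend(segment(image, (t, l, hh, ww), classify))
    return parts


def grid_boxes(height: int, width: int, patch: int) -> List[Box]:
    """Phase (a): tile the image with non-overlapping patches of one size."""
    return [(t, l, min(patch, height - t), min(patch, width - l))
            for t in range(0, height, patch)
            for l in range(0, width, patch)]


def text_mask(image: Image, patch_sizes: Sequence[int],
              classify: Classifier) -> Image:
    """Accumulate per-pixel text votes over all patch sizes, then majority-vote."""
    height, width = len(image), len(image[0])
    votes = [[0] * width for _ in range(height)]
    for patch in patch_sizes:
        for box in grid_boxes(height, width, patch):
            # Phase (b) happens inside segment() via the classifier call.
            for (t, l, h, w), label in segment(image, box, classify):
                if label == TEXT:
                    for r in range(t, t + h):
                        for c in range(l, l + w):
                            votes[r][c] += 1
    return [[1 if 2 * v > len(patch_sizes) else 0 for v in row]
            for row in votes]


if __name__ == "__main__":
    page = [[1, 1, 1, 0] for _ in range(4)]  # three text columns, one blank
    print(text_mask(page, (2, 4), toy_classifier))
```

In a real system the classifier would be a trained network operating on pixel patches, and the recursion would normally stop at a minimum patch size larger than one pixel.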
Pages: 14
Related papers
50 in total
  • [21] Deep Neural Network based Hidden Markov Model for Offline Handwritten Chinese Text Recognition
    Du, Jun
    Wang, Zi-Rui
    Zhai, Jian-Fang
    Hu, Jin-Shui
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3428 - 3433
  • [22] Segmentation of intervertebral disks from videofluorographic images using convolutional neural network
    Fujinaka, Ayano
    Saito, Yuki
    Mekata, Kojiro
    Takizawa, Hotaka
    Kudo, Hiroyuki
    INTERNATIONAL FORUM ON MEDICAL IMAGING IN ASIA 2019, 2019, 11050
  • [23] Optimizing Text Detachment from the Document Image Using Block-Based Segmentation and Wavelet Transform
    Jalali, Fateme
    Ebrahimi, Afshin
    Alirezazadeh, Saeid
    2017 IEEE 4TH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2017, : 343 - 348
  • [24] Automatic detection and segmentation of brain metastases on multimodal MR images with a deep convolutional neural network
    Charron, Odelin
    Lallement, Alex
    Jarnet, Delphine
    Noblet, Vincent
    Clavier, Jean-Baptiste
    Meyer, Philippe
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 95 : 43 - 54
  • [25] Combining Deep Fully Convolutional Network and Graph Convolutional Neural Network for the Extraction of Buildings from Aerial Images
    Zhang, Wenzhuo
    Yu, Mingyang
    Chen, Xiaoxian
    Zhou, Fangliang
    Ren, Jie
    Xu, Haiqing
    Xu, Shuai
    BUILDINGS, 2022, 12 (12)
  • [26] A knowledge-based system for extracting text-lines from mixed and overlapping text/graphics compound document images
    Chen, Yen-Lin
    Hong, Zeng-Wei
    Chuang, Cheng-Hung
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 494 - 507
  • [27] Brain Tumor Segmentation from MRI Images Using Handcrafted Convolutional Neural Network
    Ullah, Faizan
    Nadeem, Muhammad
    Abrar, Mohammad
    Al-Razgan, Muna
    Alfakih, Taha
    Amin, Farhan
    Salam, Abdu
    DIAGNOSTICS, 2023, 13 (16)
  • [28] Unstructured Text Resource Access Control Attribute Mining Technology Based on Convolutional Neural Network
    Liu, Aodi
    Du, Xuehui
    Wang, Na
    IEEE ACCESS, 2019, 7 : 43031 - 43041
  • [29] Deep Learning-Based Segmentation of Peach Diseases Using Convolutional Neural Network
    Yao, Na
    Ni, Fuchuan
    Wu, Minghao
    Wang, Haiyan
    Li, Guoliang
    Sung, Wing-Kin
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [30] A Deep Adaptive Convolutional Network for Brain Tumor Segmentation from Multimodal MR Images
    Ghosal, Palash
    Reddy, Shanmukha
    Sai, Charan
    Pandey, Vikas
    Chakraborty, Jayasree
    Nandi, Debashis
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 1065 - 1070