Entropy-Based Approach for Enabling Text Line Segmentation in Handwritten Documents

被引:0
|
作者
Sindhushree, G. S. [1 ]
Amarnath, R. [1 ]
Nagabhushan, P. [2 ]
机构
[1] Univ Mysore, Dept Studies Comp Sci, Mysore, Karnataka, India
[2] Indian Inst Informat Technol, Allahabad, Uttar Pradesh, India
来源
DATA ANALYTICS AND LEARNING | 2019年 / 43卷
关键词
Separators; Entropy; Correspondence; Text line segmentation;
D O I
10.1007/978-981-13-2514-4_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining text and non-text regions in an unconstrained handwritten document image is a challenging task. In this article, we propose a novel approach based on entropy for enabling the text line segmentation. A document image is divided into multiple blocks and entropy is calculated for each block. Entropy would be higher in the text region when compared to that of non-text region. Separator points are introduced accordingly to separate text from non-text part. Further correspondence between these separators would enable text line segmentation. The proposed algorithm works with an order of O (m x n) in worst case and requires less buffer space, since it is based on unsupervised learning. Benchmark ICDAR-13 dataset is used for experimentation and accuracy is reported.
引用
收藏
页码:169 / 184
页数:16
相关论文
共 50 条
  • [21] Text line detection in handwritten documents
    Louloudis, G.
    Gatos, B.
    Pratikakis, I.
    Halatsis, C.
    PATTERN RECOGNITION, 2008, 41 (12) : 3758 - 3772
  • [22] LINE SEGMENTATION OF HANDWRITTEN KANNADA DOCUMENTS
    Swetha, S.
    Chinmayi, P. S.
    Mamatha, H. R.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [23] On-line handwritten documents segmentation
    Blanchard, J
    Artières, T
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 148 - 153
  • [24] Robust line segmentation for handwritten documents
    Kuzhinjedathu, Kamal
    Srinivasan, Harish
    Srihari, Sargur
    DOCUMENT RECOGNITION AND RETRIEVAL XV, 2008, 6815
  • [25] A Grid based Approach for Handwritten Text Segmentation
    Ghosh, Soumalya
    Gupta, Umesh Kumar
    Ghosh, Uttam
    Shetty, Sachin
    2019 IEEE SOUTHEASTCON, 2019,
  • [26] A generalized line segmentation method for multi-script handwritten text documents
    Rakshit, Payel
    Halder, Chayan
    Md Obaidullah, Sk
    Roy, Kaushik
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [27] Text line segmentation in handwritten documents using Mumford-Shah model
    Du, Xiaojun
    Pan, Wumo
    Bui, Tien D.
    PATTERN RECOGNITION, 2009, 42 (12) : 3136 - 3145
  • [28] A Multi-scale Text Line Segmentation Method in Freestyle Handwritten Documents
    Gao, Yangdong
    Ding, Xiaoqing
    Liu, Changsong
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 643 - 647
  • [29] Segmentation of Historical Handwritten Documents into Text Zones and Text Lines
    Gatos, Basilis
    Louloudis, Georgios
    Stamatopoulos, Nikolaos
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 464 - 469
  • [30] A robust approach to text line grouping in online handwritten Japanese documents
    Zhou, Xiang-Dong
    Wang, Da-Han
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2009, 42 (09) : 2077 - 2088