Using Scale-Space Anisotropic Smoothing for Text Line Extraction in Historical Documents

被引:17
作者
Cohen, Rafi [1 ]
Dinstein, Itshak [2 ]
El-Sana, Jihad [1 ]
Kedem, Klara [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Elect & Comp Engn, Beer Sheva, Israel
来源
IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I | 2014年 / 8814卷
关键词
Historical document processing; Text lines extraction;
D O I
10.1007/978-3-319-11758-4_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line extraction is vital pre-requisite for various document processing tasks. This paper presents a novel approach for text line extraction which is based on Gaussian scale space and dedicated binarization that utilize the inherent structure of smoothed text document images. It enhances the text lines in the image using multi-scale anisotropic second derivative of Gaussian filter bank at the average height of the text line. It then applies a binarization, which is based on component-tree and is tailored towards line extraction. The final stage of the algorithm is based on an energy minimization framework for removing spurious text line and assigning connected components to lines. We have tested our approach on various datasets written in different languages at range of image quality and received high detection rates, which outperform state-of-the-art algorithms. Our MATLAB code is publicly available. (http://www.cs.bgu.ac.il/similar to rafico/LineExtraction.zip)
引用
收藏
页码:349 / 358
页数:10
相关论文
共 16 条
  • [1] Text Line Extraction using DMLP Classifiers for Historical Manuscripts
    Baechler, Micheal
    Liwicki, Marcus
    Ingold, Rolf
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1029 - 1033
  • [2] Bar-Yosef Itay, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1161, DOI 10.1109/ICDAR.2009.191
  • [3] Evolution maps for connected components in text documents
    Biller, Ofer
    Kedem, Klara
    Dinstein, Itshak
    El-Sana, Jihad
    [J]. 13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 405 - 410
  • [4] Bukhari Syed Saqib, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P446, DOI 10.1109/ICDAR.2009.206
  • [5] Cohen Rafi., 2013, P 2 INT WORKSH HIST, P110, DOI DOI 10.1145/2501115.2501117
  • [6] Fast Approximate Energy Minimization with Label Costs
    Delong, Andrew
    Osokin, Anton
    Isack, Hossam N.
    Boykov, Yuri
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (01) : 1 - 27
  • [7] Text Line Detection for Heterogeneous Documents
    Diem, Markus
    Kleber, Florian
    Sablatnig, Robert
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 743 - 747
  • [8] ICDAR2009 handwriting segmentation contest
    Gatos, B.
    Stamatopoulos, N.
    Louloudis, G.
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2011, 14 (01) : 25 - 33
  • [9] Gatos B., 2010, Proceedings 2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010), P737, DOI 10.1109/ICFHR.2010.120
  • [10] Script-independent text line segmentation in freestyle handwritten documents
    Li, Yi
    Zheng, Yefeng
    Doermann, David
    Jaeger, Stefan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) : 1313 - 1329