Text Line Segmentation for Challenging Handwritten Document Images Using Fully Convolutional Network

Cited by: 23
Authors
Barakat, Berat [1]
Droby, Ahmad [1]
Kassis, Majeed [1]
El-Sana, Jihad [1]
Affiliations
[1] Ben Gurion Univ Negev, Dept Comp Sci, Beer Sheva, Israel
Source
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2018
Keywords
EXTRACTION;
DOI
10.1109/ICFHR-2018.2018.00072
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a method for text line segmentation of challenging historical manuscript images. These manuscript images contain narrow interline spaces with touching components, interpenetrating vowel signs, and inconsistent font types and sizes. In addition, they contain curved, multi-skewed, and multi-directed side note lines within a complex page layout. Therefore, bounding polygon labeling would be very difficult and time-consuming. Instead, we rely on line masks that connect the components on the same text line. These line masks are then predicted using a Fully Convolutional Network (FCN). In the literature, FCN has been successfully used for text line segmentation of regular handwritten document images. The present paper shows that FCN is useful with challenging manuscript images as well. Using a new evaluation metric that is sensitive to over-segmentation as well as under-segmentation, testing results on a publicly available challenging handwritten dataset are comparable with the results of a previous work on the same dataset.
Pages: 374 - 379
Page count: 6
Related papers
22 in total
  • [1] A new scheme for unconstrained handwritten text-line segmentation
    Alaei, Alireza
    Pal, Umapada
    Nagabhushan, P.
    [J]. PATTERN RECOGNITION, 2011, 44 (04) : 917 - 928
  • [2] [Anonymous], 2012, IJDAR
  • [3] Arivazhagan Manivannan, 2007, International Conference on Document Recognition and Retrieval XIV SPIE, p6500T
  • [4] Bar-Yosef Itay, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1161, DOI 10.1109/ICDAR.2009.191
  • [5] Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents
    Bar-Yosef, Itay
    Beckman, Isaac
    Kedem, Klara
    Dinstein, Itshak
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) : 89 - 99
  • [6] Barakat B., 2018, AR SCRIPT AN REC ASA, P26
  • [7] Bukhari Syed Saqib, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P446, DOI 10.1109/ICDAR.2009.206
  • [8] Using Scale-Space Anisotropic Smoothing for Text Line Extraction in Historical Documents
    Cohen, Rafi
    Dinstein, Itshak
    El-Sana, Jihad
    Kedem, Klara
    [J]. IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 349 - 358
  • [9] cBAD: ICDAR2017 Competition on Baseline Detection
    Diem, Markus
    Kleber, Florian
    Fiel, Stefan
    Gatos, Basilis
    Gruening, Tobias
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1355 - 1360
  • [10] Fitzgibbon A.W., 1996, DAI Research paper