Text Line Extraction in Document Images

被引:0
作者
Wang, Liuan [1 ]
Fan, Wei [1 ]
Sun, Jun [1 ]
Naoi, Satshi [1 ]
Tanaka, Hiroshi [2 ]
机构
[1] Fujitsu Res & Dev Ctr CO LTD, Beijing, Peoples R China
[2] Fujitsu Labs Ltd, Kawasaki, Kanagawa, Japan
来源
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2015年
关键词
generic text line extraction; MSER; hierarchical edge reconstruction and cut; text line energy minimization; SCENE; REGION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line extraction in document images is an important prerequisite for many content based image understanding applications. In this paper, we propose an accurate and robust method for generic text line extraction, which can be applied on large categories of document images, diverse languages, and text lines with different orientations. Firstly, the candidate connected components are extracted from document image using Maximal Stable Extremal Region (MSER) with the noises filtered by Adaboost and Convolution Neural Network (CNN). Then, the coarse text lines are generated from hierarchical edges reconstruction and cut by local linearity of text lines in the document spanning tree. Finally, for accurate text line extraction, the cut mUlti-components are re-connected based on text line energy minimization in terms of text line consistency and the fitting error. Experimental results on multilingual test dataset demonstrate the effectiveness and robust of the proposed method, which yields higher performance compared with state-of-the-art methods.
引用
收藏
页码:191 / 195
页数:5
相关论文
共 28 条
  • [1] [Anonymous], P IEEE C COMP VIS PA
  • [2] Bukhari S.S., 2009, 10 INT C DOC AN REC, P61
  • [3] Towards Generic Text-Line Extraction
    Bukhari, Syed Saqib
    Shafait, Faisal
    Breuel, Thomas M.
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 748 - 752
  • [4] Text-Line Extraction using a Convolution of Isotropic Gaussian Filter with a Set of Line Filters
    Bukhari, Syed Saqib
    Shafait, Faisal
    Breuel, Thomas M.
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 579 - 583
  • [5] TEXTLINE INFORMATION EXTRACTION FROM GRAYSCALE CAMERA-CAPTURED DOCUMENT IMAGES
    Bukhari, Syed Saqib
    Breuel, Thomas M.
    Shafait, Faisal
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 2013 - +
  • [6] Bukhari SS, 2009, LECT NOTES COMPUT SC, V5702, P173, DOI 10.1007/978-3-642-03767-2_21
  • [7] Cunzhao Shi, 2012, Proceedings of the 10th IAPR International Workshop on Document Analysis Systems (DAS 2012), P210, DOI 10.1109/DAS.2012.40
  • [8] Gatos B., 2010, INT J DOC ANAL RECOG, V14, P25
  • [9] ICDAR 2013 Robust Reading Competition
    Karatzas, Dimosthenis
    Shafait, Faisal
    Uchida, Seiichi
    Iwamura, Masakazu
    Gomez i Bigorda, Lluis
    Robles Mestre, Sergi
    Mas, Joan
    Fernandez Mota, David
    Almazan Almazan, Jon
    Pere de las Heras, Lluis
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1484 - 1493
  • [10] Kumar J., 2010, Proc. 8th IAPR Int. Work. Doc. Anal. Syst. - DAS, V10, P135