Learning text-line localization with shared and local regression neural networks

被引:0
作者
Moysset, Bastien [1 ,3 ]
Louradour, Jerome [1 ]
Kermorvant, Christopher [4 ]
Wolff, Christian [2 ,3 ]
机构
[1] A2iA SAS, Paris, France
[2] Univ Lyon, CNRS, Villeurbanne, France
[3] INSA Lyon, LIRIS, UMR5205, F-69621 Villeurbanne, France
[4] Teklia SAS, Paris, France
来源
PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2016年
关键词
text-line segmentation; neural network; deep learning; LSTM; regression;
D O I
10.1109/ICFHR.2016.11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line detection and localisation is a crucial step for full page document analysis, but still suffers from heterogeneity of real life documents. In this paper, we present a novel approach for text line localisation based on Convolutional Neural Networks and Multidimensional Long Short-Term Memory cells as a regressor in order to predict the coordinates of the text line bounding boxes directly from the pixel values. Targeting typically large images in document image analysis, we propose a new model using weight sharing over local blocks. We compare two strategies: directly predicting the four coordinates or predicting lower-left and upper-right points separately followed by matching. We evaluate our work on the highly unconstrained Maurdor dataset and show that our method outperforms both other machine learning and image processing methods.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 28 条
[1]  
[Anonymous], INT C DOC AN REC 200
[2]  
[Anonymous], INT C DOC AN REC
[3]  
[Anonymous], INT C DOC AN REC
[4]  
[Anonymous], INT C FRONT HANDWR R
[5]  
[Anonymous], WORKSH HIST DOC IM P
[6]  
[Anonymous], INT C COMP VIS PATT
[7]  
[Anonymous], ARXIV151103776
[8]  
[Anonymous], WORKSH HIST DOC IM P
[9]  
[Anonymous], INT C PATT REC
[10]  
[Anonymous], IEEE T PATTERN ANAL