Table Detection in Noisy Off-line Handwritten Documents

被引:22
作者
Chen, Jin [1 ]
Lopresti, Daniel [2 ]
机构
[1] Lehigh Univ, Dept Comp Sci & Engn, Bethlehem, PA 18015 USA
[2] Lehigh Univ Bethlehem, Dept Comp Sci Engn, Bethlehem, PA 18015 USA
来源
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011年
关键词
Off-line handwriting; table detection; noisy documents;
D O I
10.1109/ICDAR.2011.88
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Table detection can be a valuable step in the analysis of unstructured documents. Although much work has been conducted in the domain of machine-print including books, scientific papers, etc., little has been done to address the case of handwritten inputs. In this paper, we study table detection in scanned handwritten documents subject to challenging artifacts and noise. First, we separate text components (machine-print, handwriting) from the rest of the page using an SVM classifier. We then employ a correlation-based approach to measure the coherence between adjacent text lines which may be part of the same table, solving the resulting page decomposition problem using dynamic programming. A report of preliminary results from ongoing experiments concludes the paper.
引用
收藏
页码:399 / 403
页数:5
相关论文
共 14 条
[11]  
Namboodiri A., 2004, THESIS
[12]  
Shafait F., 2010, P 9 IAPR INT WORKSH, P65
[13]  
Wang X., 2016, Tabular abstraction, editing, and formatting
[14]  
Zanibbi R., 2003, International Journal of Document Analysis and Recognition, V7, P1