Table Detection in Noisy Off-line Handwritten Documents

Cited by: 22
Authors
Chen, Jin [1]
Lopresti, Daniel [2]
Affiliations
[1] Lehigh Univ, Dept Comp Sci & Engn, Bethlehem, PA 18015 USA
[2] Lehigh Univ Bethlehem, Dept Comp Sci Engn, Bethlehem, PA 18015 USA
Source
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011
Keywords
Off-line handwriting; table detection; noisy documents;
DOI
10.1109/ICDAR.2011.88
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Table detection can be a valuable step in the analysis of unstructured documents. Although much work has been conducted in the domain of machine-print including books, scientific papers, etc., little has been done to address the case of handwritten inputs. In this paper, we study table detection in scanned handwritten documents subject to challenging artifacts and noise. First, we separate text components (machine-print, handwriting) from the rest of the page using an SVM classifier. We then employ a correlation-based approach to measure the coherence between adjacent text lines which may be part of the same table, solving the resulting page decomposition problem using dynamic programming. A report of preliminary results from ongoing experiments concludes the paper.
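The abstract names a correlation-based coherence measure between adjacent text lines and a dynamic-programming decomposition, but this record gives no further detail. The following is only a minimal sketch of how such a pipeline could look, not the authors' method. It assumes each text line has already been reduced to a column-occupancy profile (a hypothetical feature), uses Pearson correlation as the coherence score, and introduces two illustrative parameters, `threshold` and `start_penalty`. The SVM text/non-text separation step is omitted.

```python
from typing import List, Sequence


def correlation(a: Sequence[float], b: Sequence[float]) -> float:
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / (var_a * var_b) ** 0.5


def label_table_lines(
    profiles: Sequence[Sequence[float]],
    threshold: float = 0.5,       # assumed: minimum coherence worth rewarding
    start_penalty: float = 0.25,  # assumed: cost of opening a new table region
) -> List[bool]:
    """Label each text line as table (True) or non-table (False).

    Two-state dynamic program over the line sequence: staying inside a
    table earns (correlation - threshold); entering one costs start_penalty.
    """
    n = len(profiles)
    if n == 0:
        return []
    gains = [
        correlation(profiles[i], profiles[i + 1]) - threshold
        for i in range(n - 1)
    ]
    NON, TAB = 0, 1
    score = [0.0, -start_penalty]  # best score with line 0 in each state
    back = []                      # back[k][s]: state of line k given line k+1 is s
    for gain in gains:
        stay_tab = score[TAB] + gain
        enter_tab = score[NON] - start_penalty
        prev_for_non = TAB if score[TAB] > score[NON] else NON
        prev_for_tab = TAB if stay_tab > enter_tab else NON
        back.append((prev_for_non, prev_for_tab))
        score = [max(score[NON], score[TAB]), max(stay_tab, enter_tab)]
    # Trace the best path backwards from the last line.
    state = TAB if score[TAB] > score[NON] else NON
    labels = [state == TAB]
    for prev in reversed(back):
        state = prev[state]
        labels.append(state == TAB)
    labels.reverse()
    return labels


if __name__ == "__main__":
    column_like = [1, 0, 0, 1, 0, 0, 1, 0]  # toy profile with three "columns"
    page = [
        [1, 1, 1, 1, 1, 1, 1, 0],
        [0, 1, 1, 1, 1, 1, 1, 1],
        column_like,
        column_like,
        column_like,
        [1, 1, 1, 0, 1, 1, 1, 1],
    ]
    print(label_table_lines(page))  # [False, False, True, True, True, False]
```

On the toy page, only the three lines that share a column layout are grouped into a table region. The start penalty is what keeps an isolated line from being labeled as a one-line table.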
Pages: 399-403
Page count: 5