Table Detection using Deep Learning

被引:118
作者
Gilani, Azka [1 ]
Qasim, Shah Rukh [1 ]
Malik, Imran
Shafait, Faisal
机构
[1] Natl Univ Sci & Technol, Islamabad, Pakistan
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017年
关键词
EUCLIDEAN DISTANCE TRANSFORM;
D O I
10.1109/ICDAR.2017.131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Table detection is a crucial step in many document analysis applications as tables are used for presenting essential information to the reader in a structured manner. It is a hard problem due to varying layouts and encodings of the tables. Researchers have proposed numerous techniques for table detection based on layout analysis of documents. Most of these techniques fail to generalize because they rely on hand engineered features which are not robust to layout variations. In this paper, we have presented a deep learning based method for table detection. In the proposed method, document images are first pre-processed. These images are then fed to a Region Proposal Network followed by a fully connected neural network for table detection. The proposed method works with high precision on document images with varying layouts that include documents, research papers, and magazines. We have done our evaluations on publicly available UNLV dataset where it beats Tesseract's state of the art table detection system by a significant margin.
引用
收藏
页码:771 / 776
页数:6
相关论文
共 28 条
[1]  
Abbyy, 2017, OCR SDK ENG
[2]  
Akmal Jahan M. A. C., 2014, 7th International Conference on Information and Automation for Sustainability (ICIAfS), P1, DOI 10.1109/ICIAFS.2014.7069552
[3]   LINEAR-TIME EUCLIDEAN DISTANCE TRANSFORM ALGORITHMS [J].
BREU, H ;
GIL, J ;
KIRKPATRICK, D ;
WERMAN, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (05) :529-533
[4]  
e Silva Ana Costa, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P843, DOI 10.1109/ICDAR.2009.185
[5]   2D Euclidean distance transform algorithms: A comparative survey [J].
Fabbri, Ricardo ;
Costa, Luciano Da F. ;
Torelli, Julio C. ;
Bruno, Odemir M. .
ACM COMPUTING SURVEYS, 2008, 40 (01)
[6]   A Table Detection Method for Multipage PDF Documents via Visual Seperators and Tabular Structures [J].
Fang, Jing ;
Gao, Liangcai ;
Bai, Kun ;
Qiu, Ruiheng ;
Tao, Xin ;
Tang, Zhi .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :779-783
[7]  
Gatos B, 2005, LECT NOTES COMPUT SC, V3686, P609
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   A Table Detection Method for PDF Documents Based on Convolutional Neural Networks [J].
Hao, Leipeng ;
Gao, Liangcai ;
Yi, Xiaohan ;
Tang, Zhi .
PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, :287-292
[10]  
Harit G., 2012, P 8 IND C COMP VIS G