Deep-learning and graph-based approach to table structure recognition

Cited by: 0
Authors
Eunji Lee
Jaewoo Park
Hyung Il Koo
Nam Ik Cho
Affiliations
[1] INMC, Department of Electrical and Computer Engineering
[2] Seoul National University, Department of Electrical and Computer Engineering
[3] Ajou University, School of Data Science
[4] Seoul National University
Source
Multimedia Tools and Applications | 2022 / Vol. 81
Keywords
Deep learning; Document analysis; Graph-based approach; Table understanding
DOI
Not available
Abstract
Table structure recognition is a key component in document understanding. Many prior methods address this problem in three sequential steps: table detection, table component extraction, and structure analysis based on pairwise relations. However, they are limited in handling complexly structured tables and/or practical scenarios (e.g., scanned documents). In this paper, we propose a novel graph-based table structure recognition framework. To handle complex tables, we formulate tables as planar graphs whose faces are cell regions. We then compute vertex (junction) confidence maps and line fields with heatmap regression networks that have a small number of parameters (about 1M), and reconstruct tables by solving a constrained optimization problem. We demonstrate the robustness of the proposed system through experiments on the ICDAR 2019 dataset and on challenging table images. Experimental results show that the proposed method outperforms the conventional method in a range of scenarios and generalizes well.
Pages: 5827-5848
Number of pages: 21