Deep-learning and graph-based approach to table structure recognition

被引:10
|
作者
Lee, Eunji [1 ]
Park, Jaewoo [1 ]
Koo, Hyung Il [2 ]
Cho, Nam Ik [1 ,3 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, INMC, Seoul 08826, South Korea
[2] Ajou Univ, Dept Elect & Comp Engn, Suwon 16499, South Korea
[3] Seoul Natl Univ, Sch Data Sci, Seoul 08826, South Korea
关键词
Deep learning; Document analysis; Graph-based approach; Table understanding;
D O I
10.1007/s11042-021-11819-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Table structure recognition is a key component in document understanding. Many prior methods have addressed this problem with three sequential steps: table detection, table component extraction, and structure analysis based on pairwise relations. However, they have limitations in addressing complexly structured tables and/or practical scenarios (e.g., scanned documents). In this paper, we propose a novel graph-based table structure recognition framework. In order to handle complex tables, we formulate tables as planar graphs, whose faces are cell-regions. Then, we compute vertex (junction) confidence maps and line fields with the heatmap regression networks having a small number of parameters (about 1M) and reconstruct tables by solving a constrained optimization problem. We demonstrate the robustness of the proposed system through experiments on ICDAR 2019 dataset and on challenging table images. Experimental results show that the proposed method outperforms the conventional method for a range of scenarios and delivers good generalization performance.
引用
收藏
页码:5827 / 5848
页数:22
相关论文
共 50 条
  • [21] A graph-based approach for positive and unlabeled learning
    Carnevali, Julio Cesar
    Rossi, Rafael Geraldeli
    Milios, Evangelos
    Lopes, Alneu de Andrade
    INFORMATION SCIENCES, 2021, 580 : 655 - 672
  • [22] Analyzing the harmonic structure in graph-based learning
    Wu, Xiao-Ming
    Li, Zhenguo
    Chang, Shih-Fu
    Advances in Neural Information Processing Systems, 2013,
  • [23] Graph-based deep learning for communication networks: A survey
    Jiang, Weiwei
    COMPUTER COMMUNICATIONS, 2022, 185 : 40 - 54
  • [24] A survey on graph-based deep learning for computational histopathology
    Ahmedt-Aristizabal, David
    Armin, Mohammad Ali
    Denman, Simon
    Fookes, Clinton
    Petersson, Lars
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2022, 95
  • [25] Graph-based Deep Learning Analysis and Instance Selection
    Nonaka, Keisuke
    Shekkizhar, Sarath
    Ortega, Antonio
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [26] Graph-based Deep Learning in Natural Language Processing
    Vashishth, Shikhar
    Yadati, Naganand
    Talukdar, Partha
    PROCEEDINGS OF THE 7TH ACM IKDD CODS AND 25TH COMAD (CODS-COMAD 2020), 2020, : 371 - 372
  • [27] A survey for table recognition based on deep learning
    Yu, Chenglong
    Li, Weibin
    Li, Wei
    Zhu, Zixuan
    Liu, Ruochen
    Hou, Biao
    Jiao, Licheng
    NEUROCOMPUTING, 2024, 600
  • [28] TGLSTM: A time based graph deep learning approach to gait recognition
    Battistone, Francesco
    Petrosino, Alfredo
    PATTERN RECOGNITION LETTERS, 2019, 126 : 132 - 138
  • [29] Deep Learning for Table Detection and Structure Recognition: A Survey
    Kasem, Mahmoud salaheldin
    Abdallah, Abdelrahman
    Berendeyev, Alexander
    Elkady, Ebrahem
    Mahmoud, Mohamed
    Abdalla, Mahmoud
    Hamada, Mohamed
    Vascon, Sebastiano
    Nurseitov, Daniyar
    Taj-eddin, Islam
    ACM COMPUTING SURVEYS, 2024, 56 (12)
  • [30] Deep learning for table detection and structure recognition: A survey
    Kasem, Mahmoud
    Abdallah, Abdelrahman
    Berendeyev, Alexander
    Elkady, Ebrahem
    Abdalla, Mahmoud
    Mahmoud, Mohamed
    Hamada, Mohamed
    Nurseitov, Daniyar
    Taj-Eddin, Islam
    arXiv, 2022,