From Detection to Application: Recent Advances in Understanding Scientific Tables and Figures

被引:2
作者
Huang, Jiani [1 ]
Chen, Haihua [2 ]
Yu, Fengchang [1 ]
Lu, Wei [1 ]
机构
[1] Wuhan Univ, Wuhan, Hubei, Peoples R China
[2] Univ North Texas, Denton, TX USA
关键词
Scientific documents; figure understanding; table understanding; IMAGE RETRIEVAL; INFORMATION EXTRACTION; VISUAL INFORMATION; RECOGNITION; FRAMEWORK;
D O I
10.1145/3657285
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Tables and figures are usually used to present information in a structured and visual way in scientific documents. Understanding the tables and figures in scientific documents is significant for a series of downstream tasks, such as academic search, scientific knowledge graphs, and so on. Existing studies mainly focus on detecting figures and tables from scientific documents, interpreting their semantics, and integrating them into downstream tasks. However, a systematic and comprehensive literature review on the mining and application of tables and figures in academic papers is still missing. In this article, we introduce the research framework and the whole pipeline for understanding tables and figures, including detection, structural analysis, interpretation, and application. We deliver a thorough analysis of benchmark datasets, recent techniques, and their pros and cons. Additionally, a quantitative analysis of the effectiveness of different models on popular benchmarks is presented. We further outline several important applications that exploit the semantics of scientific tables and figures. Finally, we highlight the challenges and some potential directions for future research. We believe this is the first comprehensive survey in understanding scientific tables and figures that covers the landscape from detection to application.
引用
收藏
页数:39
相关论文
共 203 条
[21]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[22]  
Casado C., 2007, In Proceedings of the 70th Annual Meeting of the American Society for Information Science Technology, P1, DOI DOI 10.1002/MEET.1450440390
[23]  
Chang SC, 2022, Arxiv, DOI arXiv:2211.08545
[24]  
Chaudhry R, 2020, IEEE WINT CONF APPL, P3501, DOI 10.1109/WACV45572.2020.9093269
[25]   VIS30K: A Collection of Figures and Tables From IEEE Visualization Conference Publications [J].
Chen, Jian ;
Ling, Meng ;
Li, Rui ;
Isenberg, Petra ;
Isenberg, Tobias ;
Sedlmair, Michael ;
Moeller, Torsten ;
Laramee, Robert S. ;
Shen, Han-Wei ;
Wuensche, Katharina ;
Wang, Qiru .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (09) :3826-3833
[26]   Composition and Configuration Patterns in Multiple-View Visualizations [J].
Chen, Xi ;
Zeng, Wei ;
Lin, Yanna ;
AI-maneea, Hayder Mahdi ;
Roberts, Jonathan ;
Chang, Remco .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) :1514-1524
[27]   DiagramFlyer: A Search Engine for Data-Driven Diagrams [J].
Chen, Zhe ;
Cafarella, Michael ;
Adar, Eytan .
WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, :183-186
[28]  
Cheng Beibei, 2011, Automatic segmentation of subfigure image panels for multimodal biomedical document retrieval, V7874, P294, DOI [10.1117/12.873685, DOI 10.1117/12.873685]
[29]  
Chi ZW, 2019, Arxiv, DOI arXiv:1908.04729
[30]  
Choudhury SR, 2013, ACM-IEEE J CONF DIG, P369