From Detection to Application: Recent Advances in Understanding Scientific Tables and Figures

被引:2
作者
Huang, Jiani [1 ]
Chen, Haihua [2 ]
Yu, Fengchang [1 ]
Lu, Wei [1 ]
机构
[1] Wuhan Univ, Wuhan, Hubei, Peoples R China
[2] Univ North Texas, Denton, TX USA
关键词
Scientific documents; figure understanding; table understanding; IMAGE RETRIEVAL; INFORMATION EXTRACTION; VISUAL INFORMATION; RECOGNITION; FRAMEWORK;
D O I
10.1145/3657285
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Tables and figures are usually used to present information in a structured and visual way in scientific documents. Understanding the tables and figures in scientific documents is significant for a series of downstream tasks, such as academic search, scientific knowledge graphs, and so on. Existing studies mainly focus on detecting figures and tables from scientific documents, interpreting their semantics, and integrating them into downstream tasks. However, a systematic and comprehensive literature review on the mining and application of tables and figures in academic papers is still missing. In this article, we introduce the research framework and the whole pipeline for understanding tables and figures, including detection, structural analysis, interpretation, and application. We deliver a thorough analysis of benchmark datasets, recent techniques, and their pros and cons. Additionally, a quantitative analysis of the effectiveness of different models on popular benchmarks is presented. We further outline several important applications that exploit the semantics of scientific tables and figures. Finally, we highlight the challenges and some potential directions for future research. We believe this is the first comprehensive survey in understanding scientific tables and figures that covers the landscape from detection to application.
引用
收藏
页数:39
相关论文
共 203 条
[1]   TNCR: Table net detection and classification dataset [J].
Abdallah, Abdelrahman ;
Berendeyev, Alexander ;
Nuradin, Islam ;
Nurseitov, Daniyar .
NEUROCOMPUTING, 2022, 473 :79-97
[2]   CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images [J].
Agarwal, Madhav ;
Mondal, Ajoy ;
Jawahar, C., V .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :9491-9498
[3]  
Agarwal Shashank, 2009, AMIA Annu Symp Proc, V2009, P6
[4]  
Ajij M., 2022, SN COMPUTER SCI, V3, P1
[5]   Content-Based Image Retrieval in Radiology: Current Status and Future Directions [J].
Akgul, Ceyhun Burak ;
Rubin, Daniel L. ;
Napel, Sandy ;
Beaulieu, Christopher F. ;
Greenspan, Hayit ;
Acar, Burak .
JOURNAL OF DIGITAL IMAGING, 2011, 24 (02) :208-222
[6]  
Al-Zaidy Rabah A., 2015, Automatic extraction of data from bar charts, P30, DOI [10.1145/2815833.2816956, DOI 10.1145/2815833.2816956]
[7]  
[Anonymous], 2016, What you get is what you see: a visual markup decompiler
[8]  
[Anonymous], 1996, UW-III English/Technical Document Image database Manual
[9]  
Antani S, 2004, STUD HEALTH TECHNOL, V107, P829
[10]  
Artley B, 2023, Arxiv, DOI arXiv:2306.11699