Automatic Metadata Information Extraction from Scientific Literature using Deep Neural Networks

被引:1
作者
Yang, Huichen [1 ]
Hsu, William [1 ]
机构
[1] Kansas State Univ, Manhattan, KS 66506 USA
来源
FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021) | 2022年 / 12084卷
关键词
Metadata Information Extraction; Document Layout Detection; Text Recognition; Transfer Learning; Deep Learning;
D O I
10.1117/12.2623554
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We present a novel computer vision-based deep learning approach for metadata extraction as both a central component of and an ancillary aid to structured information extraction from scientific literature which has various formats. The number of scientific publications is growing rapidly, but existing methods cannot combine the techniques of layout extraction and text recognition efficiently because of the various formats used by scientific literature publishers. In this paper, we introduce an end-to-end trainable neural network for segmenting and labeling the main regions of scientific documents, while simultaneously recognizing text from the detected regions. The proposed framework combines object detection techniques based on Recurrent Convolutional Neural Network (RCNN) for scientific document layout detection with Convolutional Recurrent Neural Network (CRNN) for text recognition. We also contribute a novel data set of main region annotations for scientific literature metadata information extraction to complement the limited availability of high-quality data set. The final outputs of the network are the text content (payload) and the corresponding labels of the major regions. Our results show that our model outperforms state-of-the-field baselines.
引用
收藏
页数:8
相关论文
共 30 条
[1]  
[Anonymous], 2006, P 23 INT C MACH LEAR, DOI 10.1145/1143844.1143891
[2]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[3]  
Constantin Alexandru., 2013, Proceedings of the 2013 ACM symposium on Document engineering, P177
[4]   ICDAR 2013 Table Competition [J].
Goebel, Max ;
Hassan, Tamir ;
Oro, Ermelinda ;
Orsi, Giorgio .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1449-1453
[5]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[8]   Semi-supervised machine-learning classification of materials synthesis procedures [J].
Huo, Haoyan ;
Rong, Ziqin ;
Kononova, Olga ;
Sun, Wenhao ;
Botari, Tiago ;
He, Tanjin ;
Tshitoyan, Vahe ;
Ceder, Gerbrand .
NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
[9]  
Kay A., 2007, Linux J, V2007, P2, DOI DOI 10.5555/1288165.1288167
[10]   Materials Synthesis Insights from Scientific Literature via Text Extraction and Machine Learning [J].
Kim, Edward ;
Huang, Kevin ;
Saunders, Adam ;
McCallum, Andrew ;
Ceder, Gerbrand ;
Olivetti, Elsa .
CHEMISTRY OF MATERIALS, 2017, 29 (21) :9436-9444