DeepBIBX: Deep Learning for Image Based Bibliographic Data Extraction

Cited by: 5

Authors:

Bhardwaj, Akansha [1,2]

Mercier, Dominik [1]

Dengel, Andreas [1]

Ahmed, Sheraz [1]

Affiliations:

[1] DFKI Kaiserslautern, Smart Data & Serv, Kaiserslautern, Germany

[2] Univ Fribourg, eXascale Infolab, Fribourg, Switzerland

Source:

NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II | 2017 / Vol. 10635

Funding:

Swiss National Science Foundation;

Keywords:

Deep learning; Machine learning; Bibliographic data; Reference linking;

DOI:

10.1007/978-3-319-70096-0_30

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Extracting structured bibliographic data from document images of academic content that is not natively digital is a challenging problem, with applications in automating library cataloging systems and in reference linking. Existing approaches discard visual cues: they convert the document image to text and then identify citation strings using trained segmentation models. Besides requiring large amounts of training data, these methods are also language dependent. This paper presents a novel approach (DeepBIBX) that tackles the problem from a computer vision perspective and uses deep learning to semantically segment the individual citation strings in a document image. DeepBIBX is based on deep Fully Convolutional Networks and uses transfer learning to extract bibliographic references from document images. Unlike existing approaches, which rely on textual content to semantically segment bibliographic references, DeepBIBX uses image-based contextual information, which makes it applicable to documents in any language. To gauge the performance of the presented approach, a dataset of 286 document images containing 5090 bibliographic references was collected. Evaluation results show that DeepBIBX outperforms the state-of-the-art method (ParsCit) for bibliographic reference extraction, achieving an accuracy of 84.9% compared with 71.7%. Furthermore, on the pixel classification task, DeepBIBX achieved a precision of 96.2% and a recall of 94.4%.
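
The pixel-level precision and recall mentioned in the abstract can be illustrated with a minimal sketch. It assumes binary masks in which 1 marks a pixel belonging to a bibliographic reference; the function name `pixel_precision_recall` and the toy masks are hypothetical and are not taken from the paper, whose exact evaluation protocol is not described in this record.

```python
def pixel_precision_recall(pred, truth):
    """Pixel-wise precision and recall for two binary masks of equal shape.

    pred, truth: lists of rows, each row a list of 0/1 values
    (1 = pixel labelled as part of a reference; an assumed convention).
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # predicted reference pixel, truly reference
            elif p and not t:
                fp += 1          # predicted reference pixel, truly background
            elif t and not p:
                fn += 1          # missed reference pixel
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Toy 2x4 masks, invented purely for illustration.
truth = [[1, 1, 0, 0],
         [1, 1, 0, 0]]
pred = [[1, 1, 1, 0],
        [1, 0, 0, 0]]

precision, recall = pixel_precision_recall(pred, truth)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Precision here is the share of predicted reference pixels that are correct, and recall is the share of true reference pixels that were found.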

Pages: 286 - 293

Page count: 8

50 items in total

[1] Transfer Learning and Deep Feature Extraction for Planktonic Image Data Sets
Orenstein, Eric C.
Beijbom, Oscar
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017: 1082 - 1088
[2] Research on remote sensing image extraction based on deep learning
Shun Z.
Li D.
Jiang H.
Li J.
Peng R.
Lin B.
Liu Q.
Gong X.
Zheng X.
Liu T.
PeerJ Computer Science, 2022, 8
[3] Research on remote sensing image extraction based on deep learning
Shun, Zhao
Li, Danyang
Jiang, Hongbo
Li, Jiao
Peng, Ran
Lin, Bin
Liu, QinLi
Gong, Xinyao
Zheng, Xingze
Liu, Tao
PEERJ COMPUTER SCIENCE, 2022, 8
[4] Table Information Extraction Using Data Augmentation on Deep Learning and Image Processing
Zulkarnain, Izuardo
Nurmalasari, Rin Rin
Azizah, Fazat Nur
Proceeding of 2022 16th International Conference on Telecommunication Systems Services and Applications, TSSA 2022, 2022
[5] Feature Extraction for Side Scan Sonar Image Based on Deep Learning
Tang, Yanghua
Wang, Hongjian
Xiao, Yao
Gao, Wei
Wang, Zhao
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021: 8416 - 8421
[6] Image data augmentation techniques based on deep learning: A survey
Zeng W.
Mathematical Biosciences and Engineering, 2024, 21 (06) : 6190 - 6224
[7] ImageDC: Image Data Cleaning Framework Based on Deep Learning
Zhang, Yun
Jin, Zongze
Liu, Fan
Zhu, Weilin
Mu, Weimin
Wang, Weiping
PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020: 748 - 752
[8] A KEYLESS EXTRACTION FRAMEWORK TARGETING AT DEEP LEARNING BASED IMAGE-WITHIN-IMAGE MODELS
Peng, Rongxuan
Mo, Xianbo
Tan, Shunquan
Li, Bin
Huang, Jiwu
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024: 12846 - 12850
[9] Deep-learning-based image reconstruction with limited data: generating synthetic raw data using deep learning
Zijlstra, Frank
While, Peter Thomas
MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE, 2024, 37 (06): 1059 - 1076
[10] Information Extraction Method of Part Machining Features Based on Image Deep Learning
Zhang, Shengwen
Zhou, Xi
Li, Bincheng
Cheng, Dejun
Chen, Wendi
Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (03): 348 - 355