DeepBIBX: Deep Learning for Image Based Bibliographic Data Extraction

被引:5
|
作者
Bhardwaj, Akansha [1 ,2 ]
Mercier, Dominik [1 ]
Dengel, Andreas [1 ]
Ahmed, Sheraz [1 ]
机构
[1] DFKI Kaiserslautern, Smart Data & Serv, Kaiserslautern, Germany
[2] Univ Fribourg, eXascale Infolab, Fribourg, Switzerland
来源
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II | 2017年 / 10635卷
基金
瑞士国家科学基金会;
关键词
Deep learning; Machine learning; Bibliographic data; Reference linking;
D O I
10.1007/978-3-319-70096-0_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extraction of structured bibliographic data from document images of non-native-digital academic content is a challenging problem that finds its application in the automation of cataloging systems in libraries and reference linking domain. The existing approaches discard the visual cues and focus on converting the document image to text and further identifying citation strings using trained segmentation models. Apart from the large training data, which these existing methods require, they are also language dependent. This paper presents a novel approach (DeepBIBX) which targets this problem from a computer vision perspective and uses deep learning to semantically segment the individual citation strings in a document image. DeepBIBX is based on deep Fully Convolutional Networks and uses transfer learning to extract bibliographic references from document images. Unlike existing approaches which use textual content to semantically segment bibliographic references, DeepBIBX utilizes image based contextual information, which makes it applicable to documents of any language. To gauge the performance of the presented approach, a dataset consisting of 286 document images containing 5090 bibliographic references is collected. Evaluation results reveals that the DeepBIBX outperforms state-of-the-art method (ParsCit, 71.7%) for bibliographic references extraction and achieved an accuracy of 84.9% in comparison to 71.7%. Furthermore, in terms of pixel classification task, DeepBIBX achieved a precision and a recall rate of 96.2%, 94.4% respectively.
引用
收藏
页码:286 / 293
页数:8
相关论文
共 50 条
  • [1] Transfer Learning and Deep Feature Extraction for Planktonic Image Data Sets
    Orenstein, Eric C.
    Beijbom, Oscar
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1082 - 1088
  • [2] Research on remote sensing image extraction based on deep learning
    Shun Z.
    Li D.
    Jiang H.
    Li J.
    Peng R.
    Lin B.
    Liu Q.
    Gong X.
    Zheng X.
    Liu T.
    PeerJ Computer Science, 2022, 8
  • [3] Research on remote sensing image extraction based on deep learning
    Shun, Zhao
    Li, Danyang
    Jiang, Hongbo
    Li, Jiao
    Peng, Ran
    Lin, Bin
    Liu, QinLi
    Gong, Xinyao
    Zheng, Xingze
    Liu, Tao
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [4] Table Information Extraction Using Data Augmentation on Deep Learning and Image Processing
    Zulkarnain, Izuardo
    Nurmalasari, Rin Rin
    Azizah, Fazat Nur
    Proceeding of 2022 16th International Conference on Telecommunication Systems Services and Applications, TSSA 2022, 2022,
  • [5] Feature Extraction for Side Scan Sonar Image Based on Deep Learning
    Tang, Yanghua
    Wang, Hongjian
    Xiao, Yao
    Gao, Wei
    Wang, Zhao
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8416 - 8421
  • [6] Image data augmentation techniques based on deep learning: A survey
    Zeng W.
    Mathematical Biosciences and Engineering, 2024, 21 (06) : 6190 - 6224
  • [7] ImageDC: Image Data Cleaning Framework Based on Deep Learning
    Zhang, Yun
    Jin, Zongze
    Liu, Fan
    Zhu, Weilin
    Mu, Weimin
    Wang, Weiping
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 748 - 752
  • [8] A KEYLESS EXTRACTION FRAMEWORK TARGETING AT DEEP LEARNING BASED IMAGE-WITHIN-IMAGE MODELS
    Peng, Rongxuan
    Mo, Xianbo
    Tan, Shunquan
    Li, Bin
    Huang, Jiwu
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 12846 - 12850
  • [9] Deep-learning-based image reconstruction with limited data: generating synthetic raw data using deep learning
    Zijlstra, Frank
    While, Peter Thomas
    MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE, 2024, 37 (06): : 1059 - 1076
  • [10] Information Extraction Method of Part Machining Features Based on Image Deep Learning
    Zhang, Shengwen
    Zhou, Xi
    Li, Bincheng
    Cheng, Dejun
    Chen, Wendi
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2022, 33 (03): : 348 - 355