Neural Architecture Comparison for Bibliographic Reference Segmentation: An Empirical Study

被引:0
|
作者
Cuellar Hidalgo, Rodrigo [1 ]
Pinto Elias, Raul [2 ]
Torres-Moreno, Juan-Manuel [3 ]
Vergara Villegas, Osslan Osiris [4 ]
Reyes Salgado, Gerardo [5 ]
Magadan Salazar, Andrea [2 ]
机构
[1] Biblioteca Daniel Cosio Villegas, Colegio Mexico, Carretera Picacho Ajusco 20, Mexico City 14110, Mexico
[2] Tecnol Nacl Mexico CENIDET, Cuernavaca 62490, Mexico
[3] Univ Avignon, Lab Informat Avignon, 339 Chemin Meinajaries, F-84911 Avignon 9, France
[4] Univ Autonoma Ciudad Juarez, Ind & Mfg Engn Dept, Ciudad Juarez 32310, Mexico
[5] Univ Rey Juan Carlos, Dept Informat & Estadist, Ave Alcalde de Mostoles, Madrid 28933, Spain
关键词
reference mining; BiLSTM; transformers; byte-pair encoding; Conditional Random Fields; EXTRACTION;
D O I
10.3390/data9050071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the realm of digital libraries, efficiently managing and accessing scientific publications necessitates automated bibliographic reference segmentation. This study addresses the challenge of accurately segmenting bibliographic references, a task complicated by the varied formats and styles of references. Focusing on the empirical evaluation of Conditional Random Fields (CRF), Bidirectional Long Short-Term Memory with CRF (BiLSTM + CRF), and Transformer Encoder with CRF (Transformer + CRF) architectures, this research employs Byte Pair Encoding and Character Embeddings for vector representation. The models underwent training on the extensive Giant corpus and subsequent evaluation on the Cora Corpus to ensure a balanced and rigorous comparison, maintaining uniformity across embedding layers, normalization techniques, and Dropout strategies. Results indicate that the BiLSTM + CRF architecture outperforms its counterparts by adeptly handling the syntactic structures prevalent in bibliographic data, achieving an F1-Score of 0.96. This outcome highlights the necessity of aligning model architecture with the specific syntactic demands of bibliographic reference segmentation tasks. Consequently, the study establishes the BiLSTM + CRF model as a superior approach within the current state-of-the-art, offering a robust solution for the challenges faced in digital library management and scholarly communication.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] A comparison of manual and automated neural architecture search for white matter tract segmentation
    Ari Tchetchenian
    Yanming Zhu
    Fan Zhang
    Lauren J. O’Donnell
    Yang Song
    Erik Meijering
    Scientific Reports, 13
  • [2] A comparison of manual and automated neural architecture search for white matter tract segmentation
    Tchetchenian, Ari
    Zhu, Yanming
    Zhang, Fan
    O'Donnell, Lauren J. J.
    Song, Yang
    Meijering, Erik
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [3] Neural Architecture for Tibetan Word Segmentation
    Chen, Mengzhu
    Zhao, Shengjie
    Yang, Kai
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 367 - 370
  • [4] EMPIRICAL STUDY OF REFERENCE
    GARDINER, GL
    COLLEGE & RESEARCH LIBRARIES, 1969, 30 (02): : 130 - &
  • [5] Upsampling Algorithms for Autoencoder Segmentation Neural Networks: A Comparison Study
    Kolarik, Martin
    Burget, Radim
    Riha, Kamil
    2019 11TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2019,
  • [6] ARCHITECTURE - A BIBLIOGRAPHIC GUIDE TO BASIC REFERENCE WORKS, HISTORIES, AND HANDBOOKS - EHRESMANN,DL
    SABLE, MH
    RQ, 1984, 24 (01): : 99 - 99
  • [7] How to design a deep neural network for retinal vessel segmentation: an empirical study
    Su, Yanzhou
    Cheng, Jian
    Cao, Guiqun
    Liu, Haijun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77
  • [8] Neural Architecture Search for Microscopy Cell Segmentation
    Zhu, Yanming
    Meijering, Erik
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2020, 2020, 12436 : 542 - 551
  • [9] A neural architecture for 3D segmentation
    Chella, A
    Maniscalco, U
    Pirrone, R
    NEURAL NETS, 2003, 2859 : 121 - 128
  • [10] Evolving blocks by segmentation for neural architecture search
    Zhao, Xiaoping
    Jiang, Liwen
    Slowik, Adam
    Zhang, Zhenman
    Xue, Yu
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (03): : 2016 - 2032