Language model based on deep learning network for biomedical named entity recognition

被引:2
作者
Hou, Guan [1 ]
Jian, Yuhao [1 ]
Zhao, Qingqing [1 ]
Quan, Xiongwen [1 ]
Zhang, Han [1 ]
机构
[1] Nankai Univ, Coll Artificial Intelligence, Tianjin, Peoples R China
关键词
Biomedical named entity recognition; Deep learning; Language model; Multi-task learning;
D O I
10.1016/j.ymeth.2024.04.013
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Biomedical Named Entity Recognition (BioNER) is one of the most basic tasks in biomedical text mining, which aims to automatically identify and classify biomedical entities in text. Recently, deep learning-based methods have been applied to Biomedical Named Entity Recognition and have shown encouraging results. However, many biological entities are polysemous and ambiguous, which is one of the main obstacles to the task of biomedical named entity recognition. Deep learning methods require large amounts of training data, so the lack of data also affect the performance of model recognition. To solve the problem of polysemous words and insufficient data, for the task of biomedical named entity recognition, we propose a multi-task learning framework fused with language model based on the BiLSTM-CRF architecture. Our model uses a language model to design a differential encoding of the context, which could obtain dynamic word vectors to distinguish words in different datasets. Moreover, we use a multi-task learning method to collectively share the dynamic word vector of different types of entities to improve the recognition performance of each type of entity. Experimental results show that our model reduces the false positives caused by polysemous words through differentiated coding, and improves the performance of each subtask by sharing information between different entity data. Compared with other state-of-the art methods, our model achieved superior results in four typical training sets, and achieved the best results in F1 values.
引用
收藏
页码:71 / 77
页数:7
相关论文
共 16 条
  • [1] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [2] Chiu J.P., 2016, Transactions of the Association for Computational Linguistics, V4, P357, DOI [10.1162/tacl_a_00104, DOI 10.1162/TACLA00104]
  • [3] A neural network multi-task learning approach to biomedical named entity recognition
    Crichton, Gamal
    Pyysalo, Sampo
    Chiu, Billy
    Korhonen, Anna
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [4] Deep learning with word embeddings improves biomedical named entity recognition
    Habibi, Maryam
    Weber, Leon
    Neves, Mariana
    Wiegandt, David Luis
    Leser, Ulf
    [J]. BIOINFORMATICS, 2017, 33 (14) : I37 - I48
  • [5] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [6] Biomedical named entity recognition and linking datasets: survey and our recent development
    Huang, Ming-Siang
    Lai, Po-Ting
    Lin, Pei-Yen
    You, Yu-Ting
    Tsai, Richard Tzong-Han
    Hsu, Wen-Lian
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (06) : 2219 - 2238
  • [7] Lample G., 2016, P 2016 C N AM CHAPT, P260, DOI [10.18653/v1/N16-1030, DOI 10.18653/V1/N16-1030]
  • [8] Semi-supervised sequence tagging with bidirectional language models
    Peters, Matthew E.
    Ammar, Waleed
    Bhagavatula, Chandra
    Power, Russell
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1756 - 1765
  • [9] Peters ME, 2018, ARXIV
  • [10] Pyysalo S., 2013, P 5 INT S LANG BIOL, P39