Language model based on deep learning network for biomedical named entity recognition

被引：2

作者：

Hou, Guan ^{[1
]}

Jian, Yuhao ^{[1
]}

Zhao, Qingqing ^{[1
]}

Quan, Xiongwen ^{[1
]}

Zhang, Han ^{[1
]}

机构：

[1] Nankai Univ, Coll Artificial Intelligence, Tianjin, Peoples R China

来源：

METHODS | 2024年 / 226卷

关键词：

Biomedical named entity recognition; Deep learning; Language model; Multi-task learning;

D O I：

10.1016/j.ymeth.2024.04.013

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Biomedical Named Entity Recognition (BioNER) is one of the most basic tasks in biomedical text mining, which aims to automatically identify and classify biomedical entities in text. Recently, deep learning-based methods have been applied to Biomedical Named Entity Recognition and have shown encouraging results. However, many biological entities are polysemous and ambiguous, which is one of the main obstacles to the task of biomedical named entity recognition. Deep learning methods require large amounts of training data, so the lack of data also affect the performance of model recognition. To solve the problem of polysemous words and insufficient data, for the task of biomedical named entity recognition, we propose a multi-task learning framework fused with language model based on the BiLSTM-CRF architecture. Our model uses a language model to design a differential encoding of the context, which could obtain dynamic word vectors to distinguish words in different datasets. Moreover, we use a multi-task learning method to collectively share the dynamic word vector of different types of entities to improve the recognition performance of each type of entity. Experimental results show that our model reduces the false positives caused by polysemous words through differentiated coding, and improves the performance of each subtask by sharing information between different entity data. Compared with other state-of-the art methods, our model achieved superior results in four typical training sets, and achieved the best results in F1 values.

引用

页码：71 / 77

页数：7

共 16 条

[1]

[Anonymous], 2016, NAACL HLT 2016 2016, DOI [DOI 10.18653/V1/N16-1030, 10 . 18653 / v1 / N16-1030]

[2] Multitask learning [J].

Caruana, R .

MACHINE LEARNING, 1997, 28 (01) :41-75

[3] A neural network multi-task learning approach to biomedical named entity recognition [J].

Crichton, Gamal ;

Pyysalo, Sampo ;

Chiu, Billy ;

Korhonen, Anna .

BMC BIOINFORMATICS, 2017, 18

[4] Deep learning with word embeddings improves biomedical named entity recognition [J].

Habibi, Maryam ;

Weber, Leon ;

Neves, Mariana ;

Wiegandt, David Luis ;

Leser, Ulf .

BIOINFORMATICS, 2017, 33 (14) :I37-I48

[5]

Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]

[6] Biomedical named entity recognition and linking datasets: survey and our recent development [J].

Huang, Ming-Siang ;

Lai, Po-Ting ;

Lin, Pei-Yen ;

You, Yu-Ting ;

Tsai, Richard Tzong-Han ;

Hsu, Wen-Lian .

BRIEFINGS IN BIOINFORMATICS, 2020, 21 (06) :2219-2238

[7]

Nichols Eric, 2016, Transactions of the Association for Computational Linguistics, V4, P357

[8] Semi-supervised sequence tagging with bidirectional language models [J].

Peters, Matthew E. ;

Ammar, Waleed ;

Bhagavatula, Chandra ;

Power, Russell .

PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1756-1765

[9]

Pyysalo S., 2013, P 5 INT S LANG BIOL, P39

[10]

Radford A., 2018, Technical Report

← 1 2 →