Continuous Prompt Enhanced Biomedical Entity Normalization

被引：2

作者：

Lai, Zhaohong ^{[1
,2
]}

Fu, Biao ^{[1
,2
]}

Wei, Shangfei ^{[1
,2
]}

Shi, Xiaodong ^{[1
,2
]}

机构：

[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China

[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China

来源：

NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II | 2022年 / 13552卷

关键词：

Prompt-BEN; Prompt learning; Contrastive loss; RECOGNITION;

D O I：

10.1007/978-3-031-17189-5_5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Biomedical entity normalization (BEN) aims to link the entity mentions in a biomedical text to referent entities in a knowledge base. Recently, the paradigm of large-scale language model pre-training and fine-tuning have achieved superior performance in BEN task. However, pre-trained language models like SAPBERT [21] typically contain hundreds of millions of parameters, and fine-tuning all parameters is computationally expensive. The latest research such as prompt technology is proposed to reduce the amount of parameters during the model training. Therefore, we propose a framework Prompt-BEN using continuous Prompt to enhance BEN, which just needs to fine-tune few parameters of prompt. Our method employs embeddings with the continuous prefix prompt to capture the semantic similarity between mention and terms. We also design a contrastive loss with synonym marginalized strategy for the BEN task. Finally, experimental results on three benchmark datasets demonstrated that our method achieves competitive or even greater linking accuracy than the state-of-the-art fine-tuning-based models while having about 600 times fewer tuned parameters.

引用

页码：61 / 72

页数：12

共 39 条

[21] BioCreative V CDR task corpus: a resource for chemical disease relation extraction
Li, Jiao
Sun, Yueping
Johnson, Robin J.
Sciaky, Daniela
Wei, Chih-Hsuan
Leaman, Robert
Davis, Allan Peter
Mattingly, Carolyn J.
Wiegers, Thomas C.
Lu, Zhiyong
[J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
[22] Li XLS, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P4582
[23] Focal Loss for Dense Object Detection
Lin, Tsung-Yi
Goyal, Priya
Girshick, Ross
He, Kaiming
Dollar, Piotr
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
[24] Liu F., 2021, P 2021 C N AM CHAPTE, P4228
[25] Liu Pengfei, 2021, arXiv
[26] Liu X, 2022, Arxiv, DOI [arXiv:2110.07602, 10.48550/arXiv.2110.07602]
[27] Liu X, 2023, Arxiv, DOI arXiv:2103.10385
[28] Mikolov T., 2013, P 26 INT C NEURAL IN, V26, DOI 10.5555/2999792.2999959
[29] Mondal I., 2019, Medical entity linking using triplet network, P95, DOI [10. 18653/v1/W19- 1912, DOI 10.18653/V1/W19-1912]
[30] Pennington J., 2014, C EMPIRICAL METHODS, P1532, DOI 10.3115/v1/D14-1162

← 1 2 3 4 →