Continuous Prompt Enhanced Biomedical Entity Normalization

被引:2
作者
Lai, Zhaohong [1 ,2 ]
Fu, Biao [1 ,2 ]
Wei, Shangfei [1 ,2 ]
Shi, Xiaodong [1 ,2 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Minist Culture & Tourism, Key Lab Digital Protect & Intelligent Proc Intang, Xiamen, Peoples R China
来源
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II | 2022年 / 13552卷
关键词
Prompt-BEN; Prompt learning; Contrastive loss; RECOGNITION;
D O I
10.1007/978-3-031-17189-5_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biomedical entity normalization (BEN) aims to link the entity mentions in a biomedical text to referent entities in a knowledge base. Recently, the paradigm of large-scale language model pre-training and fine-tuning have achieved superior performance in BEN task. However, pre-trained language models like SAPBERT [21] typically contain hundreds of millions of parameters, and fine-tuning all parameters is computationally expensive. The latest research such as prompt technology is proposed to reduce the amount of parameters during the model training. Therefore, we propose a framework Prompt-BEN using continuous Prompt to enhance BEN, which just needs to fine-tune few parameters of prompt. Our method employs embeddings with the continuous prefix prompt to capture the semantic similarity between mention and terms. We also design a contrastive loss with synonym marginalized strategy for the BEN task. Finally, experimental results on three benchmark datasets demonstrated that our method achieves competitive or even greater linking accuracy than the state-of-the-art fine-tuning-based models while having about 600 times fewer tuned parameters.
引用
收藏
页码:61 / 72
页数:12
相关论文
共 39 条
  • [21] BioCreative V CDR task corpus: a resource for chemical disease relation extraction
    Li, Jiao
    Sun, Yueping
    Johnson, Robin J.
    Sciaky, Daniela
    Wei, Chih-Hsuan
    Leaman, Robert
    Davis, Allan Peter
    Mattingly, Carolyn J.
    Wiegers, Thomas C.
    Lu, Zhiyong
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [22] Li XLS, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P4582
  • [23] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [24] Liu F., 2021, P 2021 C N AM CHAPTE, P4228
  • [25] Liu Pengfei, 2021, arXiv
  • [26] Liu X, 2022, Arxiv, DOI [arXiv:2110.07602, 10.48550/arXiv.2110.07602]
  • [27] Liu X, 2023, Arxiv, DOI arXiv:2103.10385
  • [28] Mikolov T., 2013, P 26 INT C NEURAL IN, V26, DOI 10.5555/2999792.2999959
  • [29] Mondal I., 2019, Medical entity linking using triplet network, P95, DOI [10. 18653/v1/W19- 1912, DOI 10.18653/V1/W19-1912]
  • [30] Pennington J., 2014, C EMPIRICAL METHODS, P1532, DOI 10.3115/v1/D14-1162