Prompt robust large language model for Chinese medical named entity recognition

被引:0
作者
Chen, Yubo [1 ,2 ]
Zhang, Baoli [1 ]
Li, Sirui [3 ]
Jin, Zhuoran [1 ]
Cai, Zhengyuan [4 ]
Wang, Yingzheng [5 ]
Qiu, Delai [6 ]
Liu, ShengPing [6 ]
Zhao, Jun [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Key Lab Cognit & Decis Intelligence Complex Syst, 95 Zhongguancun East Rd, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, 1 Yanqihu East Rd, Beijing 101408, Peoples R China
[3] Univ Calif Berkeley, 1122 Univ Ave, Berkeley, CA 94702 USA
[4] Jinan Univ, Jinan, Guangdong, Peoples R China
[5] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 5, 8 East St, Beijing, Peoples R China
[6] Unisound AI Technol Co Ltd, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Named entity recognition; Large language model; Natural language processing; Model distillation; Nested entity;
D O I
10.1016/j.ipm.2025.104189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical Named Entity Recognition (NER) is crucial for constructing healthcare knowledge graphs and enhancing intelligent medical systems, yet it faces three challenges: data scarcity, low recall in nested entities annotation and high prompt sensitivity of generative NER model. In this paper, we aim to address the three challenges simultaneously. First, we construct a Multi-Scenario Medical NER dataset which is the largest medical NER dataset, including over 40,000 samples and over 3400 entity types with eight major scenarios: medical web data, online consultation, medical book, etc. Second, we propose a decomposed question answering based data annotation and selection method, which improved F1 score by 6% compared to direct annotation. Third, to enhance the robustness of large models to diverse prompts in real-world scenarios, we construct diverse prompt templates and implements dynamic prompt strategy during the training phase. Finally, we conducted a comprehensive set of experiments, and the results demonstrate the effectiveness of our annotation method and robustness training approach. Notably, the proposed framework achieves a 5% performance improvement on the test set compared to conventional methods. Moreover, our method enables a 7B parameter model to surpass a 32B parameter model, highlighting its superior efficiency and capability.
引用
收藏
页数:12
相关论文
共 30 条
[1]  
Chen B, 2025, Arxiv, DOI [arXiv:2403.04652, DOI 10.48550/ARXIV.2403.04652, 10.48550/arxiv.2403.04652]
[2]   MedNER: A Service-Oriented Framework for Chinese Medical Named-Entity Recognition with Real-World Application [J].
Chen, Weisi ;
Qiu, Pengxiang ;
Cauteruccio, Francesco .
BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (08)
[3]  
Dai Z., 2019, Named entity recognition using BERT BiLSTM CRF for Chinese electronic health records
[4]   Planarized sentence representation for nested named entity recognition [J].
Geng, Rushan ;
Chen, Yanping ;
Huang, Ruizhang ;
Qin, Yongbin ;
Zheng, Qinghua .
INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (04)
[5]   Research on Chinese medical named entity recognition based on collaborative cooperation of multiple neural network models [J].
Ji, Bin ;
Li, Shasha ;
Yu, Jie ;
Ma, Jun ;
Tang, Jintao ;
Wu, Qingbo ;
Tan, Yusong ;
Liu, Huijun ;
Ji, Yun .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 104
[6]  
Katiyar Arzoo., 2018, P 2018 C N AM CHAPTE, V1, P861, DOI [10.18653/v1/n18-1079, DOI 10.18653/V1/N18-1079]
[7]  
Li ZK, 2024, 2024 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2024, P557
[8]  
Liu Wei, 2021, P 50 NINETH ANN M AS, V1, P5847, DOI DOI 10.18653/V1/2021.ACL-LONG.454
[9]  
Munnangi M., 2024, P 2024 C N AM CHAPT, P3833
[10]   Chinese medical entity recognition based on the dual-branch TENER model [J].
Peng, Hui ;
Zhang, Zhichang ;
Liu, Dan ;
Qin, Xiaohui .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)