Few-shot biomedical NER empowered by LLMs-assisted data augmentation and multi-scale feature extraction

被引:0
作者
Zhao, Di [1 ,2 ,3 ]
Mu, Wenxuan [1 ]
Jia, Xiangxing [1 ]
Liu, Shuang [1 ]
Chu, Yonghe [4 ]
Meng, Jiana [1 ]
Lin, Hongfei [2 ]
机构
[1] Dalian Minzu Univ, Sch Comp Sci & Engn, Jinshitan St, Dalian 116650, Liaoning, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Liaoning, Peoples R China
[3] Dalian Yongia Elect Technol Co Ltd, Postdoctoral Workstn, Dalian 116024, Liaoning, Peoples R China
[4] Nantong Univ, Nantong 226019, Jiangsu, Peoples R China
关键词
Few-shot learning; ChatGPT; Data augmentation; Named entity recognition;
D O I
10.1186/s13040-025-00443-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Named Entity Recognition (NER) is a fundamental task in processing biomedical text. Due to the limited availability of labeled data, researchers have investigated few-shot learning methods to tackle this challenge. However, replicating the performance of fully supervised methods remains difficult in few-shot scenarios. This paper addresses two main issues. In terms of data augmentation, existing methods primarily focus on replacing content in the original text, which can potentially distort the semantics. Furthermore, current approaches often neglect sentence features at multiple scales. To overcome these challenges, we utilize ChatGPT to generate enriched data with distinct semantics for the same entities, thereby reducing noisy data. Simultaneously, we employ dynamic convolution to capture multi-scale semantic information in sentences and enhance feature representation based on PubMedBERT. We evaluated the experiments on four biomedical NER datasets (BC5CDR-Disease, NCBI, BioNLP11EPI, BioNLP13GE), and the results exceeded the current state-of-the-art models in most few-shot scenarios, including mainstream large language models like ChatGPT. The results confirm the effectiveness of the proposed method in data augmentation and model generalization.
引用
收藏
页数:18
相关论文
共 40 条
[31]  
Wang R, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P571
[32]  
Wang ZH, 2022, Arxiv, DOI arXiv:2205.11799
[33]   Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation [J].
Wu, Fangzhao ;
Liu, Junxin ;
Wu, Chuhan ;
Huang, Yongfeng ;
Xie, Xing .
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :3342-3348
[34]  
Yang Y, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6365
[35]   Decomposed Two-Stage Prompt Learning for Few-Shot Named Entity Recognition [J].
Ye, Feiyang ;
Huang, Liang ;
Liang, Senjie ;
Chi, KaiKai .
INFORMATION, 2023, 14 (05)
[36]   Leveraging Continuous Prompt for Few-Shot Named Entity Recognition in Electric Power Domain with Meta-Learning [J].
Yu, Yang ;
He, Wei ;
Kang, Yu-meng ;
Ji, You-lang .
DATA INTELLIGENCE, 2023, 5 (02) :494-509
[37]  
Zhang S., 2023, 11 INT C LEARN REPR
[38]   Exploring Modular Task Decomposition in Cross-domain Named Entity Recognition [J].
Zhang, Xinghua ;
Yu, Bowen ;
Wang, Yubin ;
Liu, Tingwen ;
Su, Taoyu ;
Xu, Hongbo .
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, :301-311
[39]  
Zhou R, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P2251
[40]   GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text [J].
Zhu, Qile ;
Li, Xiaolin ;
Conesa, Ana ;
Pereira, Cecile .
BIOINFORMATICS, 2018, 34 (09) :1547-1554