Few-shot biomedical NER empowered by LLMs-assisted data augmentation and multi-scale feature extraction

被引:0
作者
Zhao, Di [1 ,2 ,3 ]
Mu, Wenxuan [1 ]
Jia, Xiangxing [1 ]
Liu, Shuang [1 ]
Chu, Yonghe [4 ]
Meng, Jiana [1 ]
Lin, Hongfei [2 ]
机构
[1] Dalian Minzu Univ, Sch Comp Sci & Engn, Jinshitan St, Dalian 116650, Liaoning, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Liaoning, Peoples R China
[3] Dalian Yongia Elect Technol Co Ltd, Postdoctoral Workstn, Dalian 116024, Liaoning, Peoples R China
[4] Nantong Univ, Nantong 226019, Jiangsu, Peoples R China
关键词
Few-shot learning; ChatGPT; Data augmentation; Named entity recognition;
D O I
10.1186/s13040-025-00443-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Named Entity Recognition (NER) is a fundamental task in processing biomedical text. Due to the limited availability of labeled data, researchers have investigated few-shot learning methods to tackle this challenge. However, replicating the performance of fully supervised methods remains difficult in few-shot scenarios. This paper addresses two main issues. In terms of data augmentation, existing methods primarily focus on replacing content in the original text, which can potentially distort the semantics. Furthermore, current approaches often neglect sentence features at multiple scales. To overcome these challenges, we utilize ChatGPT to generate enriched data with distinct semantics for the same entities, thereby reducing noisy data. Simultaneously, we employ dynamic convolution to capture multi-scale semantic information in sentences and enhance feature representation based on PubMedBERT. We evaluated the experiments on four biomedical NER datasets (BC5CDR-Disease, NCBI, BioNLP11EPI, BioNLP13GE), and the results exceeded the current state-of-the-art models in most few-shot scenarios, including mainstream large language models like ChatGPT. The results confirm the effectiveness of the proposed method in data augmentation and model generalization.
引用
收藏
页数:18
相关论文
共 40 条
[1]  
Chawla A, 2021, 2021 IEEE 24 INT C I, P1
[2]   Few-shot biomedical named entity recognition via knowledge-guided instance generation and prompt contrastive learning [J].
Chen, Peng ;
Wang, Jian ;
Lin, Hongfei ;
Zhao, Di ;
Yang, Zhihao ;
Wren, Jonathan .
BIOINFORMATICS, 2023, 39 (08)
[3]  
Chen SG, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P5346
[4]  
Chen Xiang, 2022, P 29 INT C COMP LING, P2374, DOI DOI 10.48550/ARXIV.2109.00720
[5]   Similarity-Driven Adaptive Prototypical Network for Class-incremental Few-shot Named Entity Recognition [J].
Chen, Yifan ;
Huang, Zhan ;
Hu, Minghao ;
Li, Dongsheng ;
Wang, Changjian ;
Wang, Ankun ;
Wang, Boyang ;
Lu, Xicheng .
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, :219-227
[6]   Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition [J].
Cho, Minsoo ;
Ha, Jihwan ;
Park, Chihyun ;
Park, Sanghyun .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
[7]  
Das SSS, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P6338
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]  
Ding B, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6045
[10]  
Ding N, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P3198