Span-Based Chinese Few-Shot NER with Contrastive and Prompt Learning

被引:0
作者
Ye, Feiyang [1 ]
Lai, Peichao [1 ]
Yang, Sanhe [1 ]
Zhang, Zhengfeng [1 ]
Wang, Yilei [1 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
来源
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024 | 2025年 / 15360卷
关键词
Named Entity Recognition; Contrastive Learning; Few-shot Learning;
D O I
10.1007/978-981-97-9434-8_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For Chinese Named Entity Recognition (NER) tasks, achieving better performance with fewer training samples remains a challenge. Previous works primarily focus on enhancing model performance in NER by incorporating additional knowledge to construct entity features. These approaches neglect the semantic information of entity labels and the information of entity boundaries. Moreover, conventional methods typically treat NER as a sequence labeling task, which makes them inadequate for addressing the issue of nested entities. We propose a new span-based approach by using contrastive learning and prompt learning to address these problems. By pulling similar entities closer together, pushing dissimilar entities further apart, and leveraging entity label information, we improve model performance in few-shot scenarios effectively. Experimental results demonstrate that our method achieves significant performance improvements on a sampled Chinese nested medical dataset and several other flattened datasets, providing a new insight into addressing challenges in few-shot NER tasks.
引用
收藏
页码:43 / 55
页数:13
相关论文
共 22 条
[1]  
Das SSS, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P6338
[2]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[3]  
Ding N, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P3198
[4]  
Gao TY, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P6894
[5]  
Jie Z, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P729
[6]  
Lai P., 2022, COLING, P2199
[7]  
Li JY, 2022, AAAI CONF ARTIF INTE, P10965
[8]  
Li X., 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 6836-6842, P6836, DOI DOI 10.18653/V1/2020.ACL-MAIN.611
[9]  
Liu W., 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), P5847, DOI [DOI 10.18653/V1/2021.ACL-LONG.454, 10.18653/v1/2021.acl-long.454]
[10]  
Ma R., 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, P5951, DOI [10.18653/v1/2020.acl-main.528, DOI 10.18653/V1/2020.ACL-MAIN.528, 10.18653/V1/2020.ACL-MAIN.528]