Label-Description Enhanced Network for Few-Shot Named Entity Recognition

被引:0
作者
Zhang, Xinyue [1 ]
Gao, Hui [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII | 2023年 / 14261卷
关键词
Named Entity Recognition; Few-shot Setting; Domain Adaptation;
D O I
10.1007/978-3-031-44198-1_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the essential tasks in the context of natural language understanding, Few-shot Named Entity Recognition (NER) aims to identify and classify entities against limited samples. Recently, many works have attempted to enhance semantic representations by constructing prompt templates with text and label names. These methods, however, not only distract attention from the text, but also cause unnecessary enumerations. Furthermore, ambiguous label names always fail in delivering the intended meaning. To address the above issues, we present a Label-Description Enhanced Network (LaDEN) for few-shot named entity recognition, under which we propose a BERT-based Siamese network to incorporate fine-grained label descriptions as knowledge augmentation. The designed semantic attention mechanism captures label-specific textual representations, and the distance function matches similar token and label representations based on the nearest-neighbor criterion. Experimental results demonstrate that our model outperforms previous works in both few-shot and resource-rich settings, achieving state-of-the-art performance on five benchmarks. Our method is particularly efficient in low-resource scenarios, especially for cross-domain applications.
引用
收藏
页码:444 / 455
页数:12
相关论文
共 28 条
[1]  
Aly R, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), P1516
[2]  
Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339
[3]  
Collier N, 2004, P INT JOINT WORKSH N, P73
[4]  
Cui LY, 2021, Arxiv, DOI arXiv:2106.01760
[5]  
Derczynski L., 2017, P 3 WORKSH NOIS US G, P140, DOI [10.18653/v1/W17-4418,eprint:https://aclanthology.org/W17-4418.pdf, DOI 10.18653/V1/W17-4418]
[6]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[7]  
Ding N., 2021, arXiv, DOI DOI 10.48550/ARXIV.2108.10604
[8]  
Ding N, 2021, Arxiv, DOI arXiv:2105.07464
[9]   NCBI disease corpus: A resource for disease name recognition and concept normalization [J].
Dogan, Rezarta Islamaj ;
Leaman, Robert ;
Lu, Zhiyong .
JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 47 :1-10
[10]   Few-shot classification in Named Entity Recognition Task [J].
Fritzler, Alexander ;
Logacheva, Varvara ;
Kretov, Maksim .
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, :993-1000