Attending to Entity Class Attributes for Named Entity Recognition with Few-Shot Learning

被引:0
作者
Patel, Raj Nath [1 ]
Dutta, Sourav [1 ]
Assem, Haytham [2 ]
机构
[1] Huawei Res, Dublin, Ireland
[2] Amazon Alexa AI, Cambridge, England
来源
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, INTELLISYS 2023 | 2024年 / 824卷
关键词
Named entity recognition; Knowledge graph; Few-shot learning; Transformers; Attention mask; TEXT;
D O I
10.1007/978-3-031-47715-7_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition (NER) serves as the foundation for several natural language applications like question answering, chat-bots and intent classification. Identification of entity boundaries and its categorization into entity types poses a significant challenge in domain-dependent and low-resource settings, with limited training data availability. To this end, we propose AtEnA, a novel NER framework utilizing entity class attributes from external knowledge source for few-shot learning. We use a two-stage fine-tuning process, wherein a language model is initially trained to "attend" to the different entity class attributes along with the textual context, and is then fine-tuned for the downstream application data with few annotated training examples. Experiments on benchmark NER datasets depict AtEnA to perform around 10 F1 score points better than the existing NER methodologies, specifically for fewshot limited training scenarios.
引用
收藏
页码:859 / 870
页数:12
相关论文
共 31 条
[1]  
Aone C., 1998, MUC-7
[2]  
Aone C., 2022, A trainable summarizer with knowledge acquired from robust NLP techniques, P71
[3]  
Bocklisch T., 2017, NIPS WORKSH CONV AI
[4]   Named entity disambiguation in short texts over knowledge graphs [J].
Bouarroudj, Wissem ;
Boufaida, Zizette ;
Bellatreche, Ladjel .
KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (02) :325-351
[5]  
Cheng PX, 2020, AAAI CONF ARTIF INTE, V34, P7554
[6]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[7]   Hidden Markov models [J].
Eddy, SR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :361-365
[8]   Unsupervised named-entity extraction from the Web: An experimental study [J].
Etzioni, O ;
Cafarella, M ;
Downey, D ;
Popescu, AM ;
Shaked, T ;
Soderland, S ;
Weld, DS ;
Yates, A .
ARTIFICIAL INTELLIGENCE, 2005, 165 (01) :91-134
[9]   Recent Named Entity Recognition and Classification techniques: A systematic review [J].
Goyal, Archana ;
Gupta, Vishal ;
Kumar, Manish .
COMPUTER SCIENCE REVIEW, 2018, 29 :21-43
[10]   Support vector machines [J].
Hearst, MA .
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04) :18-21