FEW-NERD: A Few-shot Named Entity Recognition Dataset

被引:0
|
作者
Ding, Ning [1 ,3 ]
Xu, Guangwei [2 ]
Chen, Yulin [3 ]
Wang, Xiaobin [2 ]
Han, Xu [1 ]
Xie, Pengjun [2 ]
Zheng, Hai-Tao [3 ]
Liu, Zhiyuan [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
[3] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China
来源
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1 | 2021年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, considerable literature has grown up around the theme of few-shot named entity recognition (NER), but little published benchmark data specifically focused on the practical and challenging task. Current approaches collect existing supervised NER datasets and reorganize them into the few-shot setting for empirical study. These strategies conventionally aim to recognize coarse-grained entity types with few examples, while in practice, most unseen entity types are fine-grained. In this paper, we present FEW-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types. FEW-NERD consists of 188,238 sentences from Wikipedia, 4,601,160 words are included and each is annotated as context or a part of a two-level entity type. To the best of our knowledge, this is the first few-shot NER dataset and the largest human-crafted NER dataset. We construct benchmark tasks with different emphases to comprehensively assess the generalization capability of models. Extensive empirical results and analysis show that FEW-NERD is challenging and the problem requires further research.
引用
收藏
页码:3198 / 3213
页数:16
相关论文
共 50 条
  • [11] Few-Shot Named Entity Recognition: An Empirical Baseline Study
    Huang, Jiaxin
    Lie, Chunyuan
    Subudhi, Krishan
    Jose, Damien
    Balakrishnan, Shobana
    Chen, Weizhu
    Peng, Baolin
    Gao, Jianfeng
    Han, Jiawei
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10408 - 10423
  • [12] Attending to Entity Class Attributes for Named Entity Recognition with Few-Shot Learning
    Patel, Raj Nath
    Dutta, Sourav
    Assem, Haytham
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 3, INTELLISYS 2023, 2024, 824 : 859 - 870
  • [13] Few-shot Named Entity Recognition with Supported and Dependent Label Representations
    Miura, Yasuhide
    Takahashi, Takumi
    13TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING AND THE 3RD CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, IJCNLP-AACL 2023, 2023, : 385 - 391
  • [14] Joint span and token framework for few-shot named entity recognition
    Fang, Wenlong
    Liu, Yongbin
    Ouyang, Chunping
    Ren, Lin
    Li, Jiale
    Wan, Yaping
    AI OPEN, 2023, 4 : 111 - 119
  • [15] Few-shot Named Entity Recognition: Definition, Taxonomy and Research Directions
    Moscato, Vincenzo
    Postiglione, Marco
    Sperli, Giancarlo
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (05)
  • [16] CONTAINER: Few-Shot Named Entity Recognition via Contrastive Learning
    Das, Sarkar Snigdha Sarathi
    Katiyar, Arzoo
    Passonneau, Rebecca J.
    Zhang, Rui
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6338 - 6353
  • [17] In-context Learning for Few-shot Multimodal Named Entity Recognition
    Cai, Chenran
    Wang, Qianlong
    Liang, Bin
    Qin, Bing
    Yang, Min
    Wong, Kam-Fai
    Xu, Ruifeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2969 - 2979
  • [18] Few-shot Named Entity Recognition with Joint Token and Sentence Awareness
    Wen, Wen
    Liu, Yongbin
    Lin, Qiang
    Ouyang, Chunping
    DATA INTELLIGENCE, 2023, 5 (03) : 767 - 785
  • [19] A Self-training Approach for Few-Shot Named Entity Recognition
    Qian, Yudong
    Zheng, Weiguo
    WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422 : 183 - 191
  • [20] Few-shot Named Entity Recognition via encoder and class intervention
    Ding, Long
    Ouyang, Chunping
    Liu, Yongbin
    Tao, Zhihua
    Wan, Yaping
    Gao, Zheng
    AI OPEN, 2024, 5 : 39 - 45