FEW-NERD: A Few-shot Named Entity Recognition Dataset

Authors:

Ding, Ning [1, 3]

Xu, Guangwei [2]

Chen, Yulin [3]

Wang, Xiaobin [2]

Han, Xu [1]

Xie, Pengjun [2]

Zheng, Hai-Tao [3]

Liu, Zhiyuan [1]

Affiliations:

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

[2] Alibaba Grp, Hangzhou, Peoples R China

[3] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China

Source:

59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1 | 2021

Funding:

National Natural Science Foundation of China

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recently, considerable literature has grown up around the theme of few-shot named entity recognition (NER), but little published benchmark data focuses specifically on this practical and challenging task. Current approaches collect existing supervised NER datasets and reorganize them into the few-shot setting for empirical study. These strategies conventionally aim to recognize coarse-grained entity types with few examples, while in practice, most unseen entity types are fine-grained. In this paper, we present FEW-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types. FEW-NERD consists of 188,238 sentences from Wikipedia comprising 4,601,160 words, each annotated as context or as part of a two-level entity type. To the best of our knowledge, this is the first few-shot NER dataset and the largest human-crafted NER dataset. We construct benchmark tasks with different emphases to comprehensively assess the generalization capability of models. Extensive empirical results and analysis show that FEW-NERD is challenging and that the problem requires further research.
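
The two-level annotation described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each word carries one label string, that context words are tagged "O", and that entity labels join the coarse and fine types with a hyphen; the example sentence, the label names, and the helper split_label are hypothetical and are not taken from this record.

```python
from collections import Counter

CONTEXT = "O"  # assumed tag for words that are context, not part of an entity


def split_label(label, sep="-"):
    """Split a two-level label into (coarse, fine); context maps to (None, None)."""
    if label == CONTEXT:
        return None, None
    # Split on the first separator only, so the fine type may itself contain one.
    coarse, _, fine = label.partition(sep)
    return coarse, fine or None


# Hypothetical annotated sentence: one label per word.
words = ["Ada", "Lovelace", "visited", "London", "."]
labels = ["person-scholar", "person-scholar", "O", "location-city", "O"]

pairs = [split_label(label) for label in labels]

# Count labeled words per coarse-grained type, ignoring context words.
coarse_counts = Counter(coarse for coarse, _ in pairs if coarse is not None)
```

Keeping the coarse and fine parts separate makes it straightforward to evaluate at either level of the hierarchy, for instance by grouping fine-grained types under their coarse parent.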

Pages: 3198-3213

Page count: 16
