SLNER: Chinese Few-Shot Named Entity Recognition with Enhanced Span and Label Semantics

被引：3

作者：

Ren, Zhe ^{[1
,2
]}

Qin, Xizhong ^{[1
,2
]}

Ran, Wensheng ^{[3
]}

机构：

[1] Xinjiang Univ, Coll Informat Sci & Engn, Urumqi 830049, Peoples R China

[2] Xinjiang Key Lab Signal Detect & Proc, Urumqi 830049, Peoples R China

[3] Xinjiang Uygur Autonomous Regin Prod Qual Supervis, Urumqi 830049, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 15期

关键词：

natural language processing; Chinese named entity recognition; few-shot learning; feature representation; label semantics; neural network; deep learning; attention mechanism; low-resource domain dataset;

D O I：

10.3390/app13158609

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Few-shot named entity recognition requires sufficient prior knowledge to transfer valuable knowledge to the target domain with only a few labeled examples. Existing Chinese few-shot named entity recognition methods suffer from inadequate prior knowledge and limitations in feature representation. In this paper, we utilize enhanced Span and Label semantic representations for Chinese few-shot Named Entity Recognition (SLNER) to address the problem. Specifically, SLNER utilizes two encoders. One encoder is used to encode the text and its spans, and we employ the biaffine attention mechanism and self-attention to obtain enhanced span representations. This approach fully leverages the internal composition of entity mentions, leading to more accurate feature representations. The other encoder encodes the full label names to obtain label representations. Label names are broad representations of specific entity categories and share similar semantic meanings with entities. This similarity allows label names to offer valuable prior knowledge in few-shot scenarios. Finally, our model learns to match span representations with label representations. We conducted extensive experiments on three sampling benchmark Chinese datasets and a self-built food safety risk domain dataset. The experimental results show that our model outperforms the F1 scores of 0.20-6.57% of previous state-of-the-art methods in few-shot settings.

引用

页数：19

共 43 条

[1] Joint entity recognition and relation extraction as a multi-head selection problem
Bekoulis, Giannis
Deleu, Johannes
Demeester, Thomas
Develder, Chris
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 34 - 45
[2] Chen JW, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P5711
[3] Chen Xiang, 2022, P 29 INT C COMP LING, P2374
[4] Chiu JPC., 2016, T ASS COMPUTATIONAL, V4, P357, DOI [DOI 10.1162/TACLA00104, 10.1162/tacl_a_00104, DOI 10.1162/TACL_A_00104]
[5] Cui LY, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P4115
[6] Cuiy LY, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P1835
[7] Das SSS, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P6338
[8] Devlin J., 2019, NAACL
[9] Chinese NER by Span-Level Self-Attention
Dong, Xiaoyu
Xin, Xin
Guo, Ping
[J]. 2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 68 - 72
[10] Hon YT, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1381

← 1 2 3 4 5 →