An angular shrinkage BERT model for few-shot relation extraction with none-of-the-above detection

被引:3
作者
Wang, Junwen [1 ]
Gao, Yongbin [1 ]
Fang, Zhijun [1 ]
机构
[1] Shanghai Univ Engn Sci, Shanghai, Peoples R China
关键词
Few-shot learning; Relation extraction; None-of-the-above detection;
D O I
10.1016/j.patrec.2023.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot relation extraction aims to solve the problem of insufficient annotated data in relation extrac-tion tasks. Through the comparison between samples, few-shot relation extraction achieves lower-cost relation classification. However, most existing methods only do classification within the scope of enumer-ated relations. For one of the main challenges faced by the application of few-shot relation extraction-the recognition of the none-of-the-above instances, there has been few works on it. In this paper, we pro-pose an angular shrinkage BERT model for the few-shot relation extraction task with none-of-the-above detection, which uses an additive angular loss to enlarge the margins of different classes in the feature space, and obtain highly discriminative features to improve the recognition ability for none-of-the-above instances. Meanwhile, we present a two-stage training strategy to enhance the stability of the perfor-mance. We evaluate our model on the most used few-shot relation extraction dataset FewRel. Experi-mental results show that our approach outperforms previous sentence-pair methods in scenarios con-taining none-of-the-above instances, and also achieves improvements on the traditional few-shot relation extraction task compared with our baseline model.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:151 / 158
页数:8
相关论文
共 27 条
[21]  
Yang S, 2021, ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, P987
[22]  
Ye ZX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2872
[23]  
Yu B., 2020, P 28 INT C COMP LING, P3842
[24]  
Zhang XS, 2019, AAAI CONF ARTIF INTE, P7484
[25]  
Zhang YH, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2205
[26]  
Zhao Y, 2019, PR MACH LEARN RES, V101, P1156
[27]  
Zhong ZX, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P50