An angular shrinkage BERT model for few-shot relation extraction with none-of-the-above detection

被引:3
作者
Wang, Junwen [1 ]
Gao, Yongbin [1 ]
Fang, Zhijun [1 ]
机构
[1] Shanghai Univ Engn Sci, Shanghai, Peoples R China
关键词
Few-shot learning; Relation extraction; None-of-the-above detection;
D O I
10.1016/j.patrec.2023.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot relation extraction aims to solve the problem of insufficient annotated data in relation extrac-tion tasks. Through the comparison between samples, few-shot relation extraction achieves lower-cost relation classification. However, most existing methods only do classification within the scope of enumer-ated relations. For one of the main challenges faced by the application of few-shot relation extraction-the recognition of the none-of-the-above instances, there has been few works on it. In this paper, we pro-pose an angular shrinkage BERT model for the few-shot relation extraction task with none-of-the-above detection, which uses an additive angular loss to enlarge the margins of different classes in the feature space, and obtain highly discriminative features to improve the recognition ability for none-of-the-above instances. Meanwhile, we present a two-stage training strategy to enhance the stability of the perfor-mance. We evaluate our model on the most used few-shot relation extraction dataset FewRel. Experi-mental results show that our approach outperforms previous sentence-pair methods in scenarios con-taining none-of-the-above instances, and also achieves improvements on the traditional few-shot relation extraction task compared with our baseline model.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:151 / 158
页数:8
相关论文
共 27 条
[1]  
Chen GM, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P2501
[2]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Yang, Jing ;
Xue, Niannan ;
Kotsia, Irene ;
Zafeiriou, Stefanos .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :5962-5979
[3]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[4]  
Gao TY, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P6250
[5]  
Gao TY, 2019, AAAI CONF ARTIF INTE, P6407
[6]  
Liu W, 2017, ADV SOC SCI EDUC HUM, V99, P212
[7]  
Liu WY, 2016, PR MACH LEARN RES, V48
[8]  
Ranjan R, 2017, Arxiv, DOI arXiv:1703.09507
[9]  
Reimers N, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3982
[10]   Revisiting Few-shot Relation Classification: Evaluation Data and Classification Schemes [J].
Sabo, Ofer ;
Elazar, Yanai ;
Goldberg, Yoav ;
Dagan, Ido .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 :691-706