An angular shrinkage BERT model for few-shot relation extraction with none-of-the-above detection

Cited by: 3
Authors
Wang, Junwen [1]
Gao, Yongbin [1]
Fang, Zhijun [1]
Affiliation
[1] Shanghai University of Engineering Science, Shanghai, People's Republic of China
Keywords
Few-shot learning; Relation extraction; None-of-the-above detection
DOI
10.1016/j.patrec.2023.01.002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Few-shot relation extraction aims to solve the problem of insufficient annotated data in relation extraction tasks. By comparing samples, few-shot relation extraction achieves lower-cost relation classification. However, most existing methods classify only within the scope of enumerated relations. One of the main challenges in applying few-shot relation extraction, the recognition of none-of-the-above instances, has received little attention. In this paper, we propose an angular shrinkage BERT model for the few-shot relation extraction task with none-of-the-above detection. The model uses an additive angular loss to enlarge the margins between different classes in the feature space and to obtain highly discriminative features, which improves its ability to recognize none-of-the-above instances. In addition, we present a two-stage training strategy to enhance the stability of the performance. We evaluate our model on FewRel, the most widely used few-shot relation extraction dataset. Experimental results show that our approach outperforms previous sentence-pair methods in scenarios containing none-of-the-above instances and also improves on our baseline model in the traditional few-shot relation extraction task. © 2023 Elsevier B.V. All rights reserved.
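The abstract names an additive angular loss but this record does not give its formula. As a rough illustration only, the sketch below shows a generic additive angular margin softmax loss of the kind used in face recognition, assuming that the margin is added to the angle between a feature and its target class vector. The function names, the `margin` and `scale` values, and the use of explicit class vectors are all assumptions for illustration and are not taken from the paper.

```python
import math


def normalize(vector):
    """Scale a vector to unit L2 norm (assumes the vector is non-zero)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def additive_angular_loss(feature, class_vectors, target, margin=0.5, scale=30.0):
    """Cross-entropy over scaled cosines, with a margin added to the target angle.

    Hypothetical helper: `margin` (radians) and `scale` are illustrative
    defaults, not values reported in the paper.
    """
    unit_feature = normalize(feature)
    logits = []
    for index, class_vector in enumerate(class_vectors):
        unit_class = normalize(class_vector)
        cos_theta = sum(a * b for a, b in zip(unit_feature, unit_class))
        # Guard against rounding slightly outside [-1, 1] before acos.
        cos_theta = max(-1.0, min(1.0, cos_theta))
        if index == target:
            # Penalize the target class by widening its angle, capped at pi
            # so the cosine stays monotonically decreasing.
            theta = math.acos(cos_theta)
            cos_theta = math.cos(min(math.pi, theta + margin))
        logits.append(scale * cos_theta)
    # Numerically stable log-sum-exp for the softmax denominator.
    largest = max(logits)
    log_sum_exp = largest + math.log(sum(math.exp(z - largest) for z in logits))
    return log_sum_exp - logits[target]


if __name__ == "__main__":
    feature = [1.0, 1.0]
    class_vectors = [[1.0, 0.0], [0.0, 1.0]]
    print(additive_angular_loss(feature, class_vectors, target=0, margin=0.0, scale=10.0))
    print(additive_angular_loss(feature, class_vectors, target=0, margin=0.5, scale=10.0))
```

With the margin set to zero the function reduces to ordinary softmax cross-entropy over scaled cosine similarities. A positive margin raises the loss for the same input, so training has to pull features closer to their own class direction, which is the general mechanism by which such losses enlarge inter-class margins.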
Pages: 151-158
Page count: 8