CLINER: exploring task-relevant features and label semantic for few-shot named entity recognition

被引:2
作者
Li, Xuewei [1 ,2 ,3 ]
Li, Xinliang [1 ,2 ,5 ]
Zhao, Mankun [1 ,2 ,3 ]
Yang, Ming [4 ]
Yu, Ruiguo [1 ,2 ,3 ]
Yu, Mei [1 ,2 ,3 ]
Yu, Jian [1 ,2 ,3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Tianjin Univ, Tianjin Key Lab Adv Networking TANKLab, Tianjin 300350, Peoples R China
[3] Kennesaw State Univ, Tianjin Key Lab Cognit Comp & Applicat, Tianjin 300350, Peoples R China
[4] Tianjin Univ, Coll Comp & Software Engn, Kennesaw, GA 30144 USA
[5] Tianjin Univ, Tianjin Int Engn Inst, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot named entity recognition; Contrastive learning; Label semantic;
D O I
10.1007/s00521-023-09285-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot named entity recognition aims at recognizing novel-class named entities in low resources scenarios. Low resource scenarios contain limited data in the support set with sparse labels. Existing methods neglect the relevance of the support set to the task and the semantics of label naming. In this paper, on the basis of contrastive learning, we propose a multi-task learning framework CLINER for Few-Shot NER. We construct a mechanism for joint learning of label semantic information and support set information. For label support set information, we find a view in the support set that is most relevant to the current task, maximizing the utilization of each support set. Momentum encoder, a dynamic queue, is constructed to keep track of positive and negative examples learned from previous support sets, and keep it updated. For label semantic information, it is implied in the label naming and is derived explicitly by pre-trained language encoder. Experiments demonstrate that our model improves the overall performance comparing with recent baseline models, achieves state-of-the-art results on the commonly used standard datasets. The source code of CLINER will be available at: https://github.com/yizumi426/CLINER.
引用
收藏
页码:4679 / 4691
页数:13
相关论文
共 40 条
[1]  
Akbik A., 2018, COLING 2018 27 INT C, P1638
[2]  
Athiwaratkun B, 2018, ARXIV
[3]  
Chen Ting, 2019, 25 AMERICAS C INFORM
[4]  
Chiu J.P., 2016, Transactions of the Association for Computational Linguistics, V4, P357, DOI 10.1162/tacl_a_00104
[5]  
Cuiy LY, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P1835
[6]  
Das SSS, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P6338
[7]   Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection [J].
Deng, Shumin ;
Zhang, Ningyu ;
Kang, Jiaojian ;
Zhang, Yichi ;
Zhang, Wei ;
Chen, Huajun .
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, :151-159
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]  
Ding N, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P3198
[10]   Multi-task Self-Supervised Visual Learning [J].
Doersch, Carl ;
Zisserman, Andrew .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2070-2079