Improving distantly supervised named entity recognition by emphasizing uncertain examples

被引:0
作者
Nie, Binling [1 ]
Shao, Yiming [1 ]
Wang, Yigang [1 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Peoples R China
关键词
Named entity recognition; Distantly supervised; Uncertainty estimation;
D O I
10.1007/s10044-024-01392-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distantly supervised named entity recognition (DS-NER) aims to acquire knowledge from noisy labels. Recently, label re-weighting and label correction based frameworks have been recognized as promising approaches for DS-NER. These methods mainly handle easy or hard examples, yet neglect the impact of uncertain examples that are predicted correctly sometimes and incorrectly some other times during optimization. In this paper, we propose UE-NER, an Uncertainty Estimation method for DS-NER, which estimates the uncertainty of training examples and emphasizes uncertain ones, thus leads to more accurate and robust performance. To enable uncertainty reasoning, we formulate DS-NER as a span-level classification problem and the variance in predicted probability of the correct class across iterations of minibatch SGD is taken as the uncertainty measure. We further design an enhanced encoder to combine the power of the named entity and other spans in the sentence to boost recognition performance. Experimental results on two benchmark datasets demonstrate the superiority of the proposed UE-NER over existing DS-NER methods.
引用
收藏
页数:12
相关论文
共 38 条
[1]  
Ba JL, 2016, arXiv, DOI 10.48550/arXiv:1607.06450
[2]  
Bojanowski P., 2017, T ASSOC COMPUT LING, V5, P135, DOI [DOI 10.1162/TACLA00051, DOI 10.1162/TACL_A_00051, 10.1162/tacl_a_00051]
[3]  
Cao YX, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P261
[4]  
Chen H, 2019, AAAI CONF ARTIF INTE, P6236
[5]  
Chen Miao, 2020, P 3 CLIN NAT LANG PR, P234, DOI DOI 10.18653/V1/2020.CLINICALNLP-1.26
[6]  
Chiu J. P., 2016, Transactions of the Association for Computational Linguistics, V4, P357, DOI [DOI 10.1162/TACLA00104, 10.1162/tacl_a_00104]
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Gabor K., 2018, P 12 INT WORKSH SEM, P679, DOI [DOI 10.18653/V1/S18-1111, 10.18653/v1/S18-1111]
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Huang ZH, 2015, Arxiv, DOI [arXiv:1508.01991, 10.48550/arXiv.1508.01991]