Debiased and Denoised Entity Recognition from Distant Supervision

被引：0

作者：

Wang, Haobo ^{[1
]}

Dong, Yiwen ^{[1
]}

Xiao, Ruixuan ^{[1
]}

Huang, Fei ^{[2
]}

Chen, Gang ^{[1
]}

Zhao, Junbo ^{[1
]}

机构：

[1] Zhejiang Univ, Hangzhou, Peoples R China

[2] Alibaba Grp, Hangzhou, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While distant supervision has been extensively explored and exploited in NLP tasks like named entity recognition, a major obstacle stems from the inevitable noisy distant labels tagged unsupervisedly. A few past works approach this problem by adopting a self-training framework with a sample-selection mechanism. In this work, we innovatively identify two types of biases that were omitted by prior work, and these biases lead to inferior performance of the distant-supervised NER setup. First, we characterize the noise concealed in the distant labels as highly structural rather than fully randomized. Second, the self-training framework would ubiquitously introduce an inherent bias that causes erroneous behavior in both sample selection and eventually prediction. To cope with these problems, we propose a novel self-training framework, dubbed DesERT. This framework augments the conventional NER predicative pathway to a dual form that effectively adapts the sample-selection process to conform to its innate distributional-bias structure. The other crucial component of DesERT composes a debiased module aiming to enhance the token representations, hence the quality of the pseudo-labels. Extensive experiments are conducted to validate the DesERT. The results show that our framework establishes a new state-of-art performance, it achieves a +2.22% average F1 score improvement on five standardized benchmarking datasets. Lastly, DesERT demonstrates its effectiveness under a new DSNER benchmark where additional distant supervision comes from the ChatGPT model.

引用

页数：23

共 49 条

[1] [Anonymous], 2015, P WORKSHOP NOISY USE, DOI DOI 10.18653/V1/W15-4322
[2] [Anonymous], 2017, NEURIPS
[3] Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
Arazo, Eric
Ortego, Diego
Albert, Paul
O'Connor, Noel E.
McGuinness, Kevin
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[4] Balasuriya D., 2009, P 1ST2009 WORKSHOP P, P10
[5] Bang Yejin, 2023, ABS230204023 CORR
[6] Brown TB, 2020, ADV NEUR IN, V33
[7] Cao YX, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P261
[8] Chen BX, 2022, ADV NEUR IN
[9] Chen D., 2020, P 58 ANN M ASS COMP, P5940, DOI DOI 10.18653/V1/2020.ACL-MAIN.527
[10] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

← 1 2 3 4 5 →