Debiased and Denoised Entity Recognition from Distant Supervision

被引:0
作者
Wang, Haobo [1 ]
Dong, Yiwen [1 ]
Xiao, Ruixuan [1 ]
Huang, Fei [2 ]
Chen, Gang [1 ]
Zhao, Junbo [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While distant supervision has been extensively explored and exploited in NLP tasks like named entity recognition, a major obstacle stems from the inevitable noisy distant labels tagged unsupervisedly. A few past works approach this problem by adopting a self-training framework with a sample-selection mechanism. In this work, we innovatively identify two types of biases that were omitted by prior work, and these biases lead to inferior performance of the distant-supervised NER setup. First, we characterize the noise concealed in the distant labels as highly structural rather than fully randomized. Second, the self-training framework would ubiquitously introduce an inherent bias that causes erroneous behavior in both sample selection and eventually prediction. To cope with these problems, we propose a novel self-training framework, dubbed DesERT. This framework augments the conventional NER predicative pathway to a dual form that effectively adapts the sample-selection process to conform to its innate distributional-bias structure. The other crucial component of DesERT composes a debiased module aiming to enhance the token representations, hence the quality of the pseudo-labels. Extensive experiments are conducted to validate the DesERT. The results show that our framework establishes a new state-of-art performance, it achieves a +2.22% average F1 score improvement on five standardized benchmarking datasets. Lastly, DesERT demonstrates its effectiveness under a new DSNER benchmark where additional distant supervision comes from the ChatGPT model.
引用
收藏
页数:23
相关论文
共 49 条
  • [1] [Anonymous], 2015, P WORKSHOP NOISY USE, DOI DOI 10.18653/V1/W15-4322
  • [2] [Anonymous], 2017, NEURIPS
  • [3] Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
    Arazo, Eric
    Ortego, Diego
    Albert, Paul
    O'Connor, Noel E.
    McGuinness, Kevin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Balasuriya D., 2009, P 1ST2009 WORKSHOP P, P10
  • [5] Bang Yejin, 2023, ABS230204023 CORR
  • [6] Brown TB, 2020, ADV NEUR IN, V33
  • [7] Cao YX, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P261
  • [8] Chen BX, 2022, ADV NEUR IN
  • [9] Chen D., 2020, P 58 ANN M ASS COMP, P5940, DOI DOI 10.18653/V1/2020.ACL-MAIN.527
  • [10] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171