Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning

被引:0
作者
Hu, Helan [1 ,2 ]
Si, Shuzheng [1 ,2 ]
Zhao, Haozhe [1 ,2 ]
Zeng, Shuang [3 ]
An, Kaikai [1 ,2 ]
Cai, Zefan [1 ,2 ]
Chang, Baobao [1 ,4 ]
机构
[1] Peking Univ, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing, Peoples R China
[3] Tencent Inc, Shenzhen, Peoples R China
[4] Jiangsu Normal Univ, Jiangsu Collaborat Innovat Ctr Language Abil, Xuzhou, Peoples R China
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024 | 2024年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the burden of annotation, but meanwhile suffers from the label noise. Recent works attempt to adopt the teacher-student framework to gradually refine the training labels and improve the overall robustness. However, we argue that these teacher-student methods achieve limited performance because the poor calibration of the teacher network produces incorrectly pseudo-labeled samples, leading to error propagation. Therefore, we attempt to mitigate this issue by proposing: (1) Uncertainty-Aware Teacher Learning that leverages the prediction uncertainty to reduce the number of incorrect pseudo labels in the self-training stage; (2) Student-Student Collaborative Learning that allows the transfer of reliable labels between two student networks instead of indiscriminately relying on all pseudo labels from its teacher, and further enables a full exploration of mislabeled samples rather than simply filtering unreliable pseudo-labeled samples. We evaluate our proposed method on five DS-NER datasets, demonstrating that our method is superior to the state-of-the-art DS-NER denoising methods.
引用
收藏
页码:5533 / 5546
页数:14
相关论文
共 36 条
[21]  
Ratinov Lev, 2009, PROC 13 C COMPUT NAT, P147
[22]  
Rizve Mamshad Nayeem, 2021, 9 INT C LEARNING RE
[23]  
Sang E. F., 2003, Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003
[24]  
Sanh V, 2020, Arxiv, DOI arXiv:1910.01108
[25]  
Shang JB, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2054
[26]  
Si SZ, 2023, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, P3883
[27]  
Si SZ, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P4839
[28]  
Si Shuzheng, 2022, P 29 INT C COMPUTATI, P2313
[29]  
Si Shuzheng, 2024, Advances in Neural Information Processing Systems, V36
[30]   Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization [J].
Wei, Hongxin ;
Feng, Lei ;
Chen, Xiangyu ;
An, Bo .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13723-13732