Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning

被引：0

作者：

Hu, Helan ^{[1
,2
]}

Si, Shuzheng ^{[1
,2
]}

Zhao, Haozhe ^{[1
,2
]}

Zeng, Shuang ^{[3
]}

An, Kaikai ^{[1
,2
]}

Cai, Zefan ^{[1
,2
]}

Chang, Baobao ^{[1
,4
]}

机构：

[1] Peking Univ, Natl Key Lab Multimedia Informat Proc, Beijing, Peoples R China

[2] Peking Univ, Sch Software & Microelect, Beijing, Peoples R China

[3] Tencent Inc, Shenzhen, Peoples R China

[4] Jiangsu Normal Univ, Jiangsu Collaborat Innovat Ctr Language Abil, Xuzhou, Peoples R China

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024 | 2024年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the burden of annotation, but meanwhile suffers from the label noise. Recent works attempt to adopt the teacher-student framework to gradually refine the training labels and improve the overall robustness. However, we argue that these teacher-student methods achieve limited performance because the poor calibration of the teacher network produces incorrectly pseudo-labeled samples, leading to error propagation. Therefore, we attempt to mitigate this issue by proposing: (1) Uncertainty-Aware Teacher Learning that leverages the prediction uncertainty to reduce the number of incorrect pseudo labels in the self-training stage; (2) Student-Student Collaborative Learning that allows the transfer of reliable labels between two student networks instead of indiscriminately relying on all pseudo labels from its teacher, and further enables a full exploration of mislabeled samples rather than simply filtering unreliable pseudo-labeled samples. We evaluate our proposed method on five DS-NER datasets, demonstrating that our method is superior to the state-of-the-art DS-NER denoising methods.

引用

页码：5533 / 5546

页数：14

共 36 条

[21]

Ratinov Lev, 2009, PROC 13 C COMPUT NAT, P147

[22]

Rizve Mamshad Nayeem, 2021, 9 INT C LEARNING RE

[23]

Sang E. F., 2003, Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003

[24]

Sanh V, 2020, Arxiv, DOI arXiv:1910.01108

[25]

Shang JB, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2054

[26]

Si SZ, 2023, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, P3883

[27]

Si SZ, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P4839

[28]

Si Shuzheng, 2022, P 29 INT C COMPUTATI, P2313

[29]

Si Shuzheng, 2024, Advances in Neural Information Processing Systems, V36

[30] Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization [J].

Wei, Hongxin ;

Feng, Lei ;

Chen, Xiangyu ;

An, Bo .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13723-13732

← 1 2 3 4 →