Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-Grained Student Ensemble

被引:0
|
作者
Qu, Xiaoye [1 ]
Zeng, Jun [2 ]
Liu, Daizong [3 ]
Wang, Zhefeng [1 ]
Huai, Baoxing [1 ]
Zhou, Pan [4 ]
机构
[1] Huawei Cloud, Shenzhen, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Software Engn, Wuhan, Peoples R China
[3] Peking Univ, Beijing, Peoples R China
[4] Huazhong Univ Sci & Technol, Sch Cyber Sci & Engn, Hubei Engn Res Ctr Big Data Secur, Hubei Key Lab Distributed Syst Secur, Wuhan, Peoples R China
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11 | 2023年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity problem in NER by automatically generating training samples. Unfortunately, the distant supervision may induce noisy labels, thus undermining the robustness of the learned models and restricting the practical application. To relieve this problem, recent works adopt self-training teacher-student frameworks to gradually refine the training labels and improve the generalization ability of NER models. However, we argue that the performance of the current self-training frameworks for DS-NER is severely underestimated by their plain designs, including both inadequate student learning and coarse-grained teacher updating. Therefore, in this paper, we make the first attempt to alleviate these issues by proposing: (1) adaptive teacher learning comprised of joint training of two teacher-student networks and considering both consistent and inconsistent predictions between two teachers, thus promoting comprehensive student learning. (2) fine-grained student ensemble that updates each fragment of the teacher model with a temporal moving average of the corresponding fragment of the student, which enhances consistent predictions on each model fragment against noise. To verify the effectiveness of our proposed method, we conduct experiments on four DS-NER datasets. The experimental results demonstrate that our method significantly surpasses previous SOTA methods. The code is available at https://github.com/zenhjunpro/ATSEN.
引用
收藏
页码:13501 / 13509
页数:9
相关论文
共 50 条
  • [1] Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling
    Zhang, Haoyu
    Long, Dingkun
    Xu, Guangwei
    Zhu, Muhua
    Xie, Pengjun
    Huang, Fei
    Wang, Ji
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3808 - 3815
  • [2] Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning
    Zhang, Xinghua
    Yu, Bowen
    Liu, Tingwen
    Zhang, Zhenyu
    Sheng, Jiawei
    Xue Mengge
    Xu, Hongbo
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10746 - 10757
  • [3] Improving Distantly-Supervised Named Entity Recognition with Self-Collaborative Denoising Learning
    Zhang, Xinghua
    Yu, Bowen
    Liu, Tingwen
    Zhang, Zhenyu
    Sheng, Jiawei
    Xue, Mengge
    Xu, Hongbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1518 - 1529
  • [4] Fine-Grained Named Entity Typing over Distantly Supervised Data Based on Refined Representations
    Ali, Muhammad Asif
    Sun, Yifang
    Li, Bing
    Wang, Wei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7391 - 7398
  • [5] Fine-Grained Named Entity Recognition for Sinhala
    Azeez, Rameela
    Ranathunga, Surangika
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 295 - 300
  • [6] Fine-grained Named Entity Recognition for Turkish
    Khudoyberdieva, Lola
    Diri, Banu
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [7] Multilingual Fine-Grained Named Entity Recognition
    Lupancu, Viorica-Camelia
    Iftene, Adrian
    COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2023, 31 (03) : 321 - 339
  • [8] Fine-grained Dutch named entity recognition
    Bart Desmet
    Véronique Hoste
    Language Resources and Evaluation, 2014, 48 : 307 - 343
  • [9] Fine-grained Dutch named entity recognition
    Desmet, Bart
    Hoste, Veronique
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (02) : 307 - 343
  • [10] Noise-Robust Training with Dynamic Loss and Contrastive Learning for Distantly-Supervised Named Entity Recognition
    Ma, Zhiyuan
    Du, Jintao
    Zhou, Shuheng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10119 - 10128