A Class-Rebalancing Self-Training Framework for Distantly-Supervised Named Entity Recognition

被引:0
|
作者
Li, Qi [1 ,2 ]
Xie, Tingyu [1 ,2 ]
Peng, Peng [2 ]
Wang, Hongwei [1 ,2 ]
Wang, Gaoang [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, ZJU UIUC Inst, Hangzhou, Zhejiang, Peoples R China
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023) | 2023年
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Distant supervision reduces the reliance on human annotation in the named entity recognition tasks. The class-level imbalanced distant annotation is a realistic and unexplored problem, and the popular method of self-training can not handle class-level imbalanced learning. More importantly, self-training is dominated by the high-performance class in selecting candidates, and deteriorates the low-performance class with the bias of generated pseudo label. To address the class-level imbalance performance, we propose a class-rebalancing self-training framework for improving the distantly-supervised named entity recognition. In candidate selection, a class-wise flexible threshold is designed to fully explore other classes besides the high-performance class. In label generation, injecting the distant label, a hybrid pseudo label is adopted to provide straight semantic information for the low-performance class. Experiments on five flat and two nested datasets show that our model achieves state-of-the-art results. We also conduct extensive research to analyze the effectiveness of the flexible threshold and the hybrid pseudo label.
引用
收藏
页码:11054 / 11068
页数:15
相关论文
共 50 条
  • [31] Distantly Supervised Named Entity Recognition via Confidence-Based Multi-Class Positive and Unlabeled Learning
    Zhou, Kang
    Li, Yuepei
    Li, Qi
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 7198 - 7211
  • [32] Semi-supervised disentangled framework for transferable named entity recognition
    Hao, Zhifeng
    Lv, Di
    Li, Zijian
    Cai, Ruichu
    Wen, Wen
    Xu, Boyan
    NEURAL NETWORKS, 2021, 135 : 127 - 138
  • [33] DISTALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem
    Banerjee, Somnath
    Dutta, Avik
    Agrawal, Aaditya
    Hazra, Rima
    Mukherjee, Animesh
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT X, ECML PKDD 2024, 2024, 14950 : 313 - 331
  • [34] MProto: Multi-Prototype Network with Denoised Optimal Transport for Distantly Supervised Named Entity Recognition
    Wu, Shuhui
    Shen, Yongliang
    Tan, Zeqi
    Ren, Wenqi
    Guo, Jietian
    Pu, Shiliang
    Lu, Weiming
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 2361 - 2374
  • [35] SEMI-SUPERVISED LANDCOVER CLASSIFICATION WITH ADAPTIVE PIXEL-REBALANCING SELF-TRAINING
    Lu, Xiaoqiang
    Cao, Guojin
    Gou, Tong
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 4611 - 4614
  • [36] Federated Self-training for Semi-supervised Audio Recognition
    Tsouvalas, Vasileios
    Saeed, Aaqib
    Ozcelebi, Tanir
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (06)
  • [37] SEMI-SUPERVISED FACE RECOGNITION WITH LDA SELF-TRAINING
    Zhao, Xuran
    Evans, Nicholas
    Dugelay, Jean-Luc
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [38] Semi-supervised Gait Recognition Based on Self-training
    Li, Yanan
    Yin, Yilong
    Liu, Lili
    Pang, Shaohua
    Yu, Qiuhong
    2012 IEEE NINTH INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL-BASED SURVEILLANCE (AVSS), 2012, : 288 - 293
  • [39] Distantly Supervised Biomedical Relation Extraction via Negative Learning and Noisy Student Self-Training
    Dai, Yuanfei
    Zhang, Bin
    Wang, Shiping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 1697 - 1708
  • [40] HAMNER: Headword Amplified Multi-Span Distantly Supervised Method for Domain Specific Named Entity Recognition
    Liu, Shifeng
    Sun, Yifang
    Li, Bing
    Wang, Wei
    Zhao, Xiang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8401 - 8408