Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

被引:0
作者
Meng, Yu [1 ]
Zhang, Yunyi [1 ]
Huang, Jiaxin [1 ]
Wang, Xuan [1 ]
Zhang, Yu [1 ]
Ji, Heng [1 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
来源
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of training named entity recognition (NER) models using only distantly-labeled data, which can be automatically obtained by matching entity mentions in the raw text with entity types in a knowledge base. The biggest challenge of distantlysupervised NER is that the distant supervision may induce incomplete and noisy labels, rendering the straightforward application of supervised learning ineffective. In this paper, we propose (1) a noise-robust learning scheme comprised of a new loss function and a noisy label removal step, for training NER models on distantly-labeled data, and (2) a self-training method that uses contextualized augmentations created by pre-trained language models to improve the generalization ability of the NER model. On three benchmark datasets, our method achieves superior performance, outperforming existing distantlysupervised NER models by significant margins(1).
引用
收藏
页码:10367 / 10378
页数:12
相关论文
共 50 条
[41]   Research-based-named Entity Recognition Learning Text Biomedical Extraction by Adoption of Training Bidirectional Language Model (BiLM) [J].
Abed, Alshreef ;
Jingling, Yuan ;
Li, Lin .
Journal of Computers (Taiwan), 2020, 31 (04) :157-173
[42]   DeepSpacy-NER: an efficient deep learning model for named entity recognition for Punjabi language [J].
Singh, Navdeep ;
Kumar, Munish ;
Singh, Bavalpreet ;
Singh, Jaskaran .
EVOLVING SYSTEMS, 2023, 14 (04) :673-683
[43]   DeepSpacy-NER: an efficient deep learning model for named entity recognition for Punjabi language [J].
Navdeep Singh ;
Munish Kumar ;
Bavalpreet Singh ;
Jaskaran Singh .
Evolving Systems, 2023, 14 :673-683
[44]   Transfer Learning for Named Entity Recognition in Setswana Language Using CNN-BiLSTM Model [J].
Chabalala, Shumile ;
Ojo, Sunday O. ;
Owolawi, Pius A. .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (02) :472-481
[45]   A Robust Semi-Supervised Broad Learning System Guided by Ensemble-Based Self-Training [J].
Guo, Jifeng ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) :6410-6422
[46]   Semi-Supervised Learning Approach for Indonesian Named Entity Recognition (NER) Using Co-Training Algorithm [J].
Aryoyudanta, Bayu ;
Adji, Teguh Bharata ;
Llidayah, Lndriana .
2016 INTERNATIONAL SEMINAR ON INTELLIGENT TECHNOLOGY AND ITS APPLICATIONS (ISITIA): RECENT TRENDS IN INTELLIGENT COMPUTATIONAL TECHNOLOGIES FOR SUSTAINABLE ENERGY, 2016, :7-11
[47]   Semi-supervised deep learning based named entity recognition model to parse education section of resumes [J].
Gaur, Bodhvi ;
Saluja, Gurpreet Singh ;
Sivakumar, Hamsa Bharathi ;
Singh, Sanjay .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11) :5705-5718
[48]   Semi-supervised deep learning based named entity recognition model to parse education section of resumes [J].
Bodhvi Gaur ;
Gurpreet Singh Saluja ;
Hamsa Bharathi Sivakumar ;
Sanjay Singh .
Neural Computing and Applications, 2021, 33 :5705-5718
[49]   Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation [J].
Hwang, Min-Jae ;
Kulikov, Ilia ;
Peloquin, Benjamin ;
Gong, Hongyu ;
Chen, Peng-Jen ;
Lee, Ann .
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, :15524-15541
[50]   Language inference-based learning for Low-Resource Chinese clinical named entity recognition using language model [J].
Cui, Zhaojian ;
Yu, Kai ;
Yuan, Zhenming ;
Dong, Xiaofeng ;
Luo, Weibin .
JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149