CycleNER: An Unsupervised Training Approach for Named Entity Recognition

Cited by: 19
Authors
Iovine, Andrea [1]
Fang, Anjie [2]
Fetahu, Besnik [2]
Rokhlenko, Oleg [2]
Malmasi, Shervin [2]
Affiliations
[1] Univ Bari Aldo Moro, Bari, Italy
[2] Amazon.com Inc, Bellevue, WA USA
Source
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22) | 2022
Keywords
natural language processing; named entity recognition; cycle-consistency training; unsupervised training
DOI
10.1145/3485447.3512012
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Named Entity Recognition (NER) is a crucial natural language understanding task for many downstream tasks such as question answering and retrieval. Despite significant progress in developing NER models for multiple languages and domains, scaling to emerging and/or low-resource domains remains challenging because training data is costly to acquire. We propose CycleNER, an unsupervised approach based on cycle-consistency training that uses two functions, (i) sentence-to-entity (S2E) and (ii) entity-to-sentence (E2S), to carry out the NER task. CycleNER does not require annotations; it needs only a set of sentences with no entity labels and another, independent set of entity examples. Through cycle-consistency training, the output from one function is used as input for the other (e.g. S2E → E2S) to align the representation spaces of both functions and thereby enable unsupervised training. Evaluation on several domains comparing CycleNER against supervised and unsupervised competitors shows that CycleNER achieves highly competitive performance with only a few thousand input sentences. We demonstrate competitive performance against supervised models, achieving 73% of supervised performance without any annotations on CoNLL03, while significantly outperforming unsupervised approaches.
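The data flow of the two cycles described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: `toy_s2e`, `toy_e2s`, and `token_mismatch` are hypothetical rule-based stand-ins chosen only so the example runs, whereas the paper's S2E and E2S are learned models and its training objective is not stated in this record. The sketch only shows how the output of one function feeds the other and how the round trip can be scored.

```python
from typing import Callable, List

# Hypothetical stand-ins for the two functions. In the paper these are
# trained models; the rules below exist only to make the cycles concrete.
def toy_s2e(sentence: str) -> List[str]:
    """Sentence-to-entity: treat capitalized tokens as entity mentions."""
    return [tok for tok in sentence.split() if tok[:1].isupper()]

def toy_e2s(entities: List[str]) -> str:
    """Entity-to-sentence: place the entities into a fixed template."""
    return " and ".join(entities) + " were mentioned"

def token_mismatch(a: List[str], b: List[str]) -> float:
    """Fraction of positions that differ; extra tokens count as mismatches."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    same = sum(x == y for x, y in zip(a, b))
    return 1.0 - same / longest

S2E = Callable[[str], List[str]]
E2S = Callable[[List[str]], str]

def s_cycle_loss(sentence: str, s2e: S2E, e2s: E2S) -> float:
    """Sentence cycle: sentence -> S2E -> entities -> E2S -> sentence'."""
    reconstructed = e2s(s2e(sentence))
    return token_mismatch(sentence.split(), reconstructed.split())

def e_cycle_loss(entities: List[str], s2e: S2E, e2s: E2S) -> float:
    """Entity cycle: entities -> E2S -> sentence -> S2E -> entities'."""
    recovered = s2e(e2s(entities))
    return token_mismatch(entities, recovered)

if __name__ == "__main__":
    # Unlabeled sentences and an independent entity set, as in the abstract.
    print(s_cycle_loss("Yesterday Alice met Bob", toy_s2e, toy_e2s))
    print(e_cycle_loss(["Alice", "Bob"], toy_s2e, toy_e2s))
```

In an actual training setup the mismatch score would be replaced by a differentiable reconstruction loss, and both functions would be updated to reduce it; that part is left out here because the record does not describe it.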
Pages: 2916-2924
Number of pages: 9