CycleNER: An Unsupervised Training Approach for Named Entity Recognition

被引:19
|
作者
Iovine, Andrea [1 ]
Fang, Anjie [2 ]
Fetahu, Besnik [2 ]
Rokhlenko, Oleg [2 ]
Malmasi, Shervin [2 ]
机构
[1] Univ Bari Aldo Moro, Bari, Italy
[2] Amazoncom Inc, Bellevue, WA USA
来源
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22) | 2022年
关键词
natural language processing; named entity recognition; cycleconsistency; training; unsupervised training;
D O I
10.1145/3485447.3512012
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is a crucial natural language understanding task for many down-stream tasks such as question answering and retrieval. Despite significant progress in developing NER models for multiple languages and domains, scaling to emerging and/or low-resource domains still remains challenging, due to the costly nature of acquiring training data. We propose CycleNER, an unsupervised approach based on cycle-consistency training that uses two functions: (i) sentence-to-entity - S2E and (ii) entity-to-sentence - E2S, to carry out the NER task. CycleNER does not require annotations but a set of sentences with no entity labels and another independent set of entity examples. Through cycle-consistency training, the output from one function is used as input for the other (e.g. S2E. E2S) to align the representation spaces of both functions and therefore enable unsupervised training. Evaluation on several domains comparing CycleNER against supervised and unsupervised competitors shows that CycleNER achieves highly competitive performance with only a few thousand input sentences. We demonstrate competitive performance against supervised models, achieving 73% of supervised performance without any annotations on CoNLL03, while significantly outperforming unsupervised approaches.
引用
收藏
页码:2916 / 2924
页数:9
相关论文
共 50 条
  • [41] Deep learning for named entity recognition: a survey
    Hu Z.
    Hou W.
    Liu X.
    Neural Comput. Appl., 16 (8995-9022): : 8995 - 9022
  • [42] Named entity recognition for Hindi language : A survey
    Sharma, Richa
    Morwal, Sudha
    Agarwal, Basant
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2019, 22 (04) : 569 - 580
  • [43] Turkish Named Entity Recognition with Deep Learning
    Gunes, Asim
    Tantug, A. Cuneyd
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [44] Echo State Networks for Named Entity Recognition
    Ramamurthy, Rajkumar
    Stenzel, Robin
    Sifa, Rafet
    Ladi, Anna
    Bauckhage, Christian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 110 - 120
  • [45] A CONDITIONAL RANDOM FIELDS APPROACH TO BIOMEDICAL NAMED ENTITY RECOGNITION
    Wang Haochang Zhao Tiejun Li Sheng Yu Hao (School of Computer Science and Technology
    Journal of Electronics(China), 2007, (06) : 838 - 844
  • [46] Named Entity Recognition for Addresses: An Empirical Study
    CEOVIC, H. E. L. E. N. A.
    KURDIJA, A. D. R. I. A. N. S. A. T. J. A.
    DELAC, G. O. R. A. N.
    SILIC, M. A. R. I. N.
    IEEE ACCESS, 2022, 10 : 42094 - 42106
  • [47] Generalisation in named entity recognition: A quantitative analysis
    Augenstein, Isabelle
    Derczynski, Leon
    Bontcheva, Kalina
    COMPUTER SPEECH AND LANGUAGE, 2017, 44 : 61 - 83
  • [48] A Language Independent Approach for Named Entity Recognition in Subject Headings
    Freire, Nuno
    Borbinha, Jose
    Calado, Pavel
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, TPDL 2011, 2011, 6966 : 52 - 61
  • [49] Named Entity Recognition for Malayalam Language: A CRF based Approach
    Prasad, Gowri
    Fousiya, K. K.
    Kumar, M. Anand
    Soman, K. P.
    2015 INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES AND MANAGEMENT FOR COMPUTING, COMMUNICATION, CONTROLS, ENERGY AND MATERIALS (ICSTM), 2015, : 16 - 19
  • [50] Named Entity Recognition Approach for Malay Crime News Retrieval
    Saad, Saidah
    Mansor, Mohamed Kamil
    GEMA ONLINE JOURNAL OF LANGUAGE STUDIES, 2018, 18 (04): : 216 - 235