CycleNER: An Unsupervised Training Approach for Named Entity Recognition

Cited by: 19
Authors
Iovine, Andrea [1]
Fang, Anjie [2]
Fetahu, Besnik [2]
Rokhlenko, Oleg [2]
Malmasi, Shervin [2]
Affiliations
[1] Univ Bari Aldo Moro, Bari, Italy
[2] Amazon.com Inc, Bellevue, WA, USA
Source
PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22) | 2022
Keywords
natural language processing; named entity recognition; cycle-consistency training; unsupervised training
DOI
10.1145/3485447.3512012
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Named Entity Recognition (NER) is a crucial natural language understanding task for many downstream tasks such as question answering and retrieval. Despite significant progress in developing NER models for multiple languages and domains, scaling to emerging and/or low-resource domains remains challenging because training data is costly to acquire. We propose CycleNER, an unsupervised approach based on cycle-consistency training that uses two functions, (i) sentence-to-entity (S2E) and (ii) entity-to-sentence (E2S), to carry out the NER task. CycleNER does not require annotations, only a set of sentences with no entity labels and an independent set of entity examples. Through cycle-consistency training, the output of one function is used as input for the other (e.g., S2E → E2S) to align the representation spaces of both functions and thereby enable unsupervised training. Evaluation on several domains against supervised and unsupervised competitors shows that CycleNER achieves highly competitive performance with only a few thousand input sentences: it reaches 73% of supervised performance on CoNLL03 without any annotations, while significantly outperforming unsupervised approaches.
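The two cycles described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the functions toy_s2e and toy_e2s are hypothetical rule-based stand-ins for the learned S2E and E2S models, and token_overlap_loss is an assumed toy reconstruction measure chosen only to make the data flow visible.

```python
from typing import Callable, List

Tokens = List[str]
Fn = Callable[[Tokens], Tokens]


def token_overlap_loss(predicted: Tokens, target: Tokens) -> float:
    """Toy reconstruction loss (assumption): 1 - Jaccard overlap of token sets."""
    p, t = set(predicted), set(target)
    if not p and not t:
        return 0.0
    return 1.0 - len(p & t) / len(p | t)


def s_cycle_loss(sentence: Tokens, s2e: Fn, e2s: Fn) -> float:
    """Sentence cycle: sentence -> S2E -> entities -> E2S -> sentence'."""
    entities = s2e(sentence)
    reconstructed = e2s(entities)
    return token_overlap_loss(reconstructed, sentence)


def e_cycle_loss(entities: Tokens, s2e: Fn, e2s: Fn) -> float:
    """Entity cycle: entities -> E2S -> sentence -> S2E -> entities'."""
    sentence = e2s(entities)
    recovered = s2e(sentence)
    return token_overlap_loss(recovered, entities)


# Hypothetical stand-ins for the two learned functions.
def toy_s2e(sentence: Tokens) -> Tokens:
    # Treat capitalized tokens as entities (illustration only).
    return [tok for tok in sentence if tok[:1].isupper()]


def toy_e2s(entities: Tokens) -> Tokens:
    # Wrap the entities in a fixed template sentence (illustration only).
    return entities + ["is", "mentioned"]


if __name__ == "__main__":
    print(s_cycle_loss(["Paris", "is", "large"], toy_s2e, toy_e2s))
    print(e_cycle_loss(["Paris", "Amazon"], toy_s2e, toy_e2s))
```

In an actual training setup, each cycle's reconstruction loss would be used to update the function that produces the final output, so that unlabeled sentences and an unpaired set of entity examples both supply a training signal.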
Pages: 2916-2924
Page count: 9