Span-based Named Entity Recognition by Generating and Compressing Information

被引:0
作者
Nguyen, Nhung T. H. [1 ]
Miwa, Makoto [2 ,3 ]
Ananiadou, Sophia [1 ,3 ,4 ]
机构
[1] Univ Manchester, Natl Ctr Text Min, Dept Comp Sci, Manchester, Lancs, England
[2] Toyota Technol Inst, Toyota, Japan
[3] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr AIRC, Tokyo, Japan
[4] Alan Turing Inst, London, England
来源
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The information bottleneck (IB) principle has been proven effective in various NLP applications. The existing work, however, only used either generative or information compression models to improve the performance of the target task. In this paper, we propose to combine the two types of IB models into one system to enhance Named Entity Recognition (NER). For one type of IB model, we incorporate two unsupervised generative components, span reconstruction and synonym generation, into a span-based NER system. The span reconstruction ensures that the contextualised span representation keeps the span information, while the synonym generation makes synonyms have similar representations even in different contexts. For the other type of IB model, we add a supervised IB layer that performs information compression into the system to preserve useful features for NER in the resulting span representations. Experiments on five different corpora indicate that jointly training both generative and information compression models can enhance the performance of the baseline span-based NER system. Our source code is publicly available at https://github.com/nguyennth/joint-ib-models.
引用
收藏
页码:1984 / 1996
页数:13
相关论文
共 47 条
  • [1] Optuna: A Next-generation Hyperparameter Optimization Framework
    Akiba, Takuya
    Sano, Shotaro
    Yanase, Toshihiko
    Ohta, Takeru
    Koyama, Masanori
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2623 - 2631
  • [2] Alemi Alexander A., 2017, P INT C LEARNING REP, P3615
  • [3] The Unified Medical Language System (UMLS): integrating biomedical terminology
    Bodenreider, O
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D267 - D270
  • [4] Bowman S., 2016, P 20 SIGNLL C COMPUT, P10, DOI DOI 10.18653/V1/K16-1002
  • [5] Chen MD, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P215
  • [6] Chen W, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P4852
  • [7] Christopoulou F, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P11
  • [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [9] NCBI disease corpus: A resource for disease name recognition and concept normalization
    Dogan, Rezarta Islamaj
    Leaman, Robert
    Lu, Zhiyong
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 47 : 1 - 10
  • [10] Effland Thomas, 2019, OpenReview