Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition

被引:27
作者
Ghaddar, Abbas [1 ]
Langlais, Philippe [2 ]
Rashid, Ahmad [1 ]
Rezagholizadeh, Mehdi [1 ]
机构
[1] Montreal Res Ctr, Huawei Noahs Ark Lab, Montreal, PQ, Canada
[2] Univ Montreal, RALI DIRO, Montreal, PQ, Canada
关键词
Computational linguistics - Long short-term memory;
D O I
10.1162/tacl_a_00386
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we examine the ability of NER models to use contextual information when predicting the type of an ambiguous entity. We introduce NRB, a new testbed carefully designed to diagnose Name Regularity Bias of NERmodels. Our results indicate that all stateof-the-art models we tested show such a bias; BERT fine-tuned models significantly outperforming feature-based (LSTM-CRF) ones on NRB, despite having comparable (sometimes lower) performance on standard benchmarks. To mitigate this bias, we propose a novel model-agnostic training method that adds learnable adversarial noise to some entity mentions, thus enforcing models to focus more strongly on the contextual signal, leading to significant gains on NRB. Combining it with two other training strategies, data augmentation and parameter freezing, leads to further gains.
引用
收藏
页码:586 / 604
页数:19
相关论文
共 97 条
  • [1] Agarwal Oshin, 2020, ARXIV PREPRINT ARXIV, DOI [10.1162/coli a_00397, DOI 10.1162/COLIA_00397]
  • [2] Akbik A, 2018, P 27 INT C COMP LING, P1638
  • [3] Al-Rfou R., 2015, P 2015 SIAM INT C DA, P586, DOI DOI 10.1137/1.9781611974010.66
  • [4] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [5] [Anonymous], 2018, ARXIV PREPRINT ARXIV
  • [6] Generalisation in named entity recognition: A quantitative analysis
    Augenstein, Isabelle
    Derczynski, Leon
    Bontcheva, Kalina
    [J]. COMPUTER SPEECH AND LANGUAGE, 2017, 44 : 61 - 83
  • [7] Balasubramanian S, 2020, 5TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2020), P205
  • [8] Bekoulis G, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2830
  • [9] Belinkov Yonatan, 2019, P 8 JOINT C LEX COMP, P256, DOI DOI 10.18653/V1/S19-1028
  • [10] Bernier-Colborne Gabriel, 2020, P 12 LANG RES EV C, P1697