Towards a Unified Multi-Domain Multilingual Named Entity Recognition Model

被引:0
作者
Kulkarni, Mayank [2 ]
Preotiuc-Pietro, Daniel [1 ]
Radhakrishnan, Karthik [1 ]
Winata, Genta Indra [1 ]
Wu, Shijie [1 ]
Xie, Lingjue [1 ]
Yang, Shaohua [1 ]
机构
[1] Bloomberg, New York, NY 10022 USA
[2] Amazon Alexa AI, Boston, MA USA
来源
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition is a key Natural Language Processing task whose performance is sensitive to choice of genre and language. A unified NER model across multiple genres and languages is more practical and efficient through leveraging commonalities across genres or languages. In this paper, we propose a novel setup for NER which includes multi-domain and multilingual training and evaluation across 13 domains and 4 languages. We explore a range of approaches to building a unified model using domain and language adaptation techniques. Our experiments highlight multiple nuances to consider while building a unified model, including that naive data pooling fails to obtain good performance, that domain-specific adaptations are more important than language-specific ones and that including domain-specific adaptations in a unified model can reach performance close to training multiple dedicated monolingual models at a fraction of their parameter count.
引用
收藏
页码:2210 / 2219
页数:10
相关论文
共 50 条
  • [21] A cascaded approach to biomedical named entity recognition using a unified model
    Chan, Shing-Kit
    Lam, Wai
    Yu, Xiaofeng
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 93 - 102
  • [22] SatelliteNER: An Effective Named Entity Recognition Model for the Satellite Domain
    Jafari, Omid
    Nagarkar, Parth
    Thatte, Bhagwan
    Ingram, Carl
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KMIS), VOL 3, 2020, : 100 - 107
  • [23] Towards Bangla Named Entity Recognition
    Chowdhury, Shammur Absar
    Alam, Firoj
    Khan, Naira
    [J]. 2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
  • [24] A Unified Model for Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media
    He, Hangfeng
    Sun, Xu
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3216 - 3222
  • [25] Towards Robust Named Entity Recognition via Temporal Domain Adaptation and Entity Context Understanding
    Agarwal, Oshin
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12866 - 12867
  • [26] MMBERT: a unified framework for biomedical named entity recognition
    Lei Fu
    Zuquan Weng
    Jiheng Zhang
    Haihe Xie
    Yiqing Cao
    [J]. Medical & Biological Engineering & Computing, 2024, 62 : 327 - 341
  • [27] Tuning Multilingual Transformers for Named Entity Recognition on Slavic Languages
    Arkhipov, Mikhail
    Trofimova, Maria
    Kuratov, Yuri
    Sorokin, Alexey
    [J]. 7TH WORKSHOP ON BALTO-SLAVIC NATURAL LANGUAGE PROCESSING (BSNLP'2019), 2019, : 89 - 93
  • [28] On the Strength of Character Language Models for Multilingual Named Entity Recognition
    Yu, Xiaodong
    Mayhew, Stephen
    Sammons, Mark
    Roth, Dan
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3073 - 3077
  • [29] Multilingual named entity recognition based on the BiGRU-CNN-CRF hybrid model
    Ayifu M.
    Wushouer S.
    Palidan M.
    [J]. International Journal of Information and Communication Technology, 2019, 15 (03) : 223 - 242
  • [30] MMBERT: a unified framework for biomedical named entity recognition
    Fu, Lei
    Weng, Zuquan
    Zhang, Jiheng
    Xie, Haihe
    Cao, Yiqing
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (01) : 327 - 341