Towards a Unified Multi-Domain Multilingual Named Entity Recognition Model

被引:0
作者
Kulkarni, Mayank [2 ]
Preotiuc-Pietro, Daniel [1 ]
Radhakrishnan, Karthik [1 ]
Winata, Genta Indra [1 ]
Wu, Shijie [1 ]
Xie, Lingjue [1 ]
Yang, Shaohua [1 ]
机构
[1] Bloomberg, New York, NY 10022 USA
[2] Amazon Alexa AI, Boston, MA USA
来源
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition is a key Natural Language Processing task whose performance is sensitive to choice of genre and language. A unified NER model across multiple genres and languages is more practical and efficient through leveraging commonalities across genres or languages. In this paper, we propose a novel setup for NER which includes multi-domain and multilingual training and evaluation across 13 domains and 4 languages. We explore a range of approaches to building a unified model using domain and language adaptation techniques. Our experiments highlight multiple nuances to consider while building a unified model, including that naive data pooling fails to obtain good performance, that domain-specific adaptations are more important than language-specific ones and that including domain-specific adaptations in a unified model can reach performance close to training multiple dedicated monolingual models at a fraction of their parameter count.
引用
收藏
页码:2210 / 2219
页数:10
相关论文
共 50 条
[21]   Towards Unified Multi-Domain Machine Translation With Mixture of Domain Experts [J].
Lu, Jinliang ;
Zhang, Jiajun .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 :3488-3498
[22]   A cascaded approach to biomedical named entity recognition using a unified model [J].
Chan, Shing-Kit ;
Lam, Wai ;
Yu, Xiaofeng .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :93-102
[23]   SatelliteNER: An Effective Named Entity Recognition Model for the Satellite Domain [J].
Jafari, Omid ;
Nagarkar, Parth ;
Thatte, Bhagwan ;
Ingram, Carl .
PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KMIS), VOL 3, 2020, :100-107
[24]   Towards Bangla Named Entity Recognition [J].
Chowdhury, Shammur Absar ;
Alam, Firoj ;
Khan, Naira .
2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
[25]   A Unified Model for Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media [J].
He, Hangfeng ;
Sun, Xu .
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, :3216-3222
[26]   Towards Robust Named Entity Recognition via Temporal Domain Adaptation and Entity Context Understanding [J].
Agarwal, Oshin .
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, :12866-12867
[27]   MMBERT: a unified framework for biomedical named entity recognition [J].
Lei Fu ;
Zuquan Weng ;
Jiheng Zhang ;
Haihe Xie ;
Yiqing Cao .
Medical & Biological Engineering & Computing, 2024, 62 :327-341
[28]   Tuning Multilingual Transformers for Named Entity Recognition on Slavic Languages [J].
Arkhipov, Mikhail ;
Trofimova, Maria ;
Kuratov, Yuri ;
Sorokin, Alexey .
7TH WORKSHOP ON BALTO-SLAVIC NATURAL LANGUAGE PROCESSING (BSNLP'2019), 2019, :89-93
[29]   On the Strength of Character Language Models for Multilingual Named Entity Recognition [J].
Yu, Xiaodong ;
Mayhew, Stephen ;
Sammons, Mark ;
Roth, Dan .
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, :3073-3077
[30]   Multilingual named entity recognition based on the BiGRU-CNN-CRF hybrid model [J].
Ayifu M. ;
Wushouer S. ;
Palidan M. .
International Journal of Information and Communication Technology, 2019, 15 (03) :223-242