Towards a Unified Multi-Domain Multilingual Named Entity Recognition Model

Cited by: 0
Authors
Kulkarni, Mayank [2]
Preotiuc-Pietro, Daniel [1]
Radhakrishnan, Karthik [1]
Winata, Genta Indra [1]
Wu, Shijie [1]
Xie, Lingjue [1]
Yang, Shaohua [1]
Affiliations
[1] Bloomberg, New York, NY 10022 USA
[2] Amazon Alexa AI, Boston, MA USA
Source
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 | 2023
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Named Entity Recognition is a key Natural Language Processing task whose performance is sensitive to choice of genre and language. A unified NER model across multiple genres and languages is more practical and efficient through leveraging commonalities across genres or languages. In this paper, we propose a novel setup for NER which includes multi-domain and multilingual training and evaluation across 13 domains and 4 languages. We explore a range of approaches to building a unified model using domain and language adaptation techniques. Our experiments highlight multiple nuances to consider while building a unified model, including that naive data pooling fails to obtain good performance, that domain-specific adaptations are more important than language-specific ones and that including domain-specific adaptations in a unified model can reach performance close to training multiple dedicated monolingual models at a fraction of their parameter count.
Pages: 2210-2219
Page count: 10