Multi-domain evaluation framework for named entity recognition tools

被引:9
作者
Abdallah, Zahraa S. [1 ]
Carman, Mark [1 ]
Haffari, Gholamreza [1 ]
机构
[1] Monash Univ, Sch Informat Technol, Clayton, Vic, Australia
关键词
Named entity recognition; Multi-domain evaluation; Qualitative data analysis; Benchmark evaluation;
D O I
10.1016/j.csl.2016.10.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting structured information from unstructured text is important for the qualitative data analysis. Leveraging NLP techniques for qualitative data analysis will effectively accelerate the annotation process, allow for large-scale analysis and provide more insights into the text to improve the performance. The first step for gaining insights from the text is Named Entity Recognition (NER). A significant challenge that directly impacts the performance of the NER process is the domain diversity in qualitative data. The represented text varies according to its domain in many aspects including taxonomies, length, formality and format. In this paper we discuss and analyse the performance of state-of-the-art tools across domains to elaborate their robustness and reliability. In order to do that, we developed a standard, expandable and flexible framework to analyse and test tools performance using corpora representing text across various domains. We performed extensive analysis and comparison of tools across various domains and from various perspectives. The resulting comparison and analysis are of significant importance for providing a holistic illustration of the state-of-the-art tools. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:34 / 55
页数:22
相关论文
共 50 条
  • [21] MMBERT: a unified framework for biomedical named entity recognition
    Fu, Lei
    Weng, Zuquan
    Zhang, Jiheng
    Xie, Haihe
    Cao, Yiqing
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (01) : 327 - 341
  • [22] Evaluation of Named Entity Recognition in Handwritten Documents
    Villanova-Aparisi, David
    Martinez-Hinarejos, Carlos-D
    Romero, Veronica
    Pastor-Gadea, Moises
    [J]. DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 568 - 582
  • [23] MULTI-DOMAIN MACHINE LEARNING APPROACH OF NAMED ENTITY RECOGNITION FOR ARABIC BOOKING CHATBOT ENGINES USING PRE-TRAINED BIDIRECTIONAL TRANSFORMERS
    Sadder, Boshra
    Sadder, Rahma
    Abandah, Gheith
    Jafar, Iyad
    [J]. JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2024, 10 (01): : 1 - 16
  • [24] SatelliteNER: An Effective Named Entity Recognition Model for the Satellite Domain
    Jafari, Omid
    Nagarkar, Parth
    Thatte, Bhagwan
    Ingram, Carl
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KMIS), VOL 3, 2020, : 100 - 107
  • [25] Government Domain Named Entity Recognition for South African Languages
    Eiselen, Roald
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3344 - 3348
  • [26] Chinese Named Entity Recognition Within the Electric Power Domain
    Feng, Jun
    Wang, Hongkai
    Peng, Liangying
    Wang, Yidan
    Song, Haomin
    Guo, Hongju
    [J]. EMERGING INFORMATION SECURITY AND APPLICATIONS, EISA 2023, 2024, 2004 : 133 - 146
  • [27] Semantically-Informed Domain Adaptation for Named Entity Recognition
    Borovikova, Mariya
    Ferre, Arnaud
    Bossy, Robert
    Roche, Mathieu
    Nedellee, Claire
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2024, 2024, 14670 : 55 - 64
  • [28] Named entity recognition in medical domain combined with knowledge graph
    Jin Z.
    He X.
    Yue S.
    Xiong Y.
    Luo J.
    [J]. Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2023, 55 (05): : 50 - 58
  • [29] Using BERT and Augmentation in Named Entity Recognition for Cybersecurity Domain
    Tikhomirov, Mikhail
    Loukachevitch, N.
    Sirotina, Anastasiia
    Dobrov, Boris
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2020), 2020, 12089 : 16 - 24
  • [30] Named Entity Recognition in Aviation Products Domain Based on BERT
    Yang, Mingye
    Namoano, Bernadin
    Farsi, Maryam
    Erkoyuncu, John Ahmet
    [J]. IEEE ACCESS, 2024, 12 : 189710 - 189721