Computational Reproducibility of Named Entity Recognition methods in the biomedical domain

被引:0
作者
Garcia-Serrano, Ana [1 ]
Hennig, Sebastian [2 ]
Nuernberger, Andreas [2 ]
机构
[1] ETSI Informat UNED, Madrid, Spain
[2] Comp Sci Dept OVGU, Magdeburg, Germany
来源
PROCESAMIENTO DEL LENGUAJE NATURAL | 2021年 / 66期
关键词
Named Entity Recognition (NER); Biomedical; supervised and unsupervised models; Unified Medical Language System; EXTRACTION; METAMAP; CTAKES; FAMILY;
D O I
10.26342/2021-66-12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised Named Entity Recognition (NER) approaches do not depend on labelled data to function properly but rather on a source of knowledge, in which promising candidates can be looked up to find the corresponding concept. In the biomedical domain knowledge source like this already exists; namely the Unified Medical Language System (UMLS). In this paper, three different unsupervised NER models using UMLS, namely MetaMap, cTakes and MetaMapLite are evaluated and compared from the results published by Demner-Fushman, Rogers and Aronson (2017) and Reategui and Ratte (2018). The Unsupervised Biomedical Named Entity Recognition framework (UB-NER) is developed, with which the results of the experiments of the three models, five datasets and two NER tasks are presented.
引用
收藏
页码:141 / 152
页数:12
相关论文
共 23 条
[1]  
[Anonymous], 2007, P 45 M ASS COMP LING
[2]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[3]  
Benavent J, 2010, EXPERIENCES IMAGECLE, V1176
[4]   Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases [J].
Bhasuran, Balu ;
Murugesan, Gurusamy ;
Abdulkadhar, Sabenabanu ;
Natarajan, Jeyakumar .
JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 :1-9
[5]   Gimli: open source and high-performance biomedical name recognition [J].
Campos, David ;
Matos, Sergio ;
Oliveira, Jose Luis .
BMC BIOINFORMATICS, 2013, 14
[6]   Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition [J].
Cho, Minsoo ;
Ha, Jihwan ;
Park, Chihyun ;
Park, Sanghyun .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
[7]   MetaMap Lite: an evaluation of a new Java']Java implementation of MetaMap [J].
Demner-Fushman, Dina ;
Rogers, Willie J. ;
Aronson, Alan R. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (04) :841-844
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]   NCBI disease corpus: A resource for disease name recognition and concept normalization [J].
Dogan, Rezarta Islamaj ;
Leaman, Robert ;
Lu, Zhiyong .
JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 47 :1-10
[10]  
Garcia-Serrano A., 2019, LSI2 UNED EHEALTH KD, V2421