GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text

被引:97
作者
Zhu, Qile [1 ,2 ]
Li, Xiaolin [1 ,3 ]
Conesa, Ana [4 ,5 ]
Pereira, Cecile [4 ]
机构
[1] Univ Florida, Natl Sci Fdn Ctr Big Learning, Gainesville, FL 32611 USA
[2] Univ Florida, Dept Comp & Informat Sci & Engn, Gainesville, FL 32611 USA
[3] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
[4] Univ Florida, Inst Food & Agr Sci, Dept Microbiol & Cell Sci, Gainesville, FL 32611 USA
[5] Ctr Invest Principe Felipe, Genom Gene Express Lab, Valencia 42012, Spain
基金
美国国家卫生研究院; 美国食品与农业研究所; 美国国家科学基金会;
关键词
NEURAL-NETWORKS; NORMALIZATION;
D O I
10.1093/bioinformatics/btx815
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Best performing named entity recognition (NER) methods for biomedical literature are based on hand-crafted features or task-specific rules, which are costly to produce and difficult to generalize to other corpora. End-to-end neural networks achieve state-of-the-art performance without hand-crafted features and task-specific knowledge in non-biomedical NER tasks. However, in the biomedical domain, using the same architecture does not yield competitive performance compared with conventional machine learning models. Results: We propose a novel end-to-end deep learning approach for biomedical NER tasks that leverages the local contexts based on n-gram character and word embeddings via Convolutional Neural Network (CNN). We call this approach GRAM-CNN. To automatically label a word, this method uses the local information around a word. Therefore, the GRAM-CNN method does not require any specific knowledge or feature engineering and can be theoretically applied to a wide range of existing NER problems. The GRAM-CNN approach was evaluated on three well-known biomedical datasets containing different BioNER entities. It obtained an F1-score of 87.26% on the Biocreative II dataset, 87.26% on the NCBI dataset and 72.57% on the JNLPBA dataset. Those results put GRAM-CNN in the lead of the biological NER methods. To the best of our knowledge, we are the first to apply CNN based structures to BioNER problems.
引用
收藏
页码:1547 / 1554
页数:8
相关论文
共 40 条
[1]  
ABADI M., 2015, TensorFlow: large-scale machine learning on heterogeneous systems
[2]  
Ananiadou S, 1994, P 15 INT C COMP LING, P1034, DOI DOI 10.3115/991250.991317
[3]  
Ando R., 2007, P 2 BIOCREATIVE CHAL, P101
[4]  
[Anonymous], 2012, CoRR
[5]  
[Anonymous], 2016, ARXIV161107709
[6]  
[Anonymous], 2001, PROC 18 INT C MACH L
[7]  
[Anonymous], 2016, Proceedings of the 15th workshop on biomedical natural language processing, DOI DOI 10.18653/V1/W16-2922
[8]  
Bird S, 2009, Natural language processing with python, DOI DOI 10.5555/1717171
[9]   Gimli: open source and high-performance biomedical name recognition [J].
Campos, David ;
Matos, Sergio ;
Oliveira, Jose Luis .
BMC BIOINFORMATICS, 2013, 14
[10]  
Collier N., 2000, P 18 C COMPUTATIONAL, V1, P201, DOI [DOI 10.3115/990820.990850, 10.3115/990820.990850]