Chemical-gene relation extraction using recursive neural network

被引:34
作者
Lim, Sangrak [1 ]
Kang, Jaewoo [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, Anam Dong 5 Ga, Seoul 136713, South Korea
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2018年
基金
新加坡国家研究基金会;
关键词
D O I
10.1093/database/bay060
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this article, we describe our system for the CHEMPROT task of the BioCreative VI challenge. Although considerable research on the named entity recognition of genes and drugs has been conducted, there is limited research on extracting relationships between them. Extracting relations between chemical compounds and genes from the literature is an important element in pharmacological and clinical research. The CHEMPROT task of BioCreative VI aims to promote the development of text mining systems that can be used to automatically extract relationships between chemical compounds and genes. We tested three recursive neural network approaches to improve the performance of relation extraction. In the BioCreative VI challenge, we developed a tree-Long Short-Term Memory networks (tree-LSTM) model with several additional features including a position feature and a subtree containment feature, and we also applied an ensemble method. After the challenge, we applied additional pre-processing steps to the tree-LSTM model, and we tested the performance of another recursive neural network model called Stack-augmented Parser Interpreter Neural Network (SPINN). Our tree-LSTM model achieved an F-score of 58.53% in the BioCreative VI challenge. Our tree-LSTM model with additional pre-processing and the SPINN model obtained F-scores of 63.7 and 64.1%, respectively.
引用
收藏
页数:11
相关论文
共 29 条
[1]  
Abadi M., 2016, TENSORFLOW LARGESCAL
[2]  
[Anonymous], 2016, ARXIV160904873
[3]  
[Anonymous], 2016, ARXIV160301547
[4]  
[Anonymous], 2017, ARXIV170803743
[5]  
[Anonymous], 2017, ARXIV170202181
[6]  
[Anonymous], 2014, P 18 C EMP METH NAT, DOI DOI 10.3115/V1/D14-1082
[7]  
Bengio Y, 2001, ADV NEUR IN, V13, P932
[8]  
Bowman SR, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1466
[9]   Comparative experiments on learning information extractors for proteins and their interactions [J].
Bunescu, R ;
Ge, RF ;
Kate, RJ ;
Marcotte, EM ;
Mooney, RJ ;
Ramani, AK ;
Wong, YW .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) :139-155
[10]   Toward an automatic method for extracting cancer- and other disease-related point mutations from the biomedical literature [J].
Doughty, Emily ;
Kertesz-Farkas, Attila ;
Bodenreider, Olivier ;
Thompson, Gary ;
Adadey, Asa ;
Peterson, Thomas ;
Kann, Maricel G. .
BIOINFORMATICS, 2011, 27 (03) :408-415