A novel feature-based approach to extract drug-drug interactions from biomedical text

被引:64
作者
Bui, Quoc-Chinh [1 ]
Sloot, Peter M. A. [2 ,3 ,4 ]
van Mulligen, Erik M. [1 ]
Kors, Jan A. [1 ]
机构
[1] Erasmus Univ, Med Ctr Rotterdam, Dept Med Informat, NL-3000 DR Rotterdam, Netherlands
[2] Univ Amsterdam, Inst Informat, NL-1012 WX Amsterdam, Netherlands
[3] Nanyang Technol Univ, Complex Inst, Singapore 639798, Singapore
[4] ITMO Univ, St Petersburg, Russia
关键词
PROTEIN INTERACTION EXTRACTION; KERNEL; CORPUS;
D O I
10.1093/bioinformatics/btu557
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Knowledge of drug-drug interactions (DDIs) is crucial for health-care professionals to avoid adverse effects when co-administering drugs to patients. As most newly discovered DDIs are made available through scientific publications, automatic DDI extraction is highly relevant. Results: We propose a novel feature-based approach to extract DDIs from text. Our approach consists of three steps. First, we apply text preprocessing to convert input sentences from a given dataset into structured representations. Second, we map each candidate DDI pair from that dataset into a suitable syntactic structure. Based on that, a novel set of features is used to generate feature vectors for these candidate DDI pairs. Third, the obtained feature vectors are used to train a support vector machine (SVM) classifier. When evaluated on two DDI extraction challenge test datasets from 2011 and 2013, our system achieves F-scores of 71.1% and 83.5%, respectively, outperforming any state-of-the-art DDI extraction system.
引用
收藏
页码:3365 / 3371
页数:7
相关论文
共 25 条
[1]   All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning [J].
Airola, Antti ;
Pyysalo, Sampo ;
Bjoerne, Jari ;
Pahikkala, Tapio ;
Ginter, Filip ;
Salakoski, Tapio .
BMC BIOINFORMATICS, 2008, 9 (Suppl 11)
[2]   A robust approach to extract biomedical events from literature [J].
Bui, Quoc-Chinh ;
Sloot, Peter M. A. .
BIOINFORMATICS, 2012, 28 (20) :2654-2661
[3]   A hybrid approach to extract protein-protein interactions [J].
Bui, Quoc-Chinh ;
Katrenko, Sophia ;
Sloot, Peter M. A. .
BIOINFORMATICS, 2011, 27 (02) :259-265
[4]  
Chowdhury M., 2013, Proceedings of the the 7th international workshop on semanticevaluation (SemEval 2013), Atlanta, Georgia, USA, P351
[5]  
Chowdhury M.F.M., 2013, P 2013 C N AM CHAPT, P765
[6]   The structural and content aspects of abstracts versus bodies of full text journal articles are different [J].
Cohen, K. Bretonnel ;
Johnson, Helen L. ;
Verspoor, Karin ;
Roeder, Christophe ;
Hunter, Lawrence E. .
BMC BIOINFORMATICS, 2010, 11
[7]   Hospital admissions/visits associated with drug-drug interactions: a systematic review and meta-analysis [J].
Dechanont, Supinya ;
Maphanta, Sirada ;
Butthum, Bodin ;
Kongkaew, Chuenjid .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2014, 23 (05) :489-497
[8]  
GIULIANO C, 2006, P 11 C EUR CHAPT ASS, P401
[9]   Mining the pharmacogenomics literature-a survey of the state of the art [J].
Hahn, Udo ;
Cohen, K. Bretonnel ;
Garten, Yael ;
Shah, Nigam H. .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (04) :460-494
[10]   Extracting Drug-Drug Interaction from the Biomedical Literature Using a Stacked Generalization-Based Approach [J].
He, Linna ;
Yang, Zhihao ;
Zhao, Zhehuan ;
Lin, Hongfei ;
Li, Yanpeng .
PLOS ONE, 2013, 8 (06)