Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs

被引:225
作者
Liu, Mei [1 ]
Wu, Yonghui [1 ]
Chen, Yukun [1 ]
Sun, Jingchun [1 ]
Zhao, Zhongming [1 ]
Chen, Xue-wen [2 ,3 ]
Matheny, Michael Edwin [1 ,4 ,5 ,6 ]
Xu, Hua [1 ]
机构
[1] Vanderbilt Univ, Sch Med, Dept Biomed Informat, Nashville, TN 37232 USA
[2] Univ Kansas, Bioinformat & Computat Life Sci Lab, Informat & Telecommun Technol Ctr, Lawrence, KS 66045 USA
[3] Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
[4] Vanderbilt Univ, Sch Med, Dept Biostat, Nashville, TN 37232 USA
[5] Vanderbilt Univ, Sch Med, Div Gen Internal Med, Nashville, TN 37232 USA
[6] Vet Hlth Adm, Nashville, TN USA
关键词
SIGNAL-DETECTION; DISCOVERY; SYSTEMS; RESOURCE; LIBRARY; EVENTS; KEGG;
D O I
10.1136/amiajnl-2011-000699
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Adverse drug reaction (ADR) is one of the major causes of failure in drug development. Severe ADRs that go undetected until the post-marketing phase of a drug often lead to patient morbidity. Accurate prediction of potential ADRs is required in the entire life cycle of a drug, including early stages of drug design, different phases of clinical trials, and post-marketing surveillance. Methods Many studies have utilized either chemical structures or molecular pathways of the drugs to predict ADRs. Here, the authors propose a machine-learning-based approach for ADR prediction by integrating the phenotypic characteristics of a drug, including indications and other known ADRs, with the drug's chemical structures and biological properties, including protein targets and pathway information. A large-scale study was conducted to predict 1385 known ADRs of 832 approved drugs, and five machine-learning algorithms for this task were compared. Results This evaluation, based on a fivefold cross-validation, showed that the support vector machine algorithm outperformed the others. Of the three types of information, phenotypic data were the most informative for ADR prediction. When biological and phenotypic features were added to the baseline chemical information, the ADR prediction model achieved significant improvements in area under the curve (from 0.9054 to 0.9524), precision (from 43.37% to 66.17%), and recall (from 49.25% to 63.06%). Most importantly, the proposed model successfully predicted the ADRs associated with withdrawal of rofecoxib and cerivastatin. Conclusion The results suggest that phenotypic information on drugs is valuable for ADR prediction. Moreover, they demonstrate that different models that combine chemical, biological, or phenotypic information can be built from approved drugs, and they have the potential to detect clinically important ADRs in both preclinical and post-marketing phases.
引用
收藏
页码:E28 / E35
页数:8
相关论文
共 49 条
[1]   Pharmacovigilance Data Mining With Methods Based on False Discovery Rates: A Comparative Simulation Study [J].
Ahmed, I. ;
Thiessard, F. ;
Miremont-Salame, G. ;
Begaud, B. ;
Tubert-Bitter, P. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2010, 88 (04) :492-498
[2]   False Discovery Rate Estimation for Frequentist Pharmacovigilance Signal Detection Methods [J].
Ahmed, I. ;
Dalmasso, C. ;
Haramburu, F. ;
Thiessard, F. ;
Broet, P. ;
Tubert-Bitter, P. .
BIOMETRICS, 2010, 66 (01) :301-309
[3]   A Bayesian neural network method for adverse drug reaction signal generation [J].
Bate, A ;
Lindquist, M ;
Edwards, IR ;
Olsson, S ;
Orre, R ;
Lansner, A ;
De Freitas, RM .
EUROPEAN JOURNAL OF CLINICAL PHARMACOLOGY, 1998, 54 (04) :315-321
[4]   Analysis of pharmacology data and the prediction of adverse drug reactions and off-target effects from chemical structure [J].
Bender, Andreas ;
Scheiber, Josef ;
Glick, Meir ;
Davies, John W. ;
Azzaoui, Kamal ;
Hamon, Jacques ;
Urban, Laszlo ;
Whitebread, Steven ;
Jenkins, Jeremy L. .
CHEMMEDCHEM, 2007, 2 (06) :861-873
[5]   ONE-SIDED CONFIDENCE CONTOURS FOR PROBABILITY DISTRIBUTION FUNCTIONS [J].
BIRNBAUM, ZW ;
TINGEY, FH .
ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (04) :592-596
[6]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[7]  
Bolton EE, 2010, ANN REP COMP CHEM, V4, P217, DOI 10.1016/S1574-1400(08)00012-1
[8]   Predicting Adverse Drug Events Using Pharmacological Network Models [J].
Cami, Aurel ;
Arnold, Alana ;
Manzi, Shannon ;
Reis, Ben .
SCIENCE TRANSLATIONAL MEDICINE, 2011, 3 (114)
[9]   Drug target identification using side-effect similarity [J].
Campillos, Monica ;
Kuhn, Michael ;
Gavin, Anne-Claude ;
Jensen, Lars Juhl ;
Bork, Peer .
SCIENCE, 2008, 321 (5886) :263-266
[10]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)