In silico Prediction of Drug Induced Liver Toxicity Using Substructure Pattern Recognition Method

被引:70
作者
Zhang, Chen [1 ]
Cheng, Feixiong [1 ,2 ]
Li, Weihua [1 ]
Liu, Guixia [1 ]
Lee, Philip W. [1 ]
Tang, Yun [1 ]
机构
[1] E China Univ Sci & Technol, Sch Pharm, Shanghai Key Lab New Drug Design, 130 Meilong Rd, Shanghai 200237, Peoples R China
[2] Vanderbilt Univ, Sch Med, Dept Biomed Informat, Nashville, TN 37212 USA
基金
中国国家自然科学基金;
关键词
Drug-induced liver injury; machine learning; substructure pattern recognition; structural alerts; PRIVILEGED STRUCTURES; NEAREST NEIGHBOR; READ-ACROSS; INJURY; CLASSIFICATION; HEPATOTOXICITY; SELECTION; MODELS;
D O I
10.1002/minf.201500055
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Drug-induced liver injury (DILI) is a leading cause of acute liver failure in the US and less severe liver injury worldwide. It is also one of the major reasons of drug withdrawal from the market. Thus, DILI has become one of the most important concerns of drugs, and should be predicted in very early stage of drug discovery process. In this study, a comprehensive data set containing 1317 diverse compounds was collected from publications. Then, high accuracy classification models were built using five machine learning methods based on MACCS and FP4 fingerprints after evaluating by substructure pattern recognition method. The best model was built using SVM method together with FP4 fingerprint at the IG value threshold of 0.0005. Its overall predictive accuracies were 79.7% and 64.5% for the training and test sets, separately, which yielded overall accuracy of 75.0% for the external validation dataset, consisting of 88 compounds collected from a benchmark DILI database - the Liver Toxicity Knowledge Base. This model could be used for drug-induced liver toxicity prediction. Moreover, some key substructure patterns correlated with drug-induced liver toxicity were also identified as structural alerts.
引用
收藏
页码:136 / 144
页数:9
相关论文
共 53 条
  • [1] [Anonymous], 2012, JAMA, V308, P2073
  • [2] Emerging chemical patterns: A new methodology for molecular classification and compound selection
    Auer, Jens
    Bajorath, Juergen
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) : 2502 - 2514
  • [3] Assessing the accuracy of prediction algorithms for classification: an overview
    Baldi, P
    Brunak, S
    Chauvin, Y
    Andersen, CAF
    Nielsen, H
    [J]. BIOINFORMATICS, 2000, 16 (05) : 412 - 424
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [6] The Liver Toxicity Knowledge Base: A Systems Approach to a Complex End Point
    Chen, M.
    Zhang, J.
    Wang, Y.
    Liu, Z.
    Kelly, R.
    Zhou, G.
    Fang, H.
    Borlak, J.
    Tong, W.
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2013, 93 (05) : 409 - 412
  • [7] Quantitative Structure-Activity Relationship Models for Predicting Drug-Induced Liver Injury Based on FDA-Approved Drug Labeling Annotation and Using a Large Collection of Drugs
    Chen, Minjun
    Hong, Huixiao
    Fang, Hong
    Kelly, Reagan
    Zhou, Guangxu
    Borlak, Jurgen
    Tong, Weida
    [J]. TOXICOLOGICAL SCIENCES, 2013, 136 (01) : 242 - 249
  • [8] FDA-approved drug labeling for the study of drug-induced liver injury
    Chen, Minjun
    Vijay, Vikrant
    Shi, Qiang
    Liu, Zhichao
    Fang, Hong
    Tong, Weida
    [J]. DRUG DISCOVERY TODAY, 2011, 16 (15-16) : 697 - 703
  • [9] Cheng FX, 2013, CURR TOP MED CHEM, V13, P1273
  • [10] Adverse Drug Events: Database Construction and in Silico Prediction
    Cheng, Feixiong
    Li, Weihua
    Wang, Xichuan
    Zhou, Yadi
    Wu, Zengrui
    Shen, Jie
    Tang, Yun
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (04) : 744 - 752