Comparing Machine Learning Algorithms for Predicting Drug-Induced Liver Injury (DILI)

被引:55
作者
Minerali, Eni [1 ]
Foil, Daniel H. [1 ]
Zorn, Kimberley M. [1 ]
Lane, Thomas R. [1 ]
Ekins, Sean [1 ]
机构
[1] Collaborat Pharmaceut Inc, Raleigh, NC 27606 USA
基金
美国国家卫生研究院;
关键词
Assay Central; bayesian; drug-induced liver injury; machine learning; MegaTox; HEPATOTOXICITY; TOXICOLOGY; ATRIUM(R); WITHDRAWN; AGREEMENT; MODEL; RISK;
D O I
10.1021/acs.molpharmaceut.0c00326
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Drug-induced liver injury (DILI) is one the most unpredictable adverse reactions to xenobiotics in humans and the leading cause of postmarketing withdrawals of approved drugs. To date, these drugs have been collated by the FDA to form the DILIRank database, which classifies DILI severity and potential. These classifications have been used by various research groups in generating computational predictions for this type of liver injury. Recently, groups from Pfizer and AstraZeneca have collated DILI in vitro data and physicochemical properties for compounds that can be used along with data from the FDA to build machine learning models for DILI. In this study, we have used these data sets, as well as the Biopharmaceutics Drug Disposition Classification System data set, to generate Bayesian machine learning models with our inhouse software, Assay Central. The performance of all machine learning models was assessed through both the internal 5-fold cross-validation metrics and prediction accuracy of an external test set of compounds with known hepatotoxicity. The best-performing Bayesian model was based on the DILI-concern category from the DILIRank database with an ROC of 0.814, a sensitivity of 0.741, a specificity of 0.755, and an accuracy of 0.746. A comparison of alternative machine learning algorithms, such as k-nearest neighbors, support vector classification, AdaBoosted decision trees, and deep learning methods, produced similar statistics to those generated with the Bayesian algorithm in Assay Central. This study demonstrates machine learning models grouped in a tool called MegaTox that can be used to predict early-stage clinical compounds, as well as recent FDA-approved drugs, to identify potential DILI.
引用
收藏
页码:2628 / 2637
页数:10
相关论文
共 59 条
  • [1] Predicting Drug-Induced Liver Injury Using Ensemble Learning Methods and Molecular Fingerprints
    Ai, Haixin
    Chen, Wen
    Zhang, Li
    Huang, Liangchao
    Yin, Zimo
    Hu, Huan
    Zhao, Qi
    Zhao, Jian
    Liu, Hongsheng
    [J]. TOXICOLOGICAL SCIENCES, 2018, 165 (01) : 100 - 107
  • [2] Moving beyond Binary Predictions of Human Drug-Induced Liver Injury (DILI) toward Contrasting Relative Risk Potential
    Aleo, Michael D.
    Shah, Falgun
    Allen, Scott
    Barton, Hugh A.
    Costales, Chester
    Lazzaro, Sarah
    Leung, Louis
    Nilson, Andrea
    Obach, R. Scott
    Rodrigues, A. David
    Will, Yvonne
    [J]. CHEMICAL RESEARCH IN TOXICOLOGY, 2020, 33 (01) : 223 - 238
  • [3] [Anonymous], 2022, New Drugs at FDA: CDERs New Molecular Entities and New Therapeutic Biological Products
  • [4] [Anonymous], 2013, PLOS ONE
  • [5] [Anonymous], 2006, ICM 06 P 23 INTL C M
  • [6] BDDCS Applied to Over 900 Drugs
    Benet, Leslie Z.
    Broccatelli, Fabio
    Oprea, Tudor I.
    [J]. AAPS JOURNAL, 2011, 13 (04): : 519 - 547
  • [7] Carletta J, 1996, COMPUT LINGUIST, V22, P249
  • [8] Applicability Domain Analysis (ADAN): A Robust Method for Assessing the Reliability of Drug Property Predictions
    Carrio, Pau
    Pinto, Marta
    Ecker, Gerhard
    Sanz, Ferran
    Pastor, Manuel
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (05) : 1500 - 1511
  • [9] Chan R, 2018, TOXICOL RES-UK, V7, P358, DOI [10.1039/c8tx00016f, 10.1039/C8TX00016F]
  • [10] DILIrank: the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans
    Chen, Minjun
    Suzuki, Ayako
    Thakkar, Shraddha
    Yu, Ke
    Hu, Chuchu
    Tong, Weida
    [J]. DRUG DISCOVERY TODAY, 2016, 21 (04) : 648 - 653