ProfhEX: AI-based platform for small molecules liability profiling

Cited by: 5
Authors
Lunghini, Filippo [1]
Fava, Anna [1]
Pisapia, Vincenzo [2]
Sacco, Francesco [2]
Iaconis, Daniela [1]
Beccari, Andrea Rosario [1]
Affiliations
[1] Dompe Farmaceut SpA, EXSCALATE, Via Tommaso Amicis 95, I-80123 Naples, Italy
[2] SAS Inst, Profess Serv Dept, Via Darwin 20-22, I-20143 Milan, Italy
Keywords
Virtual screening; Liability profiling; Polypharmacology; Machine learning; Webservice; DRUG; POLYPHARMACOLOGY; PREDICTION; DISCOVERY; CHALLENGES; ANALOGS; DESIGN; TOOL;
DOI
10.1186/s13321-023-00728-6
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Off-target drug interactions are a major cause of candidate failure in drug discovery. Anticipating a drug's potential adverse effects at an early stage is necessary to minimize health risks to patients, animal testing, and economic costs. As virtual screening libraries keep growing, AI-driven methods can serve as first-tier screening tools that estimate the liabilities of drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that profile small molecules across 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data were collected from public and commercial sources. The entire chemical space comprised 289,202 activity data points for a total of 210,116 unique compounds, spanning 46 targets with dataset sizes ranging from 819 to 18,896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross-validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD = 0.05), a coefficient of determination (R²) of 0.68 (SD = 0.1), and a root mean squared error of 0.69 (SD = 0.08). All liability groups showed good hit-detection power, with an average enrichment factor at 5% of 13.1 (SD = 4.5) and an AUC of 0.92 (SD = 0.05). Benchmarking against existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. The platform will be further expanded with new targets and with complementary modelling approaches, such as structure- and pharmacophore-based models. ProfhEX is freely accessible at the following address: .
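The metrics quoted in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. The function names (`enrichment_factor`, `regression_metrics`) are hypothetical, and the definitions are the common textbook ones, which the abstract does not spell out: the enrichment factor is taken as the hit rate in the top-ranked fraction divided by the hit rate in the whole set, and R² as one minus the ratio of residual to total sum of squares.

```python
import math


def enrichment_factor(scores, is_active, fraction=0.05):
    """Hit rate among the top-scored `fraction` divided by the overall hit rate.

    Assumed definition; the record does not state the exact formula used.
    """
    n = len(scores)
    n_top = max(1, round(n * fraction))  # size of the top-ranked slice
    ranked = sorted(zip(scores, is_active), key=lambda pair: pair[0], reverse=True)
    hits_top = sum(1 for _, active in ranked[:n_top] if active)
    hits_all = sum(1 for active in is_active if active)
    if hits_all == 0:
        raise ValueError("enrichment factor is undefined without any actives")
    return (hits_top / n_top) / (hits_all / n)


def regression_metrics(y_true, y_pred):
    """Return (Pearson r, R^2, RMSE) for paired observed and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    ss_pred = sum((p - mean_p) ** 2 for p in y_pred)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    pearson = cov / math.sqrt(ss_tot * ss_pred)
    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / n)
    return pearson, r2, rmse


# Toy data (invented for illustration): 100 compounds, 5 actives,
# 4 of which are ranked within the top 5%.
toy_scores = [100 - i for i in range(100)]
toy_active = [i in (0, 1, 2, 3, 50) for i in range(100)]
ef5 = enrichment_factor(toy_scores, toy_active, fraction=0.05)

# Predictions shifted by a constant keep r at 1 but lower R^2.
r, r2, rmse = regression_metrics([1.0, 2.0, 3.0, 4.0], [2.0, 3.0, 4.0, 5.0])
```

The second toy case shows why the abstract reports both the Pearson coefficient and R²: a systematic offset in the predictions leaves the correlation untouched but is penalized by R² and by the root mean squared error.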
Pages: 17