Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning

Cited by: 33
Authors
Ricci-Lopez, Joel [1, 2]
Aguila, Sergio A. [2 ]
Gilson, Michael K. [3 ]
Brizuela, Carlos A. [1 ]
Affiliations
[1] Ctr Invest Cient & Educ Super Ensenada CICESE, Ensenada 22860, Baja California, Mexico
[2] Univ Nacl Autonoma Mexico, Ctr Nanociencias & Nanotecnol, Ensenada 22860, Baja California, Mexico
[3] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
Keywords
PROTEIN FLEXIBILITY; RECEPTOR FLEXIBILITY; MOLECULAR DOCKING; SCORING FUNCTIONS; BINDING-AFFINITY; DRUG-DISCOVERY; LIGAND DOCKING; CONFORMATIONS; COMBINATION; IMPROVEMENT;
DOI
10.1021/acs.jcim.1c00511
Chinese Library Classification number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
One of the main challenges of structure-based virtual screening (SBVS) is incorporating the receptor's flexibility, because representing it explicitly in every docking run is computationally expensive. A common alternative is ensemble docking, in which ligands are docked against each member of a set of receptor conformations. However, there is still no agreement on how to combine the ensemble docking results to obtain the final ligand ranking. A common choice is to use consensus strategies to aggregate the ensemble docking scores, but these strategies yield only a slight improvement over the single-structure approach. Here, we hypothesize that applying machine learning (ML) methodologies to the ensemble docking results could improve the predictive power of SBVS. To test this hypothesis, four proteins were selected as case studies: CDK2, FXa, EGFR, and HSP90. Protein conformational ensembles were built from crystallographic structures, whereas the evaluated compound library comprised up to three benchmarking data sets (DUD, DEKOIS 2.0, and CSAR-2012) and cocrystallized molecules. Ensemble docking results were processed through 30 repetitions of 4-fold cross-validation to train and validate two ML classifiers: logistic regression and gradient boosting trees. Our results indicate that the ML classifiers significantly outperform traditional consensus strategies and even the best-performing case achieved with single-structure docking. We provide statistical evidence that supports the effectiveness of ML in improving ensemble docking performance.
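The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the docking scores are synthetic, the function names are hypothetical, the logistic regression is a plain gradient-descent version written with the standard library, ROC-AUC is assumed as the evaluation metric, and gradient boosting trees are omitted. It only shows the shape of the idea: each ligand is described by one docking score per receptor conformation, consensus strategies collapse that row to a single number, and a classifier instead learns from the whole row under 30 repetitions of stratified 4-fold cross-validation.

```python
import math
import random

def consensus_min(scores):
    """Consensus by lowest (most favorable) score across conformations."""
    return [min(row) for row in scores]

def consensus_mean(scores):
    """Consensus by mean score across conformations."""
    return [sum(row) / len(row) for row in scores]

def roc_auc(labels, values):
    """Probability that a random active outranks a random decoy (ties count 0.5)."""
    pos = [v for v, y in zip(values, labels) if y == 1]
    neg = [v for v, y in zip(values, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def stratified_folds(labels, k, rng):
    """Shuffle each class separately and deal its indices round-robin into k folds."""
    folds = [[] for _ in range(k)]
    for cls in (0, 1):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds

def column_stats(X):
    """Per-feature mean and standard deviation (std of 0 is replaced by 1)."""
    n = len(X)
    means = [sum(col) / n for col in zip(*X)]
    stds = [math.sqrt(sum((v - m) ** 2 for v in col) / n) or 1.0
            for col, m in zip(zip(*X), means)]
    return means, stds

def scale(X, means, stds):
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in X]

def fit_logistic(X, y, epochs=100, lr=0.5):
    """Unregularized logistic regression fitted by batch gradient descent."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for row, target in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, row))
            z = max(-30.0, min(30.0, z))  # guard against overflow in exp
            err = 1.0 / (1.0 + math.exp(-z)) - target
            grad_b += err
            grad_w = [g + err * xj for g, xj in zip(grad_w, row)]
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def repeated_cv_auc(X, y, k=4, repeats=30, seed=0):
    """Mean held-out ROC-AUC over `repeats` rounds of stratified k-fold CV."""
    rng = random.Random(seed)
    aucs = []
    for _ in range(repeats):
        for fold in stratified_folds(y, k, rng):
            held_out = set(fold)
            train = [i for i in range(len(y)) if i not in held_out]
            # Scaling parameters come from the training split only.
            means, stds = column_stats([X[i] for i in train])
            w, b = fit_logistic(scale([X[i] for i in train], means, stds),
                                [y[i] for i in train])
            test_X = scale([X[i] for i in fold], means, stds)
            preds = [b + sum(wj * xj for wj, xj in zip(w, row)) for row in test_X]
            aucs.append(roc_auc([y[i] for i in fold], preds))
    return sum(aucs) / len(aucs)

# Synthetic data (assumption): 20 actives, 60 decoys, 4 receptor conformations.
# Lower docking score means a more favorable predicted binding.
data_rng = random.Random(42)
y = [1] * 20 + [0] * 60
X = [[data_rng.gauss(-8.5 if label else -7.5, 1.0) for _ in range(4)] for label in y]

# Scores are negated so that "higher value" means "more likely active".
auc_min = roc_auc(y, [-s for s in consensus_min(X)])
auc_mean = roc_auc(y, [-s for s in consensus_mean(X)])
auc_ml = repeated_cv_auc(X, y)
print(f"min-score consensus AUC:  {auc_min:.3f}")
print(f"mean-score consensus AUC: {auc_mean:.3f}")
print(f"logistic regression AUC:  {auc_ml:.3f} (30 x 4-fold CV)")
```

Because the data here are random draws, the printed numbers say nothing about the paper's results; the sketch is meant only to make the evaluation protocol concrete.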
Pages: 5362-5376
Number of pages: 15