Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning

Cited by: 33
Authors
Ricci-Lopez, Joel [1, 2]
Aguila, Sergio A. [2]
Gilson, Michael K. [3]
Brizuela, Carlos A. [1]
Affiliations
[1] Ctr Invest Cient & Educ Super Ensenada CICESE, Ensenada 22860, Baja California, Mexico
[2] Univ Nacl Autonoma Mexico, Ctr Nanociencias & Nanotecnol, Ensenada 22860, Baja California, Mexico
[3] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
Keywords
PROTEIN FLEXIBILITY; RECEPTOR FLEXIBILITY; MOLECULAR DOCKING; SCORING FUNCTIONS; BINDING-AFFINITY; DRUG-DISCOVERY; LIGAND DOCKING; CONFORMATIONS; COMBINATION; IMPROVEMENT;
DOI
10.1021/acs.jcim.1c00511
Chinese Library Classification (CLC) number
R914 [Medicinal chemistry]
Discipline classification code
100701
Abstract
One of the main challenges of structure-based virtual screening (SBVS) is the incorporation of the receptor's flexibility, as its explicit representation in every docking run implies a high computational cost. Therefore, a common alternative to include the receptor's flexibility is the approach known as ensemble docking. Ensemble docking consists of using a set of receptor conformations and performing the docking assays over each of them. However, there is still no agreement on how to combine the ensemble docking results to obtain the final ligand ranking. A common choice is to use consensus strategies to aggregate the ensemble docking scores, but these strategies yield only slight improvement over the single-structure approach. Here, we hypothesize that using machine learning (ML) methodologies over the ensemble docking results could improve the predictive power of SBVS. To test this hypothesis, four proteins were selected as study cases: CDK2, FXa, EGFR, and HSP90. Protein conformational ensembles were built from crystallographic structures, whereas the evaluated compound library comprised up to three benchmarking data sets (DUD, DEKOIS 2.0, and CSAR-2012) and cocrystallized molecules. Ensemble docking results were processed through 30 repetitions of 4-fold cross-validation to train and validate two ML classifiers: logistic regression and gradient boosting trees. Our results indicate that the ML classifiers significantly outperform traditional consensus strategies and even the best-performing case achieved with single-structure docking. We provide statistical evidence that supports the effectiveness of ML to improve the ensemble docking performance.
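The evaluation protocol described in the abstract can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' implementation. Several things here are assumptions: the docking scores are synthetic, all function names are hypothetical, a hand-written logistic regression stands in for the paper's classifiers (gradient boosting trees are omitted for brevity), and ROC-AUC is assumed as the performance metric. Each ligand is represented by one docking score per receptor conformation; consensus strategies collapse that row into a single number, while the classifier learns a weight per conformation.

```python
import math
import random

def roc_auc(labels, scores):
    """ROC-AUC as the fraction of (active, decoy) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def consensus_mean(X):
    """Average score over conformations, negated so that higher means 'more active'."""
    return [-sum(row) / len(row) for row in X]

def consensus_best(X):
    """Best (lowest) score over conformations, negated."""
    return [-min(row) for row in X]

def stratified_folds(labels, k, rng):
    """Split indices into k folds, keeping the active/decoy ratio roughly constant."""
    folds = [[] for _ in range(k)]
    for cls in (0, 1):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for j, i in enumerate(idx):
            folds[j % k].append(i)
    return folds

def fit_logreg(X, y, lr=0.5, epochs=60):
    """Batch gradient descent on the logistic loss; returns a scoring function."""
    m = len(X[0])
    # Standardize with training-set statistics only, to avoid leaking test data.
    mu = [sum(r[j] for r in X) / len(X) for j in range(m)]
    sd = [math.sqrt(sum((r[j] - mu[j]) ** 2 for r in X) / len(X)) or 1.0 for j in range(m)]
    Z = [[(r[j] - mu[j]) / sd[j] for j in range(m)] for r in X]
    w, b = [0.0] * m, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * m, 0.0
        for row, t in zip(Z, y):
            p = 1.0 / (1.0 + math.exp(-(b + sum(wi * xi for wi, xi in zip(w, row)))))
            gb += p - t
            for j in range(m):
                gw[j] += (p - t) * row[j]
        w = [wi - lr * g / len(Z) for wi, g in zip(w, gw)]
        b -= lr * gb / len(Z)
    return lambda r: b + sum(w[j] * (r[j] - mu[j]) / sd[j] for j in range(m))

def repeated_cv_auc(X, y, k=4, repeats=30, seed=0):
    """Mean test-fold AUC over `repeats` reshuffled rounds of stratified k-fold CV."""
    rng = random.Random(seed)
    aucs = []
    for _ in range(repeats):
        for test_idx in stratified_folds(y, k, rng):
            held_out = set(test_idx)
            train_idx = [i for i in range(len(y)) if i not in held_out]
            score = fit_logreg([X[i] for i in train_idx], [y[i] for i in train_idx])
            aucs.append(roc_auc([y[i] for i in test_idx], [score(X[i]) for i in test_idx]))
    return sum(aucs) / len(aucs)

# Synthetic example (assumption): 20 actives, 60 decoys, 3 receptor conformations.
# Actives are given slightly more negative (better) docking scores on average.
rng = random.Random(42)
y = [1] * 20 + [0] * 60
X = [[rng.gauss(-8.5 if label else -7.0, 1.0) for _ in range(3)] for label in y]

print("mean-score consensus AUC:", round(roc_auc(y, consensus_mean(X)), 3))
print("best-score consensus AUC:", round(roc_auc(y, consensus_best(X)), 3))
print("logistic regression, 30 x 4-fold CV AUC:", round(repeated_cv_auc(X, y), 3))
```

The numbers printed for this toy data say nothing about the paper's findings; the sketch only shows how per-conformation scores feed either a consensus rule or a classifier evaluated with repeated stratified cross-validation.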
Pages: 5362-5376
Page count: 15
Related papers
50 in total
  • [21] Improving ensemble docking for drug discovery by machine learning
    Wong, Chung F.
    JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY, 2019, 18 (03)
  • [22] PyPLIF HIPPOS and Receptor Ensemble Docking Increase the Prediction Accuracy of the Structure-Based Virtual Screening Protocol Targeting Acetylcholinesterase
    Istyastono, Enade P.
    Riswanto, Florentinus Dika Octa
    Yuniarti, Nunung
    Prasasty, Vivitri D.
    Mungkasi, Sudi
    MOLECULES, 2022, 27 (17)
  • [23] Improving structure-based virtual screening performance via learning from scoring function components
    Xiong, Guo-Li
    Ye, Wen-Ling
    Shen, Chao
    Lu, Ai-Ping
    Hou, Ting-Jun
    Cao, Dong-Sheng
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [24] Assessment of the Generalization Abilities of Machine-Learning Scoring Functions for Structure-Based Virtual Screening
    Zhu, Hui
    Yang, Jincai
    Huang, Niu
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (22) : 5485 - 5502
  • [25] Comprehensive machine learning boosts structure-based virtual screening for PARP1 inhibitors
    Caba, Klaudia
    Tran-Nguyen, Viet-Khoa
    Rahman, Taufiq
    Ballester, Pedro J.
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01)
  • [26] In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual Screening
    Sieg, Jochen
    Flachsenberg, Florian
    Rarey, Matthias
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) : 947 - 961
  • [27] Comprehensive machine learning boosts structure-based virtual screening for PARP1 inhibitors
    Klaudia Caba
    Viet-Khoa Tran-Nguyen
    Taufiq Rahman
    Pedro J. Ballester
    Journal of Cheminformatics, 16
  • [28] Machine Learning Consensus Scoring Improves Performance Across Targets in Structure-Based Virtual Screening
    Ericksen, Spencer S.
    Wu, Haozhen
    Zhang, Huikun
    Michael, Lauren A.
    Newton, Michael A.
    Hoffmann, F. Michael
    Wildman, Scott A.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (07) : 1579 - 1590
  • [29] Machine learning consensus scoring improves performance across targets in structure-based virtual screening
    Ericksen, Spencer
    Wu, Haozhen
    Zhang, Huikun
    Michael, Lauren
    Newton, Michael
    Hoffmann, F.
    Wildman, Scott
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [30] Structure-based virtual screening and molecular docking for the identification of potential biofilm inhibitors
    Praveen, Kumar
    Archana, Mohanan G.
    RESEARCH JOURNAL OF BIOTECHNOLOGY, 2020, 15 (04) : 107 - 115