Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning

被引：33

作者：

Ricci-Lopez, Joel ^{[1
,2
]}

Aguila, Sergio A. ^{[2
]}

Gilson, Michael K. ^{[3
]}

Brizuela, Carlos A. ^{[1
]}

机构：

[1] Ctr Invest Cient & Educ Super Ensenada CICESE, Ensenada 22860, Baja California, Mexico

[2] Univ Nacl Autonoma Mexico, Ctr Nanociencias & Nanotecnol, Ensenada 22860, Baja California, Mexico

[3] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA

来源：

JOURNAL OF CHEMICAL INFORMATION AND MODELING | 2021年 / 61卷 / 11期

关键词：

PROTEIN FLEXIBILITY; RECEPTOR FLEXIBILITY; MOLECULAR DOCKING; SCORING FUNCTIONS; BINDING-AFFINITY; DRUG-DISCOVERY; LIGAND DOCKING; CONFORMATIONS; COMBINATION; IMPROVEMENT;

D O I：

10.1021/acs.jcim.1c00511

中图分类号：

R914 [药物化学];

学科分类号：

100701 ;

摘要：

One of the main challenges of structure-based virtual screening (SBVS) is the incorporation of the receptor's flexibility, as its explicit representation in every docking run implies a high computational cost. Therefore, a common alternative to include the receptor's flexibility is the approach known as ensemble docking. Ensemble docking consists of using a set of receptor conformations and performing the docking assays over each of them. However, there is still no agreement on how to combine the ensemble docking results to obtain the final ligand ranking. A common choice is to use consensus strategies to aggregate the ensemble docking scores, but these strategies exhibit slight improvement regarding the single-structure approach. Here, we claim that using machine learning (ML) methodologies over the ensemble docking results could improve the predictive power of SBVS. To test this hypothesis, four proteins were selected as study cases: CDK2, FXa, EGFR, and HSP90. Protein conformational ensembles were built from crystallographic structures, whereas the evaluated compound library comprised up to three benchmarking data sets (DUD, DEKOIS 2.0, and CSAR-2012) and cocrystallized molecules. Ensemble docking results were processed through 30 repetitions of 4-fold cross-validation to train and validate two ML classifiers: logistic regression and gradient boosting trees. Our results indicate that the ML classifiers significantly outperform traditional consensus strategies and even the best performance case achieved with single-structure docking. We provide statistical evidence that supports the effectiveness of ML to improve the ensemble docking performance.

引用

页码：5362 / 5376

页数：15

共 118 条

[1] Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening
Ain, Qurrat Ul
Aleksandrova, Antoniya
Roessler, Florian D.
Ballester, Pedro J.
[J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2015, 5 (06) : 405 - 424
[2] ENRI: A tool for selecting structure-based virtual screening target conformations
Akbar, Rahmad
Jusoh, Siti Azma
Amaro, Rommie E.
Helms, Volkhard
[J]. CHEMICAL BIOLOGY & DRUG DESIGN, 2017, 89 (05) : 762 - 771
[3] Discovery of drug-like inhibitors of an essential RNA-editing ligase in Trypanosoma brucei
Amaro, Rommie E.
Schnaufer, Achim
Interthal, Heidrun
Hol, Wim
Stuart, Kenneth D.
McCammon, J. Andrew
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (45) : 17278 - 17283
[4] An improved relaxed complex scheme for receptor flexibility in computer-aided drug design
Amaro, Rommie E.
Baron, Riccardo
McCammon, J. Andrew
[J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2008, 22 (09) : 693 - 705
[5] Ensemble Docking in Drug Discovery
Amaro, Rommie E.
Baudry, Jerome
Chodera, John
Demir, Ozlem
McCammon, J. Andrew
Miao, Yinglong
Smith, Jeremy C.
[J]. BIOPHYSICAL JOURNAL, 2018, 114 (10) : 2271 - 2278
[6] [Anonymous], 2013, ROCKEFELLER U
[7] [Anonymous], 2010, DUD DATASET PARTIAL
[8] [Anonymous], 2015, DEKOIS 2 0 DATASET
[9] [Anonymous], 2013, CSAR 2012 DATASET
[10] Improvement of Virtual Screening Results by Docking Data Feature Analysis
Arciniega, Marcelino
Lange, Oliver F.
[J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (05) : 1401 - 1411

← 1 2 3 4 5 6 7 8 9 10 →