Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking

被引:167
|
作者
Gentile, Francesco [1 ]
Yaacoub, Jean Charle [1 ]
Gleave, James [1 ]
Fernandez, Michael [1 ]
Ton, Anh-Tien [1 ]
Ban, Fuqiang [1 ]
Stern, Abraham [2 ]
Cherkasov, Artem [1 ]
机构
[1] Univ British Columbia, Vancouver Prostate Ctr, Dept Urol Sci, Vancouver, BC, Canada
[2] NVIDIA Corp, Santa Clara, CA USA
关键词
DISCOVERY; GENERATION; VALIDATION; ALGORITHM; ACCURATE; BINDING; ICM;
D O I
10.1038/s41596-021-00659-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the recent explosion of chemical libraries beyond a billion molecules, more efficient virtual screening approaches are needed. The Deep Docking (DD) platform enables up to 100-fold acceleration of structure-based virtual screening by docking only a subset of a chemical library, iteratively synchronized with a ligand-based prediction of the remaining docking scores. This method results in hundreds- to thousands-fold virtual hit enrichment (without significant loss of potential drug candidates) and hence enables the screening of billion molecule-sized chemical libraries without using extraordinary computational resources. Herein, we present and discuss the generalized DD protocol that has been proven successful in various computer-aided drug discovery (CADD) campaigns and can be applied in conjunction with any conventional docking program. The protocol encompasses eight consecutive stages: molecular library preparation, receptor preparation, random sampling of a library, ligand preparation, molecular docking, model training, model inference and the residual docking. The standard DD workflow enables iterative application of stages 3-7 with continuous augmentation of the training set, and the number of such iterations can be adjusted by the user. A predefined recall value allows for control of the percentage of top-scoring molecules that are retained by DD and can be adjusted to control the library size reduction. The procedure takes 1-2 weeks (depending on the available resources) and can be completely automated on computing clusters managed by job schedulers. This open-source protocol, at https://github.com/jamesgleavu/DD _ protocol, can be readily deployed by CADD researchers and can significantly accelerate the effective exploration of ultra-large portions of a chemical space.
引用
收藏
页码:672 / +
页数:28
相关论文
共 9 条
  • [1] Ultra-large chemical libraries for the discovery of high-affinity peptide binders
    Quartararo, Anthony J.
    Gates, Zachary P.
    Somsen, Bente A.
    Hartrampf, Nina
    Ye, Xiyun
    Shimada, Arisa
    Kajihara, Yasuhiro
    Ottmann, Christian
    Pentelute, Bradley L.
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [2] Ultra-Large Virtual Screening: Definition, Recent Advances, and Challenges in Drug Design
    Verissimo, Gabriel Correa
    Ferreira, Rafaela Salgado
    Maltarollo, Vinicius Goncalves
    MOLECULAR INFORMATICS, 2025, 44 (01)
  • [3] VirtualFlow Ants-Ultra-Large Virtual Screenings with Artificial Intelligence Driven Docking Algorithm Based on Ant Colony Optimization
    Gorgulla, Christoph
    Cinaroglu, Suleyman Selim
    Fischer, Patrick D.
    Fackeldey, Konstantin
    Wagner, Gerhard
    Arthanari, Haribabu
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (11)
  • [4] Artificial intelligence-enabled screening for diabetic retinopathy: a real-world, multicenter and prospective study
    Zhang, Yifei
    Shi, Juan
    Peng, Ying
    Zhao, Zhiyun
    Zheng, Qidong
    Wang, Zilong
    Liu, Kun
    Jiao, Shengyin
    Qiu, Kexin
    Zhou, Ziheng
    Yan, Li
    Zhao, Dong
    Jiang, Hongwei
    Dai, Yuancheng
    Su, Benli
    Gu, Pei
    Su, Heng
    Wan, Qin
    Peng, Yongde
    Liu, Jianjun
    Hu, Ling
    Ke, Tingyu
    Chen, Lei
    Xu, Fengmei
    Dong, Qijuan
    Terzopoulos, Demetri
    Ning, Guang
    Xu, Xun
    Ding, Xiaowei
    Wang, Weiqing
    BMJ OPEN DIABETES RESEARCH & CARE, 2020, 8 (01)
  • [5] Structure-Based Virtual Screening of Ultra-Large Library Yields Potent Antagonists for a Lipid GPCR
    Sadybekov, Arman A.
    Brouillette, Rebecca L.
    Marin, Egor
    Sadybekov, Anastasiia V.
    Luginina, Aleksandra
    Gusach, Anastasiia
    Mishin, Alexey
    Besserer-Offroy, Elie
    Longpre, Jean-Michel
    Borshchevskiy, Valentin
    Cherezov, Vadim
    Sarret, Philippe
    Katritch, Vsevolod
    BIOMOLECULES, 2020, 10 (12) : 1 - 15
  • [6] Novel Inhibitors of androgen receptor's DNA binding domain identified using an ultra-large virtual screening
    Radaeva, Mariia
    Morin, Helene
    Pandey, Mohit
    Ban, Fuqiang
    Guo, Maria
    LeBlanc, Eric
    Lallous, Nada
    Cherkasov, Artem
    MOLECULAR INFORMATICS, 2023, 42 (8-9)
  • [7] PharmacoNet: deep learning-guided pharmacophore modeling for ultra-large-scale virtual screening
    Seo, Seonghwan
    Kim, Woo Youn
    CHEMICAL SCIENCE, 2024, 15 (46) : 19473 - 19487
  • [8] Development of Ligand-based Big Data Deep Neural Network Models for Virtual Screening of Large Compound Libraries
    Xiao, Tao
    Qi, Xingxing
    Chen, Yuzong
    Jiang, Yuyang
    MOLECULAR INFORMATICS, 2018, 37 (11)
  • [9] Machine Learning-Boosted Docking Enables the Efficient Structure-Based Virtual Screening of Giga-Scale Enumerated Chemical Libraries
    Sivula, Toni
    Yetukuri, Laxman
    Kalliokoski, Tuomo
    Kasnanen, Heikki
    Poso, Antti
    Pohner, Ina
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (18) : 5773 - 5783