MEMES: Machine learning framework for Enhanced MolEcular Screening

被引:35
作者
Mehta, Sarvesh [1 ]
Laghuvarapu, Siddhartha [1 ]
Pathak, Yashaswi [1 ]
Sethi, Aaftaab [2 ]
Alvala, Mallika [3 ]
Priyakumar, U. Deva [1 ]
机构
[1] Int Inst Informat Technol, Ctr Computat Nat Sci & Bioinformat, Hyderabad 500032, India
[2] Natl Inst Pharmaceut Educ & Res, Dept Med Chem, Hyderabad 500037, Andhra Pradesh, India
[3] Narsee Monjee Inst Management Sci, Sch Pharm & Technol Management, Hyderabad, India
关键词
DRUG DISCOVERY; DESIGN; OPTIMIZATION;
D O I
10.1039/d1sc02783b
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In drug discovery applications, high throughput virtual screening exercises are routinely performed to determine an initial set of candidate molecules referred to as "hits". In such an experiment, each molecule from a large small-molecule drug library is evaluated in terms of physical properties such as the docking score against a target receptor. In real-life drug discovery experiments, drug libraries are extremely large but still there is only a minor representation of the essentially infinite chemical space, and evaluation of physical properties for each molecule in the library is not computationally feasible. In the current study, a novel Machine learning framework for Enhanced MolEcular Screening (MEMES) based on Bayesian optimization is proposed for efficient sampling of the chemical space. The proposed framework is demonstrated to identify 90% of the top-1000 molecules from a molecular library of size about 100 million, while calculating the docking score only for about 6% of the complete library. We believe that such a framework would tremendously help to reduce the computational effort in not only drug-discovery but also areas that require such high-throughput experiments.
引用
收藏
页码:11710 / 11721
页数:12
相关论文
共 59 条
[1]   DeepPocket: Ligand Binding Site Detection and Segmentation using 3D Convolutional Neural Networks [J].
Aggarwal, Rishal ;
Gupta, Akash ;
Chelur, Vineeth ;
Jawahar, C., V ;
Priyakumar, U. Deva .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (21) :5069-5079
[2]  
[Anonymous], 2012, Nucleic Acids Res, DOI DOI 10.1093/NAR/GKR777
[3]  
Bagal V., 2021, LIGGPT MOL GENERATIO, DOI [10.26434/chemrxiv.14561901.v1, DOI 10.26434/CHEMRXIV.14561901.V1]
[4]   Application of Generative Autoencoder in De Novo Molecular Design [J].
Blaschke, Thomas ;
Olivecrona, Marcus ;
Engkvist, Ola ;
Bajorath, Jurgen ;
Chen, Hongming .
MOLECULAR INFORMATICS, 2018, 37 (1-2)
[5]   970 Million Druglike Small Molecules for Virtual Screening in the Chemical Universe Database GDB-13 [J].
Blum, Lorenz C. ;
Reymond, Jean-Louis .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2009, 131 (25) :8732-+
[6]  
BUI T, 2016, INT C MACH LEARN, P1472
[7]   Structure-Based Virtual Screening for Drug Discovery: a Problem-Centric Review [J].
Cheng, Tiejun ;
Li, Qingliang ;
Zhou, Zhigang ;
Wang, Yanli ;
Bryant, Stephen H. .
AAPS JOURNAL, 2012, 14 (01) :133-141
[8]  
Cieplinski T., 2020, ARXIV PREPRINT ARXIV
[9]  
Dai H., 2018, P INT C LEARN REPR
[10]  
Dai Z., 2015, Comput. Sci.