Molecule generation toward target protein (SARS-CoV-2) using reinforcement learning-based graph neural network via knowledge graph

被引:0
作者
Amit Ranjan
Hritik Kumar
Deepshikha Kumari
Archit Anand
Rajiv Misra
机构
[1] Indian Institute of Technology Patna,Department of Computer Science and Engineering
[2] Indian Institute of Technology Patna,Department of Electrical Engineering
[3] Indian Institute of Technology Patna,Department of Metallurgical and Materials Engineering
[4] Indian Institute of Technology Patna,undefined
来源
Network Modeling Analysis in Health Informatics and Bioinformatics | / 12卷
关键词
Binding affinity prediction; Graph neural network; Knowledge graph; Molecule generation; Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
AI-driven approaches are widely used in drug discovery, where candidate molecules are generated and tested on a target protein for binding affinity prediction. However, generating new compounds with desirable molecular properties such as Quantitative Estimate of Drug-likeness (QED) and Dopamine Receptor D2 activity (DRD2) while adhering to distinct chemical laws is challenging. To address these challenges, we proposed a graph-based deep learning framework to generate potential therapeutic drugs targeting the SARS-CoV-2 protein. Our proposed framework consists of two modules: a novel reinforcement learning (RL)-based graph generative module with knowledge graph (KG) and a graph early fusion approach (GEFA) for binding affinity prediction. The first module uses a gated graph neural network (GGNN) model under the RL environment for generating novel molecular compounds with desired properties and a custom-made KG for molecule screening. The second module uses GEFA to predict binding affinity scores between the generated compounds and target proteins. Experiments show how fine-tuning the GGNN model under the RL environment enhances the molecules with desired properties to generate 100%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$100\%$$\end{document} valid and 100%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$100\%$$\end{document} unique compounds using different scoring functions. Additionally, KG-based screening reduces the search space of generated candidate molecules by 96.64%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$96.64\%$$\end{document} while retaining 95.38%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$95.38\%$$\end{document} of promising binding molecules against SARS-CoV-2 protein, i.e., 3C-like protease (3CLpro). We achieved a binding affinity score of 8.185 from the top rank of generated compound. In addition, we compared top-ranked generated compounds to Indinavir on different parameters, including drug-likeness and medicinal chemistry, for qualitative analysis from a drug development perspective.
引用
收藏
相关论文
共 105 条
[1]  
Arús-Pous J(2019)Exploring the gdb-13 chemical space using deep generative models J Cheminf 11 1-14
[2]  
Blaschke T(2014)ilogp: a simple, robust, and efficient description of n-octanol/water partition coefficient for drug design using the gb/sa approach J Chem Inf Model 54 3284-3301
[3]  
Ulander S(2011)Comprehensive analysis of kinase inhibitor selectivity Nat Biotechnol 29 1046-1051
[4]  
Reymond JL(2000)Drug discovery: a historical perspective Science 287 1960-1964
[5]  
Chen H(2020)Coronavirus puts drug repurposing on the fast track Nat Biotechnol 38 379-381
[6]  
Engkvist O(2020)Drug-target affinity prediction using graph neural network and contact maps RSC Adv 10 20701-20712
[7]  
Daina A(2018)Multi-objective de novo drug design with conditional graph generative model J Cheminf 10 1-24
[8]  
Michielin O(2020)Drug treatment options for the 2019-new coronavirus (2019-ncov) Biosci Trends 14 69-71
[9]  
Zoete V(2018)Overview of the detection methods for equilibrium dissociation constant kd of drug-receptor interaction J Pharmac Anal 8 147-152
[10]  
Davis MI(2021)Graph networks for molecular design Mach Learn: Sci Technol 2 025023-533