Molecule generation toward target protein (SARS-CoV-2) using reinforcement learning-based graph neural network via knowledge graph

被引:4
作者
Ranjan, Amit [1 ,4 ]
Kumar, Hritik [2 ,4 ]
Kumari, Deepshikha [1 ,4 ]
Anand, Archit [3 ,4 ]
Misra, Rajiv [1 ,4 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Bihta 801106, Bihar, India
[2] Indian Inst Technol Patna, Dept Elect Engn, Bihta 801106, Bihar, India
[3] Indian Inst Technol Patna, Dept Met & Mat Engn, Bihta 801106, Bihar, India
[4] Indian Inst Technol Patna, Bihta 801106, Bihar, India
来源
NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS | 2023年 / 12卷 / 01期
关键词
Binding affinity prediction; Graph neural network; Knowledge graph; Molecule generation; Reinforcement learning; DRUG; PREDICTION;
D O I
10.1007/s13721-023-00409-2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
AI-driven approaches are widely used in drug discovery, where candidate molecules are generated and tested on a target protein for binding affinity prediction. However, generating new compounds with desirable molecular properties such as Quantitative Estimate of Drug-likeness (QED) and Dopamine Receptor D2 activity (DRD2) while adhering to distinct chemical laws is challenging. To address these challenges, we proposed a graph-based deep learning framework to generate potential therapeutic drugs targeting the SARS-CoV-2 protein. Our proposed framework consists of two modules: a novel reinforcement learning (RL)-based graph generative module with knowledge graph (KG) and a graph early fusion approach (GEFA) for binding affinity prediction. The first module uses a gated graph neural network (GGNN) model under the RL environment for generating novel molecular compounds with desired properties and a custom-made KG for molecule screening. The second module uses GEFA to predict binding affinity scores between the generated compounds and target proteins. Experiments show how fine-tuning the GGNN model under the RL environment enhances the molecules with desired properties to generate 100% valid and 100% unique compounds using different scoring functions. Additionally, KG-based screening reduces the search space of generated candidate molecules by 96.64% while retaining 95.38% of promising binding molecules against SARS-CoV-2 protein, i.e., 3C-like protease (3CLpro). We achieved a binding affinity score of 8.185 from the top rank of generated compound. In addition, we compared top-ranked generated compounds to Indinavir on different parameters, including drug-likeness and medicinal chemistry, for qualitative analysis from a drug development perspective.
引用
收藏
页数:13
相关论文
共 35 条
[1]   Exploring the GDB-13 chemical space using deep generative models [J].
Arus-Pous, Josep ;
Blaschke, Thomas ;
Ulander, Silas ;
Reymond, Jean-Louis ;
Chen, Hongming ;
Engkvist, Ola .
JOURNAL OF CHEMINFORMATICS, 2019, 11 (1)
[2]  
Chen Yu Wai, 2020, F1000Res, V9, P129, DOI 10.12688/f1000research.22457.1
[3]   iLOGP: A Simple, Robust, and Efficient Description of n-Octanol/Water Partition Coefficient for Drug Design Using the GB/SA Approach [J].
Daina, Antoine ;
Michielin, Olivier ;
Zoete, Vincent .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (12) :3284-3301
[4]  
Dalke A, 2014, MACCS KEY 44
[5]   Comprehensive analysis of kinase inhibitor selectivity [J].
Davis, Mindy I. ;
Hunt, Jeremy P. ;
Herrgard, Sanna ;
Ciceri, Pietro ;
Wodicka, Lisa M. ;
Pallares, Gabriel ;
Hocker, Michael ;
Treiber, Daniel K. ;
Zarrinkar, Patrick P. .
NATURE BIOTECHNOLOGY, 2011, 29 (11) :1046-U124
[6]   Drug discovery: A historical perspective [J].
Drews, J .
SCIENCE, 2000, 287 (5460) :1960-1964
[7]  
Gilmer J, 2017, PR MACH LEARN RES, V70
[8]  
Guimaraes GL, 2018, Arxiv, DOI arXiv:1705.10843
[9]   Coronavirus puts drug repurposing on the fast track [J].
Harrison, Charlotte .
NATURE BIOTECHNOLOGY, 2020, 38 (04) :379-381
[10]   Drug-target affinity prediction using graph neural network and contact maps [J].
Jiang, Mingjian ;
Li, Zhen ;
Zhang, Shugang ;
Wang, Shuang ;
Wang, Xiaofeng ;
Yuan, Qing ;
Wei, Zhiqiang .
RSC ADVANCES, 2020, 10 (35) :20701-20712