MTMol-GPT: De novo multi-target molecular generation with transformer-based generative adversarial imitation learning

被引:29
作者
Ai, Chengwei [1 ]
Yang, Hongpeng [2 ]
Liu, Xiaoyi [2 ]
Dong, Ruihan [3 ]
Ding, Yijie [4 ]
Guo, Fei [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC USA
[3] Peking Univ, Acad Adv Interdisciplinary Studies, Chinese Acad Sci, Beijing, Peoples R China
[4] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
AGONISTS; DESIGN;
D O I
10.1371/journal.pcbi.1012229
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
De novo drug design is crucial in advancing drug discovery, which aims to generate new drugs with specific pharmacological properties. Recently, deep generative models have achieved inspiring progress in generating drug-like compounds. However, the models prioritize a single target drug generation for pharmacological intervention, neglecting the complicated inherent mechanisms of diseases, and influenced by multiple factors. Consequently, developing novel multi-target drugs that simultaneously target specific targets can enhance anti-tumor efficacy and address issues related to resistance mechanisms. To address this issue and inspired by Generative Pre-trained Transformers (GPT) models, we propose an upgraded GPT model with generative adversarial imitation learning for multi-target molecular generation called MTMol-GPT. The multi-target molecular generator employs a dual discriminator model using the Inverse Reinforcement Learning (IRL) method for a concurrently multi-target molecular generation. Extensive results show that MTMol-GPT generates various valid, novel, and effective multi-target molecules for various complex diseases, demonstrating robustness and generalization capability. In addition, molecular docking and pharmacophore mapping experiments demonstrate the drug-likeness properties and effectiveness of generated molecules potentially improve neuropsychiatric interventions. Furthermore, our model's generalizability is exemplified by a case study focusing on the multi-targeted drug design for breast cancer. As a broadly applicable solution for multiple targets, MTMol-GPT provides new insight into future directions to enhance potential complex disease therapeutics by generating high-quality multi-target molecules in drug discovery. Advancing drug discovery with the vast scale of possible structures of drug-like compounds (between 1023 and 1060) relies on Artificial Intelligence (AI). The process of generating SMILES has been accomplished by using language models like the Generative Pre-trained Transformer (GPT) model. In the meantime, regulating multiple targets to achieve the desired physiological responses is booming for treating complicated diseases. However, the recent deep generative models focus on single-target drug generation, and single-target drugs have limitations, such as requiring higher doses and causing mutations. For example, activation of Src kinase in certain breast cancer cells leads to EGFR phosphorylation and downstream effects. Consequently, designing multi-target molecules against multi-target simultaneously is highly required. Thus, we have proposed an upgraded GPT model MTMol-GPT with generative adversarial imitation learning for generating high-quality multiple targets drug-like compounds. For example, MTMol-GPT generates novel drugs targeting protein pairs such as DRD2 and HTR1A, EGFR and Src. As a broadly applicable tool, MTMol-GPT provides a rapid and accurate method for advancing drug discovery, ensuring the efficient generation of high-quality multi-target drugs.
引用
收藏
页数:23
相关论文
共 65 条
[1]   PLIP 2021: expanding the scope of the protein-ligand interaction profiler to DNA and RNA [J].
Adasme, Melissa F. ;
Linnemann, Katja L. ;
Bolz, Sarah Naomi ;
Kaiser, Florian ;
Salentin, Sebastian ;
Haupt, V. Joachim ;
Schroeder, Michael .
NUCLEIC ACIDS RESEARCH, 2021, 49 (W1) :W530-W534
[2]   Deep inverse reinforcement learning for structural evolution of small molecules [J].
Agyemang, Brighter ;
Wu, Wei-Ping ;
Addo, Daniel ;
Kpiebaareh, Michael Y. ;
Nanor, Ebenezer ;
Haruna, Charles Roland .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
[3]  
[Anonymous], 2006, RDKit: Open -source cheminformatics'
[4]   Exploring the GDB-13 chemical space using deep generative models [J].
Arus-Pous, Josep ;
Blaschke, Thomas ;
Ulander, Silas ;
Reymond, Jean-Louis ;
Chen, Hongming ;
Engkvist, Ola .
JOURNAL OF CHEMINFORMATICS, 2019, 11 (1)
[5]   Structure-based Virtual Screening Approaches in Kinase-directed Drug Discovery [J].
Bajusz, David ;
Ferenczy, Gyorgy G. ;
Keseru, Gyorgy M. .
CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2017, 17 (20) :2235-2259
[6]   The properties of known drugs .1. Molecular frameworks [J].
Bemis, GW ;
Murcko, MA .
JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) :2887-2893
[7]  
Bengio E, 2021, ADV NEUR IN, V34
[8]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[9]  
Chen Jie, 2022, IEEE Transactions on Cybernetics
[10]  
Chen XC, 2021, Arxiv, DOI arXiv:2109.03540