Generative chemistry: drug discovery with deep learning generative models

被引:92
作者
Bian, Yuemin [1 ,2 ,3 ]
Xie, Xiang-Qun [1 ,2 ,3 ,4 ,5 ,6 ]
机构
[1] Univ Pittsburgh, Dept Pharmaceut Sci, Sch Pharm, Pittsburgh, PA 15261 USA
[2] Univ Pittsburgh, Computat Chem Genom Screening Ctr, Sch Pharm, Pittsburgh, PA 15261 USA
[3] Univ Pittsburgh, NIH, Natl Ctr Excellence Computat Drug Abuse Res, Pittsburgh, PA 15261 USA
[4] Univ Pittsburgh, Drug Discovery Inst, 335 Sutherland Dr,206 Salk Pavilion, Pittsburgh, PA 15261 USA
[5] Univ Pittsburgh, Sch Med, Dept Computat Biol, Pittsburgh, PA 15261 USA
[6] Univ Pittsburgh, Sch Med, Dept Struct Biol, Pittsburgh, PA 15261 USA
关键词
Drug discovery; Deep learning; Generative model; Recurrent neural network; Variational autoencoder; Adversarial autoencoder; Generative adversarial network; MOLECULAR DESIGN; DATABASE; SYSTEMS; REPRESENTATION; DESCRIPTORS; COMPLEXITY; CHALLENGES; LANGUAGE; SMILES;
D O I
10.1007/s00894-021-04674-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The de novo design of molecular structures using deep learning generative models introduces an encouraging solution to drug discovery in the face of the continuously increased cost of new drug development. From the generation of original texts, images, and videos, to the scratching of novel molecular structures the creativity of deep learning generative models exhibits the height machine intelligence can achieve. The purpose of this paper is to review the latest advances in generative chemistry which relies on generative modeling to expedite the drug discovery process. This review starts with a brief history of artificial intelligence in drug discovery to outline this emerging paradigm. Commonly used chemical databases, molecular representations, and tools in cheminformatics and machine learning are covered as the infrastructure for generative chemistry. The detailed discussions on utilizing cutting-edge generative architectures, including recurrent neural network, variational autoencoder, adversarial autoencoder, and generative adversarial network for compound generation are focused. Challenges and future perspectives follow.
引用
收藏
页数:18
相关论文
共 147 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Al-Rfou R, 2016, ARXIV1605
[3]   Unified rational protein engineering with sequence-based deep representation learning [J].
Alley, Ethan C. ;
Khimulya, Grigory ;
Biswas, Surojit ;
AlQuraishi, Mohammed ;
Church, George M. .
NATURE METHODS, 2019, 16 (12) :1315-+
[4]  
Ambure P., 2014, Methods in Pharmacology and Toxicology
[5]  
Methods in pharmacology and toxicology, P257, DOI [10.1007/7653_2014_35, DOI 10.1007/7653_2014_35]
[6]  
[Anonymous], 2016, P ICLR
[7]  
[Anonymous], 2014, EMPIRICAL EVALUATION
[8]  
Arabie P., 2006, Studies in classification, data analysis, and knowledge organization
[9]  
Aspuru-Guzik A, 2017, ARXIV PREPRINT ARXIV
[10]  
Bacilieri Magdalena, 2006, Curr Drug Discov Technol, V3, P155, DOI 10.2174/157016306780136781