Integrating QSAR modelling and deep learning in drug discovery: the emergence of deep QSAR

被引:127
作者
Tropsha, Alexander [1 ]
Isayev, Olexandr [2 ]
Varnek, Alexandre [3 ]
Schneider, Gisbert [4 ]
Cherkasov, Artem [5 ,6 ]
机构
[1] Univ North Carolina, Chapel Hill, NC 27599 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] Univ Strasbourg, Strasbourg, France
[4] ETH, Zurich, Switzerland
[5] Univ British Columbia, Vancouver, BC, Canada
[6] Photonic Inc, Coquitlam, BC, Canada
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
ARTIFICIAL-INTELLIGENCE; CHEMICAL-REACTIONS; MOLECULAR DESIGN; NEURAL-NETWORKS; PREDICTION; CHEMOGRAPHY; DOCKING; REPRESENTATION; GENERATION; LIBRARIES;
D O I
10.1038/s41573-023-00832-0
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Quantitative structure-activity relationship (QSAR) modelling, an approach that was introduced 60 years ago, is widely used in computer-aided drug design. In recent years, progress in artificial intelligence techniques, such as deep learning, the rapid growth of databases of molecules for virtual screening and dramatic improvements in computational power have supported the emergence of a new field of QSAR applications that we term 'deep QSAR'. Marking a decade from the pioneering applications of deep QSAR to tasks involved in small-molecule drug discovery, we herein describe key advances in the field, including deep generative and reinforcement learning approaches in molecular design, deep learning models for synthetic planning and the application of deep QSAR models in structure-based virtual screening. We also reflect on the emergence of quantum computing, which promises to further accelerate deep QSAR applications and the need for open-source and democratized resources to support computer-aided drug design. Advances with deep learning, the growth of databases of molecules for virtual screening and improvements in computational power have supported the emergence of a new field of quantitative structure-activity relationship (QSAR) modelling applications that Tropsha et al. term 'deep QSAR'. This article discusses key advances in the field, including deep generative and reinforcement learning approaches in molecular design, deep learning models for synthetic planning, and the use of deep QSAR models in structure-based virtual screening.
引用
收藏
页码:141 / 155
页数:15
相关论文
共 163 条
[1]   The rise of self-driving labs in chemical and materials sciences [J].
Abolhasani, Milad ;
Kumacheva, Eugenia .
NATURE SYNTHESIS, 2023, 2 (06) :483-492
[2]   Prediction of Optimal Conditions of Hydrogenation Reaction Using the Likelihood Ranking Approach [J].
Afonina, Valentina A. ;
Mazitov, Daniyar A. ;
Nurmukhametova, Albina ;
Shevelev, Maxim D. ;
Khasanova, Dina A. ;
Nugmanov, Ramil I. ;
Burilov, Vladimir A. ;
Madzhidov, Timur I. ;
Varnek, Alexandre .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (01)
[3]   Efficient iterative virtual screening with Apache Spark and conformal prediction [J].
Ahmed, Laeeq ;
Georgiev, Valentin ;
Capuccini, Marco ;
Toor, Salman ;
Schaal, Wesley ;
Laure, Erwin ;
Spjuth, Ola .
JOURNAL OF CHEMINFORMATICS, 2018, 10
[4]   Roughness of Molecular Property Landscapes and Its Impact on Modellability [J].
Aldeghi, Matteo ;
Graff, David E. ;
Frey, Nathan ;
Morrone, Joseph A. ;
Pyzer-Knapp, Edward O. ;
Jordan, Kirk E. ;
Coley, Connor W. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (19) :4660-4671
[5]   Curated Data In - Trustworthy In Silico Models Out: The Impact of Data Quality on the Reliability of Artificial Intelligence Models as Alternatives to Animal Testing [J].
Alves, Vinicius M. ;
Auerbach, Scott S. ;
Kleinstreuer, Nicole ;
Rooney, John P. ;
Muratov, Eugene N. ;
Rusyn, Ivan ;
Tropsha, Alexander ;
Schmitt, Charles .
ATLA-ALTERNATIVES TO LABORATORY ANIMALS, 2021, 49 (03) :73-82
[6]   Rapid Identification of Potential Inhibitors of SARS-CoV-2 Main Protease by Deep Docking of 1.3 Billion Compounds [J].
Anh-Tien Ton ;
Gentile, Francesco ;
Hsing, Michael ;
Ban, Fuqiang ;
Cherkasov, Artem .
MOLECULAR INFORMATICS, 2020, 39 (08)
[7]   Geometric deep learning on molecular representations [J].
Atz, Kenneth ;
Grisoni, Francesca ;
Schneider, Gisbert .
NATURE MACHINE INTELLIGENCE, 2021, 3 (12) :1023-1032
[8]  
Babuji Y., 2020, PREPRINT
[9]   Best Practices of Computer-Aided Drug Discovery: Lessons Learned from the Development of a Preclinical Candidate for Prostate Cancer with a New Mechanism of Action [J].
Ban, Fuqiang ;
Dalal, Kush ;
Li, Huifang ;
LeBlanc, Eric ;
Rennie, Paul S. ;
Cherkasov, Artem .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (05) :1018-1028
[10]   Very fast prediction and rationalization of pKa values for protein-ligand complexes [J].
Bas, Delphine C. ;
Rogers, David M. ;
Jensen, Jan H. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 73 (03) :765-783