Machine learning in chemoinformatics and drug discovery

被引:614
作者
Lo, Yu-Chen [1 ]
Rensi, Stefano E. [1 ]
Torng, Wen [1 ]
Altman, Russ B. [1 ]
机构
[1] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
关键词
ARTIFICIAL NEURAL-NETWORKS; COMPOUND CLASSIFICATION; MOLECULAR SIMILARITY; LINEAR-REGRESSION; RANDOM FOREST; QSAR; PREDICTION; MODEL; DESCRIPTORS; SEARCH;
D O I
10.1016/j.drudis.2018.05.010
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Chemoinformatics is an established discipline focusing on extracting, processing and extrapolating meaningful data from chemical structures. With the rapid explosion of chemical 'big' data from HTS and combinatorial synthesis, machine learning has become an indispensable tool for drug designers to mine chemical information from large compound databases to design drugs with important biological properties. To process the chemical data, we first reviewed multiple processing layers in the chemoinformatics pipeline followed by the introduction of commonly used machine learning models in drug discovery and QSAR analysis. Here, we present basic principles and recent case studies to demonstrate the utility of machine learning techniques in chemoinformatics analyses; and we discuss limitations and future directions to guide further development in this evolving field.
引用
收藏
页码:1538 / 1546
页数:9
相关论文
共 105 条
[11]  
[Anonymous], 2014, INT C LEARN REPR
[12]  
[Anonymous], 1990, Advances in neural information processing systems
[13]  
[Anonymous], 2017, GENERATING FOCUSED M
[14]  
[Anonymous], GENERALIZATION CONVO
[15]  
[Anonymous], 2017, P INT C MACH LEARN
[16]   Characterizing the Chemical Space of ERK2 Kinase Inhibitors Using Descriptors Computed from Molecular Dynamics Trajectories [J].
Ash, Jeremy ;
Fourches, Denis .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (06) :1286-1299
[17]   Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening [J].
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :233-245
[18]  
Bajorath J, 2017, METHODS MOL BIOL, V1526, P231, DOI 10.1007/978-1-4939-6613-4_13
[19]   A renaissance of neural networks in drug discovery [J].
Baskin, Igor I. ;
Winkler, David ;
Tetko, Igor V. .
EXPERT OPINION ON DRUG DISCOVERY, 2016, 11 (08) :785-795
[20]   The continuous molecular fields approach to building 3D-QSAR models [J].
Baskin, Igor I. ;
Zhokhova, Nelly I. .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2013, 27 (05) :427-442