Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery

被引:113
作者
Rodriguez-Perez, Raquel [1 ,2 ]
Bajorath, Juergen [1 ,2 ]
机构
[1] Rheinische Friedrich Wilhelms Univ, Dept Life Sci Informat, LIMES Program Unit Chem Biol & Med Chem, B IT, Friedrich Hirzebruch Allee 6, D-53115 Bonn, Germany
[2] Novartis Inst Biomed Res, Novartis Campus, CH-4002 Basel, Switzerland
关键词
Support vector machines; Machine learning; Compound classification; Property prediction; Regression; ACTIVITY CLIFFS; PREDICTION; CLASSIFICATION; REPRESENTATIONS; INFORMATION; INHIBITORS;
D O I
10.1007/s10822-022-00442-9
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The support vector machine (SVM) algorithm is one of the most widely used machine learning (ML) methods for predicting active compounds and molecular properties. In chemoinformatics and drug discovery, SVM has been a state-of-the-art ML approach for more than a decade. A unique attribute of SVM is that it operates in feature spaces of increasing dimensionality. Hence, SVM conceptually departs from the paradigm of low dimensionality that applies to many other methods for chemical space navigation. The SVM approach is applicable to compound classification, and ranking, multi-class predictions, and -in algorithmically modified form- regression modeling. In the emerging era of deep learning (DL), SVM retains its relevance as one of the premier ML methods in chemoinformatics, for reasons discussed herein. We describe the SVM methodology including strengths and weaknesses and discuss selected applications that have contributed to the evolution of SVM as a premier approach for compound classification, property predictions, and virtual compound screening.
引用
收藏
页码:355 / 362
页数:8
相关论文
共 47 条
[1]  
[Anonymous], 1979, Estimation of Dependences Based on Empirical Data
[2]   Visualization and Interpretation of Support Vector Machine Activity Predictions [J].
Balfer, Jenny ;
Bajorath, Juergen .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (06) :1136-1147
[3]   Systematic Artifacts in Support Vector Regression-Based Compound Potency Prediction Revealed by Statistical and Activity Landscape Analysis [J].
Balfer, Jenny ;
Bajorath, Juergen .
PLOS ONE, 2015, 10 (03)
[4]   Modeling of Compound Profiling Experiments Using Support Vector Machines [J].
Balfer, Jenny ;
Heikamp, Kathrin ;
Laufer, Stefan ;
Bajorath, Juergen .
CHEMICAL BIOLOGY & DRUG DESIGN, 2014, 84 (01) :75-85
[5]   Rule extraction from support vector machines A review [J].
Barakat, Nahla ;
Bradley, Andrew P. .
NEUROCOMPUTING, 2010, 74 (1-3) :178-190
[6]   A renaissance of neural networks in drug discovery [J].
Baskin, Igor I. ;
Winkler, David ;
Tetko, Igor V. .
EXPERT OPINION ON DRUG DISCOVERY, 2016, 11 (08) :785-795
[7]  
Bishop C., 2006, Pattern Recognition and Machine Learning
[8]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[9]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[10]   The rise of deep learning in drug discovery [J].
Chen, Hongming ;
Engkvist, Ola ;
Wang, Yinhai ;
Olivecrona, Marcus ;
Blaschke, Thomas .
DRUG DISCOVERY TODAY, 2018, 23 (06) :1241-1250