Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery

被引:86
作者
Rodriguez-Perez, Raquel [1 ,2 ]
Bajorath, Juergen [1 ,2 ]
机构
[1] Rheinische Friedrich Wilhelms Univ, Dept Life Sci Informat, LIMES Program Unit Chem Biol & Med Chem, B IT, Friedrich Hirzebruch Allee 6, D-53115 Bonn, Germany
[2] Novartis Inst Biomed Res, Novartis Campus, CH-4002 Basel, Switzerland
关键词
Support vector machines; Machine learning; Compound classification; Property prediction; Regression; ACTIVITY CLIFFS; PREDICTION; CLASSIFICATION; REPRESENTATIONS; INFORMATION; INHIBITORS;
D O I
10.1007/s10822-022-00442-9
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The support vector machine (SVM) algorithm is one of the most widely used machine learning (ML) methods for predicting active compounds and molecular properties. In chemoinformatics and drug discovery, SVM has been a state-of-the-art ML approach for more than a decade. A unique attribute of SVM is that it operates in feature spaces of increasing dimensionality. Hence, SVM conceptually departs from the paradigm of low dimensionality that applies to many other methods for chemical space navigation. The SVM approach is applicable to compound classification, and ranking, multi-class predictions, and -in algorithmically modified form- regression modeling. In the emerging era of deep learning (DL), SVM retains its relevance as one of the premier ML methods in chemoinformatics, for reasons discussed herein. We describe the SVM methodology including strengths and weaknesses and discuss selected applications that have contributed to the evolution of SVM as a premier approach for compound classification, property predictions, and virtual compound screening.
引用
收藏
页码:355 / 362
页数:8
相关论文
共 47 条
  • [1] Visualization and Interpretation of Support Vector Machine Activity Predictions
    Balfer, Jenny
    Bajorath, Juergen
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (06) : 1136 - 1147
  • [2] Systematic Artifacts in Support Vector Regression-Based Compound Potency Prediction Revealed by Statistical and Activity Landscape Analysis
    Balfer, Jenny
    Bajorath, Juergen
    [J]. PLOS ONE, 2015, 10 (03):
  • [3] Modeling of Compound Profiling Experiments Using Support Vector Machines
    Balfer, Jenny
    Heikamp, Kathrin
    Laufer, Stefan
    Bajorath, Juergen
    [J]. CHEMICAL BIOLOGY & DRUG DESIGN, 2014, 84 (01) : 75 - 85
  • [4] Rule extraction from support vector machines A review
    Barakat, Nahla
    Bradley, Andrew P.
    [J]. NEUROCOMPUTING, 2010, 74 (1-3) : 178 - 190
  • [5] A renaissance of neural networks in drug discovery
    Baskin, Igor I.
    Winkler, David
    Tetko, Igor V.
    [J]. EXPERT OPINION ON DRUG DISCOVERY, 2016, 11 (08) : 785 - 795
  • [6] Bishop C. M., 2006, PATTERN RECOGN
  • [7] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [8] Drug design by machine learning: support vector machines for pharmaceutical data analysis
    Burbidge, R
    Trotter, M
    Buxton, B
    Holden, S
    [J]. COMPUTERS & CHEMISTRY, 2001, 26 (01): : 5 - 14
  • [9] The rise of deep learning in drug discovery
    Chen, Hongming
    Engkvist, Ola
    Wang, Yinhai
    Olivecrona, Marcus
    Blaschke, Thomas
    [J]. DRUG DISCOVERY TODAY, 2018, 23 (06) : 1241 - 1250
  • [10] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297