Prediction of Compounds with Closely Related Activity Profiles Using Weighted Support Vector Machine Linear Combinations

被引:19
作者
Heikamp, Kathrin [1 ]
Bajorath, Juergen [1 ]
机构
[1] Univ Bonn, B IT, Dept Life Sci Informat, LIMES Program Unit Chem Biol & Med Chem, D-53113 Bonn, Germany
关键词
CHEMOGENOMICS; TARGET;
D O I
10.1021/ci400090t
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Using support vector machine (SVM) ranking, a complex multi-class prediction task has been investigated involving sets of compounds that were active against related targets and represented all possible combinations of single-, dual-, and triple-target activities. Standard SVM models were not capable of differentiating compounds with overlapping yet distinct activity profiles. To address this problem, we designed differentially weighted SVM linear combinations that were found to preferentially detect compounds with desired activity profiles and deprioritize others. Hence, combining independently derived SVM models using negative and positive linear weighting factors balanced relative contributions from individual reference sets and successfully distinguished between compounds with overlapping activity profiles.
引用
收藏
页码:791 / 801
页数:11
相关论文
共 26 条
[1]  
[Anonymous], MOL OP ENV MOE
[2]   Molecular similarity searching using atom environments, information-based feature selection, and a naive Bayesian classifier [J].
Bender, A ;
Mussa, HY ;
Glen, RC ;
Reiling, S .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :170-178
[3]   Virtual screen for ligands of orphan G protein-coupled receptors [J].
Bock, JR ;
Gough, DA .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (05) :1402-1414
[4]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[5]   Chemogenomics: An emerging strategy for rapid target and drug discovery [J].
Bredel, M ;
Jacoby, E .
NATURE REVIEWS GENETICS, 2004, 5 (04) :262-275
[6]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]  
Duda R O., 2000, Pattern Classification, V2nd, P20
[9]   Support-vector-machine-based ranking significantly improves the effectiveness of similarity searching using 2D fingerprints and multiple reference compounds [J].
Geppert, Hanna ;
Horvath, Tamds ;
Gaertner, Thomas ;
Wrobel, Stefan ;
Bajorath, Juergen .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (04) :742-746
[10]   Ligand Prediction from Protein Sequence and Small Molecule Information Using Support Vector Machines and Fingerprint Descriptors [J].
Geppert, Hanna ;
Humrich, Jens ;
Stumpfe, Dagmar ;
Gaertner, Thomas ;
Bajorath, Juergen .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (04) :767-779