On the parameter optimization of Support Vector Machines for binary classification

被引:55
作者
Gaspar, Paulo [1 ]
Carbonell, Jaime [2 ]
Luis Oliveira, Jose [1 ]
机构
[1] Univ Aveiro, DETI IEETA, Campus Univ Santiago, P-3810193 Aveiro, Portugal
[2] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
D O I
10.2390/biecoll-jib-2012-201
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Classifying biological data is a common task in the biomedical context. Predicting the class of new, unknown information allows researchers to gain insight and make decisions based on the available data. Also, using classification methods often implies choosing the best parameters to obtain optimal class separation, and the number of parameters might be large in biological datasets. Support Vector Machines provide a well-established and powerful classification method to analyse data and find the minimal-risk separation between different classes. Finding that separation strongly depends on the available feature set and the tuning of hyper-parameters. Techniques for feature selection and SVM parameters optimization are known to improve classification accuracy, and its literature is extensive. In this paper we review the strategies that are used to improve the classification performance of SVMs and perform our own experimentation to study the influence of features and hyper-parameters in the optimization process, using several known kernels.
引用
收藏
页数:11
相关论文
共 28 条
  • [1] Support vector machines combined with feature selection for breast cancer diagnosis
    Akay, Mehmet Fatih
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3240 - 3247
  • [2] Applying support vector machines to imbalanced datasets
    Akbani, R
    Kwek, S
    Japkowicz, N
    [J]. MACHINE LEARNING: ECML 2004, PROCEEDINGS, 2004, 3201 : 39 - 50
  • [3] Ali A., 2002, P 2 INT C HYBR INT S, P321
  • [4] Automatic parameter selection for polynomial kernel
    Ali, S
    Smith, KA
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2003, : 243 - 249
  • [5] On-line handwriting recognition with support vector machines - A kernel approach
    Bahlmann, C
    Haasdonk, B
    Burkhardt, H
    [J]. EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS, 2002, : 49 - 54
  • [6] Basak J., 2008, PROC 19 INT C PATTER, P1, DOI [10.1109/ICPR.2008.4761475, DOI 10.1109/ICPR.2008.4761475]
  • [7] Conditionally positive definite kernels for SVM based image recognition
    Boughorbel, S
    Tarel, JP
    Boujemaa, N
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 113 - 116
  • [8] Support vector machines experts for time series forecasting
    Cao, LJ
    [J]. NEUROCOMPUTING, 2003, 51 : 321 - 339
  • [9] Choosing multiple parameters for support vector machines
    Chapelle, O
    Vapnik, V
    Bousquet, O
    Mukherjee, S
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 131 - 159
  • [10] Chen YW, 2006, STUD FUZZ SOFT COMP, V207, P315