An integrated approach of feature selection and parameter optimisation of kernel to enhance the performance of support vector machine

被引:1
作者
Sarojini, Balakrishnan [1 ]
机构
[1] Women Univ, Avinashilingam Inst Home Sci & Higher Educ, Dept Comp Sci, Coimbatore 641043, Tamil Nadu, India
关键词
big data; feature selection; support vector machine; SVM; kernel parameters; optimisation; accuracy and AUC;
D O I
10.1504/IJCNDS.2015.070982
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining the big data is a challenging task due to the size of the databases and the complexity in maintaining precise and non-redundant data. Classification algorithms need to analyse hundreds of independent features in these high dimensional databases for effective prediction. The performance of classification algorithms could be enhanced data if irrelevant and redundant data are removed. Feature selection algorithms help in identifying prominent features that could enhance the performance of the classifier. Additionally, the classification performance of support vector machine (SVM) could be enhanced by setting appropriate kernel parameters. The kernel parameters of SVM are tuned for each feature subset generated by feature selection and the performance is analysed. The feature subset that enhances the classification performance of SVM is the optimal feature subset of the dataset. Experiments are done on three medical datasets. The empirical results prove that integrating feature selection and optimising the kernel parameters enhance the performance of the SVM classifier. The approach is validated in terms of increase in accuracy and area under receiver operating characteristic (AUC) of the classifier.
引用
收藏
页码:265 / 278
页数:14
相关论文
共 15 条
[1]  
Asuncion A., 2007, UCI MACHINE LEARNING
[2]  
Bellman, 2015, ADAPTIVE CONTROL PRO
[3]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4]  
Chen Y. wei, 2005, COMBINING SVMS VARIO
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]   Feature selection for support vector machines by means of genetic algorithms [J].
Fröhlich, H ;
Chapelle, O ;
Schölkopf, B .
15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, :142-148
[7]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[8]  
Hsu C. W., 2003, TECHNICAL REPORT
[9]   A simple decomposition method for support vector machines [J].
Hsu, CW ;
Lin, CJ .
MACHINE LEARNING, 2002, 46 (1-3) :291-314
[10]   Prediction model building and feature selection with support vector machines in breast cancer diagnosis [J].
Huang, Cheng-Lung ;
Liao, Hung-Chang ;
Chen, Mu-Chen .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (01) :578-587