A novel hybrid algorithm for feature selection

被引:0
作者
Yuefeng Zheng
Ying Li
Gang Wang
Yupeng Chen
Qian Xu
Jiahao Fan
Xueting Cui
机构
[1] Jilin University,College of Computer Science and Technology
[2] Jilin University,Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education
[3] BODA College of Jilin Normal University,undefined
来源
Personal and Ubiquitous Computing | 2018年 / 22卷
关键词
Cuckoo search algorithm; Classification; Dimensionality reduction; Feature selection; Maximum Spearman and minimum covariance;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important filtering method for data analysis, pattern classification, data mining, and so on. Feature selection reduces the number of features by removing irrelevant and redundant data. In this paper, we propose a hybrid filter–wrapper feature subset selection algorithm called the maximum Spearman minimum covariance cuckoo search (MSMCCS). First, based on Spearman and covariance, a filter algorithm is proposed called maximum Spearman minimum covariance (MSMC). Second, three parameters are proposed in MSMC to adjust the weights of the correlation and redundancy, improve the relevance of feature subsets, and reduce the redundancy. Third, in the improved cuckoo search algorithm, a weighted combination strategy is used to select candidate feature subsets, a crossover mutation concept is used to adjust the candidate feature subsets, and finally, the filtered features are selected into optimal feature subsets. Therefore, the MSMCCS combines the efficiency of filters with the greater accuracy of wrappers. Experimental results on eight common data sets from the University of California at Irvine Machine Learning Repository showed that the MSMCCS algorithm had better classification accuracy than the seven wrapper methods, the one filter method, and the two hybrid methods. Furthermore, the proposed algorithm achieved preferable performance on the Wilcoxon signed-rank test and the sensitivity–specificity test.
引用
收藏
页码:971 / 985
页数:14
相关论文
共 124 条
[31]  
Nakamura RYM(2016)A hybrid particle swarm optimization for feature subset selection by integrating a novel local search strategy [J] Appl Soft Comput 43 117-130
[32]  
Costa KAP(2014)Cuckoo search: recent advances and applications Neural Comput Applic 24 169-174
[33]  
Yang XS(2014)Discrete cuckoo search algorithm for the travelling salesman problem Neural Comput & Applic 24 1659-1669
[34]  
Souza AN(2015)Cross grouping strategy based 2DPCA method for face recognition Appl Soft Comput 29 270-279
[35]  
Papa JP(2015)Stress test procedure for feature selection algorithms Chemom Intell Lab Syst 142 172-183
[36]  
Passino KM(2013)Incremental updating approximations in dominance-based rough sets approach under the variation of the attribute set Knowl Based Syst 40 17-26
[37]  
Chen YP(2006)A GA-based feature selection and parameters optimization for support vector machines Expert Syst Appl 31 231-240
[38]  
Li Y(2000)Assessment of the sensitivity and specificity of oligonucleotide (50mer) microarrays Nucleic Acids Res 28 4552-4557
[39]  
Wang G(1973)On methods of handling ties in the Wilcoxon signed-rank test J Am Stat Assoc 68 985-988
[40]  
Zheng YF(2011)A ‘non-parametric’ version of the naive Bayes classifier Knowl Based Syst 24 775-784