A novel hybrid algorithm for feature selection

被引:0
作者
Yuefeng Zheng
Ying Li
Gang Wang
Yupeng Chen
Qian Xu
Jiahao Fan
Xueting Cui
机构
[1] Jilin University,College of Computer Science and Technology
[2] Jilin University,Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education
[3] BODA College of Jilin Normal University,undefined
来源
Personal and Ubiquitous Computing | 2018年 / 22卷
关键词
Cuckoo search algorithm; Classification; Dimensionality reduction; Feature selection; Maximum Spearman and minimum covariance;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important filtering method for data analysis, pattern classification, data mining, and so on. Feature selection reduces the number of features by removing irrelevant and redundant data. In this paper, we propose a hybrid filter–wrapper feature subset selection algorithm called the maximum Spearman minimum covariance cuckoo search (MSMCCS). First, based on Spearman and covariance, a filter algorithm is proposed called maximum Spearman minimum covariance (MSMC). Second, three parameters are proposed in MSMC to adjust the weights of the correlation and redundancy, improve the relevance of feature subsets, and reduce the redundancy. Third, in the improved cuckoo search algorithm, a weighted combination strategy is used to select candidate feature subsets, a crossover mutation concept is used to adjust the candidate feature subsets, and finally, the filtered features are selected into optimal feature subsets. Therefore, the MSMCCS combines the efficiency of filters with the greater accuracy of wrappers. Experimental results on eight common data sets from the University of California at Irvine Machine Learning Repository showed that the MSMCCS algorithm had better classification accuracy than the seven wrapper methods, the one filter method, and the two hybrid methods. Furthermore, the proposed algorithm achieved preferable performance on the Wilcoxon signed-rank test and the sensitivity–specificity test.
引用
收藏
页码:971 / 985
页数:14
相关论文
共 124 条
[1]  
Armanfard N(2016)Local feature selection for data classification IEEE Trans Pattern Anal Mach Intell 38 1217-1227
[2]  
Reilly JP(2011)Feature selection and kernel learning for local learning-based clustering IEEE Trans Pattern Anal Mach Intell 33 1532-1547
[3]  
Komeili M(2015)Feature selection via global redundancy minimization IEEE Trans Knowl Data Eng 27 2743-2755
[4]  
Zeng H(1997)Eigenfaces vs. Fisherfaces: recognition using class specific linear projection IEEE Trans Pattern Anal Mach Intell 19 711-720
[5]  
Cheung YM(2008)MPCA: multilinear principal component analysis of tensor objects IEEE Trans Neural Netw 19 18-39
[6]  
Wang D(2005)Face recognition using laplacianfaces IEEE Trans Pattern Anal Mach Intell 27 328-340
[7]  
Nie F(2003)Laplacian Eigenmaps for dimensionality reduction and data representation Neural Comput 15 1373-1396
[8]  
Huang H(2013)Comparison of metaheuristic strategies for peakbin selection in proteomic mass spectrometry data Inf Sci 222 229-246
[9]  
Belhumeur PN(2015)The ant lion optimizer Adv Eng Softw 83 80-98
[10]  
Hespanha JP(2013)Bat algorithm: literature review and applications Int J Bio-Inspir Com 5 141-149