A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection

被引:0
作者
Marwa Hammami
Slim Bechikh
Chih-Cheng Hung
Lamjed Ben Said
机构
[1] Kennesaw State University,
[2] Anyang Normal University,undefined
[3] SMART Lab,undefined
[4] University of Tunis,undefined
[5] ISG-Campus,undefined
来源
Memetic Computing | 2019年 / 11卷
关键词
Feature selection; Multi-objective optimization; Filter objectives; Expensive wrapper objectives; Hybrid evolutionary algorithms;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is an important pre-processing data mining task, which can reduce the data dimensionality and improve not only the classification accuracy but also the classifier efficiency. Filters use statistical characteristics of the data as the evaluation measure rather than using a classification algorithm. On the contrary, the wrapper process is computationally expensive because the evaluation of every feature subset requires running the classifier on the datasets and computing the accuracy from the obtained confusion matrix. In order to solve this problem, we propose a hybrid tri-objective evolutionary algorithm that optimizes two filter objectives, namely the number of features and the mutual information, and one wrapper objective corresponding to the accuracy. Once the population is classified into different non-dominated fronts, only feature subsets belonging to the first (best) one are improved using the indicator-based multi-objective local search. Our proposed hybrid algorithm, named Filter-Wrapper-based Nondominated Sorting Genetic Algorithm-II, is compared against several multi-objective and single-objective feature selection algorithms on eighteen benchmark datasets having different dimensionalities. Experimental results show that our proposed algorithm gives competitive and better results with respect to existing algorithms.
引用
收藏
页码:193 / 208
页数:15
相关论文
共 40 条
  • [1] Gheyas IA(2010)Feature subset selection in large dimensionality domains Pattern Recognit 43 5-13
  • [2] Smith LS(1997)Wrappers for feature subset selection Artif Intell 97 273-324
  • [3] Kohavi R(2016)A Survey on evolutionary computation approaches to feature selection IEEE Trans Evol Comput 20 606-626
  • [4] John GH(2013)An SVM-wrapped multiobjective evolutionary feature selection approach for identifying cancer-microRNA markers IEEE Trans Nanobiosci 12 275-281
  • [5] Xue B(2013)Particle swarm optimization for feature selection in classification: a multi-objective approach IEEE Trans Cybern 43 1656-1671
  • [6] Zhang M(2007)A hybrid genetic algorithm for feature selection wrapper based on mutual information Pattern Recognit Lett 28 1825-1844
  • [7] Browne WN(2010)Towards a memetic feature selection paradigm [application notes] IEEE Comput Intell Mag 5 41-53
  • [8] Yao X(2002)A fast and elitist multiobjective genetic algorithm: NSGA-II IEEE Trans Evol Comput 6 182-197
  • [9] Mukhopadhyay A(1967)Nearest neighbor pattern classification IEEE Trans Inf Theory 13 21-27
  • [10] Maulik U(2011)A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms Swarm Evol Comput 1 3-18