An evolutionary filter approach to feature selection in classification for both single- and multi-objective scenarios

被引:14
作者
Hancer, Emrah [1 ,2 ,3 ]
Xue, Bing [1 ,2 ]
Zhang, Mengjie [1 ,2 ]
机构
[1] Victoria Univ Wellington, Ctr Data Sci & Artificial Intelligence, Wellington 6140, New Zealand
[2] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
[3] Mehmet Akif Ersoy Univ, Dept Software Engn, TR-15039 Burdur, Turkiye
关键词
Differential evolution; Neighborhood component analysis; Multi-objective optimization; Classification; PARTICLE SWARM OPTIMIZATION; DIFFERENTIAL EVOLUTION; MUTUAL INFORMATION; ALGORITHM; DENSITY;
D O I
10.1016/j.knosys.2023.111008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The high-dimensional datasets in various domains, such as text categorization, information retrieval and bioinformatics, have highlighted the importance of feature selection in data mining. Despite the numerous existing approaches to feature selection, there is still a need for further research in this field. In this paper, we propose an evolutionary filter feature selection approach that can be used for both single-and multi-objective scenarios by introducing an objective function inspired by Neighborhood Component Analysis (NCA)-based method and then integrating it into the differential evolution framework. The proposed approach applicable to two scenarios aims to identify an optimal feature subset through an evolutionary search process that maximizes class separation while minimizing the dimensionality. Through comprehensive experimental studies conducted on diverse datasets, the results show that the proposed approach outperforms recently proposed evolutionary information-theoretic, rough set-based and state-of-the-art feature selection approaches in both scenarios. Notably, this study is the first to integrate an NCA-based strategy into an evolutionary feature selection approach. Furthermore, you can access the source code of this approach at https://github.com/ ehancer06/DENCA this link.
引用
收藏
页数:15
相关论文
共 43 条
  • [1] [Anonymous], 2001, 7 INT C SOFT COMP
  • [2] A survey on swarm intelligence approaches to feature selection in data mining
    Bach Hoai Nguyen
    Xue, Bing
    Zhang, Mengjie
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2020, 54
  • [3] USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING
    BATTITI, R
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04): : 537 - 550
  • [4] Coleman T. F., 1994, Mathematical Programming, V67, P189, DOI [10.1007/BF01582221, DOI 10.1007/BF01582221]
  • [5] Deb K., 2000, Parallel Problem Solving from Nature PPSN VI. 6th International Conference. Proceedings (Lecture Notes in Computer Science Vol.1917), P849
  • [6] Surrogate-Assisted and Filter-Based Multiobjective Evolutionary Feature Selection for Deep Learning
    Espinosa, Raquel
    Jimenez, Fernando
    Palma, Jose
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9591 - 9605
  • [7] FERRI FJ, 1994, MACH INTELL PATT REC, V16, P403
  • [8] The use of multiple measurements in taxonomic problems
    Fisher, RA
    [J]. ANNALS OF EUGENICS, 1936, 7 : 179 - 188
  • [9] Evolutionary feature selection on high dimensional data using a search space reduction approach
    Garcia-Torres, Miguel
    Ruiz, Roberto
    Divina, Federico
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [10] Goldberger J., 2004, NEURAL INF PROCESS S, V17, P513