Differential Evolution-Based Feature Selection: A Niching-Based Multiobjective Approach

Cited by: 52
Authors
Wang, Peng [1]
Xue, Bing [1]
Liang, Jing [2]
Zhang, Mengjie [1]
Affiliations
[1] Victoria Univ Wellington, Evolutionary Computat Res Grp, Wellington 6140, New Zealand
[2] Zhengzhou Univ, Sch Elect Engn, Zhengzhou 450001, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Classification algorithms; Task analysis; Statistics; Sociology; Error analysis; Optimization; Classification; differential evolution (DE); evolutionary multiobjective optimization (EMO); feature selection; GENETIC ALGORITHM; OPTIMIZATION; RELEVANCE;
DOI
10.1109/TEVC.2022.3168052
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection aims to reduce both the dimensionality of data and the classification error rate (i.e., increase the classification accuracy) of a learning algorithm. The two objectives often conflict, so a multiobjective feature selection method is used to obtain a set of nondominated feature subsets. Each solution in the set has a different size and a corresponding classification error rate. However, most existing feature selection algorithms ignore the fact that, for a given size, there can be different feature subsets with very similar or identical accuracy. This article introduces a niching-based multiobjective feature selection method that simultaneously minimizes the number of selected features and the classification error rate. The proposed method aims to identify: 1) a set of feature subsets with good convergence and distribution and 2) multiple feature subsets that select the same number of features and achieve almost the same lowest classification error rate. The contributions of this article are threefold. First, a niching and global interaction mutation operator is proposed that can produce promising feature subsets. Second, a newly developed environmental selection mechanism allows equally informative feature subsets to be stored by relaxing the Pareto-dominance relationship. Finally, the proposed subset repairing mechanism can generate better feature subsets and further remove redundant features. The proposed method is compared against seven multiobjective feature selection algorithms on 19 datasets, including both binary and multiclass classification tasks. The results show that the proposed method can evolve a rich and diverse set of nondominated solutions for different feature selection tasks, and that the availability of these solutions helps in understanding the relationships between features.
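To make the idea of relaxing Pareto dominance concrete, the following is a minimal Python sketch, not the authors' implementation. It treats each candidate as a pair (feature subset, error rate), minimizes both subset size and error, and compares a standard nondominated filter with a relaxed one. The relaxation rule used here (two subsets of the same size whose error rates differ by at most `tol` do not dominate each other) and the names `relaxed_dominates`, `tol`, and `population` are assumptions made for illustration; the abstract does not specify the exact rule.

```python
from typing import Callable, FrozenSet, List, Tuple

# A candidate solution: (selected feature indices, classification error rate).
Solution = Tuple[FrozenSet[int], float]


def dominates(a: Solution, b: Solution) -> bool:
    """Standard Pareto dominance; subset size and error are both minimized."""
    size_a, err_a = len(a[0]), a[1]
    size_b, err_b = len(b[0]), b[1]
    no_worse = size_a <= size_b and err_a <= err_b
    strictly_better = size_a < size_b or err_a < err_b
    return no_worse and strictly_better


def relaxed_dominates(a: Solution, b: Solution, tol: float = 0.01) -> bool:
    """Hypothetical relaxation (an assumption, not the paper's rule):
    same-size subsets with error rates within `tol` never dominate each other."""
    if len(a[0]) == len(b[0]) and abs(a[1] - b[1]) <= tol:
        return False
    return dominates(a, b)


def nondominated(pop: List[Solution],
                 dom: Callable[[Solution, Solution], bool]) -> List[Solution]:
    """Keep every solution that no other solution dominates under `dom`."""
    return [p for p in pop if not any(dom(q, p) for q in pop if q is not p)]


# Made-up example data: error rates are illustrative, not experimental results.
population: List[Solution] = [
    (frozenset({0, 1}), 0.100),
    (frozenset({2, 3}), 0.105),     # same size, almost the same error
    (frozenset({0}), 0.200),
    (frozenset({0, 1, 2}), 0.100),  # larger subset with no accuracy gain
    (frozenset({4, 5}), 0.300),     # same size as {0, 1} but clearly worse
]

strict_front = nondominated(population, dominates)
relaxed_front = nondominated(population, relaxed_dominates)
```

In this toy population the strict filter keeps only one subset per size, while the relaxed filter also keeps `{2, 3}`, the kind of equally informative alternative the abstract describes. A tolerance-based rule like this one is not transitive, so a real environmental selection mechanism would need additional care.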
Pages: 296-310
Page count: 15