A Length-Adaptive Non-Dominated Sorting Genetic Algorithm for Bi-Objective High-Dimensional Feature Selection

被引:17
作者
Gong, Yanlu [1 ]
Zhou, Junhai [1 ]
Wu, Quanwang [1 ]
Zhou, MengChu [2 ]
Wen, Junhao [3 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[2] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[3] Chongqing Univ, Coll Big Data & Software Engn, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Bi-objective optimization; feature selection (FS); genetic algorithm; high-dimensional data; length-adaptive; MULTIOBJECTIVE FEATURE-SELECTION; DIFFERENTIAL EVOLUTION; SEARCH;
D O I
10.1109/JAS.2023.123648
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a crucial data preprocessing method in data mining, feature selection (FS) can be regarded as a bi-objective optimization problem that aims to maximize classification accuracy and minimize the number of selected features. Evolutionary computing (EC) is promising for FS owing to its powerful search capability. However, in traditional EC-based methods, feature subsets are represented via a length-fixed individual encoding. It is ineffective for high-dimensional data, because it results in a huge search space and prohibitive training time. This work proposes a length-adaptive non-dominated sorting genetic algorithm (LA-NSGA) with a length-variable individual encoding and a length-adaptive evolution mechanism for bi-objective high-dimensional FS. In LA-NSGA, an initialization method based on correlation and redundancy is devised to initialize individuals of diverse lengths, and a Pareto dominance-based length change operator is introduced to guide individuals to explore in promising search space adaptively. Moreover, a dominance-based local search method is employed for further improvement. The experimental results based on 12 high-dimensional gene datasets show that the Pareto front of feature subsets produced by LA-NSGA is superior to those of existing algorithms.
引用
收藏
页码:1834 / 1844
页数:11
相关论文
共 53 条
  • [1] A dynamic locality multi-objective salp swarm algorithm for feature selection
    Aljarah, Ibrahim
    Habib, Maria
    Faris, Hossam
    Al-Madi, Nailah
    Heidari, Ali Asghar
    Mafarja, Majdi
    Abd Elaziz, Mohamed
    Mirjalili, Seyedali
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 147
  • [2] Optimizing multi-objective PSO based feature selection method using a feature elitism mechanism
    Amoozegar, Maryam
    Minaei-Bidgoli, Behrouz
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 113 : 499 - 514
  • [3] A survey on swarm intelligence approaches to feature selection in data mining
    Bach Hoai Nguyen
    Xue, Bing
    Zhang, Mengjie
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2020, 54
  • [4] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [5] Benchmark for filter methods for feature selection in high-dimensional classification data
    Bommert, Andrea
    Sun, Xudong
    Bischl, Bernd
    Rahnenfuehrer, Joerg
    Lang, Michel
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143
  • [6] A survey on feature selection methods
    Chandrashekar, Girish
    Sahin, Ferat
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) : 16 - 28
  • [7] Chen Hongge, 2019, P MACHINE LEARNING R, V97
  • [8] Variable-Length Representation for EC-Based Feature Selection in High-Dimensional Data
    Cilia, N. D.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2019, 2019, 11454 : 325 - 340
  • [9] A Bi-population Cooperative Optimization Algorithm Assisted by an Autoencoder for Medium-scale Expensive Problems
    Cui, Meiji
    Li, Li
    Zhou, MengChu
    Li, Jiankai
    Abusorrah, Abdullah
    Sedraoui, Khaled
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (11) : 1952 - 1966
  • [10] Surrogate-Assisted Autoencoder-Embedded Evolutionary Optimization Algorithm to Solve High-Dimensional Expensive Problems
    Cui, Meiji
    Li, Li
    Zhou, Mengchu
    Abusorrah, Abdullah
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (04) : 676 - 689