Multi-objective optimization algorithm based on clustering guided binary equilibrium optimizer and NSGA-III to solve high-dimensional feature selection problem

被引:12
作者
Zhang, Min [1 ]
Wang, Jie-Sheng [1 ]
Liu, Yu [1 ]
Song, Hao-Ming [1 ]
Hou, Jia-Ning [1 ]
Wang, Yu-Cai [1 ]
Wang, Min [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Elect & Informat Engn, Anshan 114044, Peoples R China
关键词
Feature selection; Equilibrium optimizer; NSGA-III; Transfer function; Clustering guidance;
D O I
10.1016/j.ins.2023.119638
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection (FS) is an indispensable activity in machine learning, whose purpose is to identify relevant predictive values from a high-dimensional feature space to improve performance and reduce model learning time. However, the large increase in feature dimensions poses a great challenge to FS methods. Therefore, a multi-objective optimization algorithm consisting of an Equilibrium Optimizer (EO) and NSGA-III was proposed to solve the FS problem with high-dimensional data. Through S-shaped, V-shaped and U-shaped transfer functions, the conversion from real number coding to binary coding is realized to solve discrete problems, and the influence of these three transfer functions on the effect of FS is compared. In addition, the algorithm op-timizes the population in the binary search space by building an external archive, and realizes the selection and optimization of external archive individuals based on the clustering strategy. The KNN classifier was used to realize the classification progress. The simulation experiments are divided into two groups with 18 medium and high dimensional data. The first group analyzes the optimization effect of the proposed framework under three transfer functions. The second group of experiments selects the algorithm that wins in the first group of experiments and compares it with eleven classical multi-objective optimization algorithms. The evaluation criteria includes two optimization objectives of the FS problem and the optimization indices of HV and IGD. The first set of experiments showed that the U-shaped transfer function family performed best in the FS problem, with U3 being the most excellent, followed by V-shaped and S-shaped. Compared to other multi-objective optimization algorithms, the simulation results also confirm the effective-ness of the proposed strategy.
引用
收藏
页数:30
相关论文
共 47 条
[1]   Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems [J].
Akinola, Olatunji A. ;
Agushaka, Jeffrey O. ;
Ezugwu, Absalom E. .
PLOS ONE, 2022, 17 (10)
[2]   Fast Genetic Algorithm for feature selection-A qualitative approximation approach [J].
Altarabichi, Mohammed Ghaith ;
Nowaczyk, Slawomir ;
Pashami, Sepideh ;
Mashhadi, Peyman Sheikholharam .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
[3]   On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems [J].
Amaldi, E ;
Kann, V .
THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) :237-260
[4]  
Backhaus K., 2023, Multivariate Analysis: An Application-Oriented Introduction, P453
[5]   Finding compact and well-separated clusters: Clustering using silhouette coefficients [J].
Bagirov, Adil M. ;
Aliguliyev, Ramiz M. ;
Sultanova, Nargiz .
PATTERN RECOGNITION, 2023, 135
[6]   How to find a good explanation for clustering? [J].
Bandyapadhyay, Sayan ;
Fomin, Fedor, V ;
Golovach, Petr A. ;
Lochet, William ;
Purohit, Nidhi ;
Simonov, Kirill .
ARTIFICIAL INTELLIGENCE, 2023, 322
[7]   Enhanced Ali Baba and the forty thieves algorithm for feature selection [J].
Braik, Malik .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (08) :6153-6184
[8]   Determination of fatty acid of wheat by near-infrared spectroscopy with combined feature selection based on CARS and NSGA-III [J].
Chen, Run .
INFRARED PHYSICS & TECHNOLOGY, 2023, 129
[9]   An improved binary particle swarm optimization combing V-shaped and U-shaped transfer function [J].
Chen, Yuxiang ;
Liu, Jianhua ;
Zhu, Jian ;
Wang, Zihang .
EVOLUTIONARY INTELLIGENCE, 2023, 16 (05) :1653-1666
[10]   A hybrid feature selection model based on butterfly optimization algorithm: COVID-19 as a case study [J].
EL-Hasnony, Ibrahim M. ;
Elhoseny, Mohamed ;
Tarek, Zahraa .
EXPERT SYSTEMS, 2022, 39 (03)