A new feature selection algorithm based on fuzzy-pathfinder optimization

被引:0
作者
Zandvakili A. [1 ]
Mansouri N. [2 ]
Javidi M.M. [2 ]
机构
[1] Department of Computer Science, Shahid Bahonar University of Kerman, Kerman
[2] Faculty of Shahid, Bahonar University of Kerman, Kerman
关键词
Feature selection; Fuzzy logic; Meta-heuristic; Pathfinder optimization;
D O I
10.1007/s00521-024-10043-2
中图分类号
学科分类号
摘要
Data mining and machine learning require feature selection because features can dramatically improve model performance. In contrast, there are no polynomial solutions for selecting a subset feature. It is possible to achieve this by using meta-heuristic algorithms, specifically population-based algorithms that are able to provide a subset of features that is optimal and not exact. Meta-heuristic algorithms face challenges such as staying in local minima, easily falling into local optimum, weakly global searchability, premature convergence, and slow convergence speeds. However, recent research has limitations such as high complexity and weak initialization. In order to overcome these limitations, a three-stage model is proposed. In the first stage, the correlation of features and the correlation of features with class are considered during feature selection and used to create the initial population in the pathfinder optimization algorithm (PFA). PFA is a population-based algorithm and has some drawbacks, in the last iterations, the fluctuation rate (A) and vibration vector (ε) parameters converge to 0, and finding a new solution is impossible. As a second stage, a fuzzy inference system is designed to adjust these parameters adaptively and is called fuzzy-pathfinder optimization (FPO). In the third stage, FPO is used to select relevant features based on classification error, proportion of selected features, and redundancy. Finally, different algorithms such as simulated annealing (SA), differential evolutionary (DE), genetic algorithm (GA), particle swarm optimization (PSO), PFA, estimation of distribution algorithm (EDA), and symmetrical uncertainty criterion (SUC-PSO) are used for comparison. Based on the results, the proposed model is able to reach an average accuracy of 96% on average. Based on a comparison of the proposed algorithm with SA, DE, GA, PSO, PFA, EDA, and SUC-PSO, the objective function is improved by 17.3%, 5.6%, 3.0%, 4.5%, 5.0%, 0.5%, and 1.2%, respectively. The use of comprehensive objective functions, the adaptive adjustment of parameters, and the creation of a targeted initial population are key strengths of FPO. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
引用
收藏
页码:17585 / 17614
页数:29
相关论文
共 39 条
  • [1] Zhang P., Et al., MFSJMI: Multi-label feature selection considering join mutual information and interaction weight, Pattern Recogn, 138, (2023)
  • [2] Thakkar A., Lohiya R., A survey on intrusion detection system: feature selection, model, performance measures, application perspective, challenges, and future research directions, Artif Intell Rev, 55, pp. 453-563, (2022)
  • [3] Han F., Et al., Multi-objective particle swarm optimization with adaptive strategies for feature selection, Swarm Evol Comput, 62, (2021)
  • [4] Paul D., Et al., Multi-objective PSO based online feature selection for multi-label classification, Knowl-Based Syst, 222, (2021)
  • [5] Halim Z., Et al., An effective genetic algorithm-based feature selection method for intrusion detection systems, Comput Secur, 110, (2021)
  • [6] Bandyopadhyay R., Et al., Harris Hawks optimisation with Simulated Annealing as a deep feature selection method for screening of COVID-19 CT-scans, Appl Soft Comput, 111, (2021)
  • [7] Salesi S., Et al., TAGA: tabu asexual genetic algorithm embedded in a filter/filter feature selection approach for high-dimensional data, Inf Sci, 565, pp. 105-127, (2021)
  • [8] Dhal P., Azad C., A multi-objective feature selection method using newton’s law based pso with gwo, Appl Soft Comput, 107, (2021)
  • [9] Lee I.G., Et al., A mixed integer linear programming support vector machine for cost-effective group feature selection: branch-cut-and-price approach, Eur J Oper Res, 299, pp. 1055-1068, (2022)
  • [10] Sharifabad M.M., Et al., BRNS+ SSFSM-DTI: a hybrid method for drug-target interaction prediction based on balanced reliable negative samples and semi-supervised feature selection, Chemom Intell Lab Syst, 220, (2022)