Mean based relief: An improved feature selection method based on ReliefF

Cited by: 0
Authors
Nitisha Aggarwal
Unmesh Shukla
Geetika Jain Saxena
Mukesh Rawat
Anil Singh Bafila
Sanjeev Singh
Amit Pundir
Affiliations
[1] University of Delhi South Campus, Institute of Informatics and Communication
[2] Maharaja Agrasen College, Department of Electronics
Source
Applied Intelligence | 2023 / Vol. 53
Keywords
Feature selection; Filter method; Machine learning; ReliefF
DOI
Not available
Abstract
Selection of relevant features is vitally important in machine learning tasks involving large datasets with numerous features. It helps in reducing the dimensionality of a dataset and improving model performance. This study introduces a feature selection technique named μ-Relief, which is based on ReliefF, one of the most extensively used Relief-based algorithms. μ-Relief effectively determines the most relevant feature subset and significantly outperforms the ReliefF algorithm. ReliefF estimates feature quality considering only the nearest neighbors, resulting in low classification accuracy on non-uniformly distributed or noisy datasets. The proposed μ-Relief technique considers neighbors with more effective information on the basis of mean distance. It utilizes neighbors far from the mean distance to obtain feature weight estimates, which improves the algorithm's performance. The algorithm was tested on thirteen real-world datasets and validated on three synthetic datasets. Its effectiveness in selecting relevant features was evaluated by comparing it to other well-known feature selection algorithms, namely Chi-Square, ANOVA, MI, CMIM, MRMR, SURF*, MultiSURF, MultiSURF*, and ReliefF. Multiple classifiers were trained on the features selected by each technique, and classification accuracy, weighted F1-score, and ROC-AUC showed that μ-Relief effectively determined relevant features and outperformed the other techniques.
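The contrast drawn in the abstract, between weighting features from the nearest neighbors only and weighting them from neighbors chosen relative to the mean distance, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not state the exact selection rule, so `far_from_mean` below is a hypothetical reading (keep neighbors whose distance deviates from the mean distance by at least the average deviation). The weight update is a simplified Relief-style rule (Manhattan distance, pooled misses, no class-prior weighting), and all function names are illustrative.

```python
import random
from statistics import mean


def relief_weights(X, y, select):
    """Simplified Relief-style feature weights.

    `select` receives a list of distances and returns the positions of the
    neighbors to use. Misses from all other classes are pooled, which is a
    simplification of ReliefF's class-prior weighting.
    """
    n, d = len(X), len(X[0])
    lo = [min(row[j] for row in X) for j in range(d)]
    hi = [max(row[j] for row in X) for j in range(d)]
    span = [(hi[j] - lo[j]) or 1.0 for j in range(d)]
    w = [0.0] * d
    for i in range(n):
        # Per-feature differences, scaled to [0, 1], from instance i to all others.
        diff = [[abs(X[i][j] - X[m][j]) / span[j] for j in range(d)] for m in range(n)]
        dist = [sum(row) for row in diff]  # Manhattan distance
        hits = [m for m in range(n) if m != i and y[m] == y[i]]
        misses = [m for m in range(n) if y[m] != y[i]]
        # Differences to hits lower a feature's weight; differences to misses raise it.
        for group, sign in ((hits, -1.0), (misses, 1.0)):
            if not group:
                continue
            chosen = [group[p] for p in select([dist[m] for m in group])]
            if not chosen:
                continue
            for j in range(d):
                w[j] += sign * mean(diff[m][j] for m in chosen) / n
    return w


def k_nearest(k):
    """ReliefF-style selection: the k closest neighbors."""
    def select(dists):
        return sorted(range(len(dists)), key=dists.__getitem__)[:k]
    return select


def far_from_mean(dists):
    """ASSUMPTION (not from the paper): keep neighbors whose distance deviates
    from the mean distance by at least the average deviation."""
    mu = mean(dists)
    dev = [abs(x - mu) for x in dists]
    cut = mean(dev)
    return [p for p, v in enumerate(dev) if v >= cut]


def make_data(n=60, seed=0):
    """Toy data: feature 0 tracks the class label, feature 1 is uniform noise."""
    rng = random.Random(seed)
    y = [m % 2 for m in range(n)]
    X = [[label + rng.uniform(-0.1, 0.1), rng.uniform(0.0, 1.0)] for label in y]
    return X, y


if __name__ == "__main__":
    X, y = make_data()
    print("k-nearest weights:    ", relief_weights(X, y, k_nearest(5)))
    print("far-from-mean weights:", relief_weights(X, y, far_from_mean))
```

Passing the neighbor selector as a function keeps the weight update identical across both variants, so any difference in the resulting weights comes only from which neighbors are consulted.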
Pages: 23004-23028
Page count: 24