A new ranking-based stability measure for feature selection algorithms

Cited by: 1
Authors
Rakesh, Deepak Kumar [1]
Anwit, Raj [1, 2]
Jana, Prasanta K. [1]
Affiliations
[1] Indian Inst Technol, Indian Sch Mines, Dept Comp Sci & Engn, Dhanbad 826004, India
[2] Bhagalpur Coll Engn, Dept Comp Sci & Engn, Bhagalpur 813210, India
Keywords
Feature selection; Ensemble feature selection; Stability; High-dimensional datasets; Classifiers; ENSEMBLE; MICROARRAY; REPRODUCIBILITY; CLASSIFICATION
DOI
10.1007/s00500-022-07767-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The stability of a feature selection (FS) algorithm is one of the most crucial issues when working with a machine learning model. Various stability measures based on subsets of features have been proposed to date. However, they do not consider feature rankings, which are equally important for judging the robustness of an algorithm. This paper proposes a novel frequency-based stability measure called rank stability (RSt) that evaluates FS algorithms on both criteria, i.e., subsets of features and feature rankings. The proposed measure evaluates the variation in the feature rankings generated by FS algorithms after a small perturbation is made to the training set. We mathematically justify the proposed measure based on earlier and newly defined desirable properties. Additionally, we explore various heterogeneous ensemble techniques and compare them with traditional FS algorithms on real-world datasets. We perform extensive experiments to demonstrate that the heterogeneous ensemble techniques perform better than traditional FS algorithms with respect to the proposed measure and other performance metrics.
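To make the distinction between subset stability and ranking stability concrete, the following is a minimal sketch using two generic, well-known similarity scores: the Jaccard index on top-k subsets and the Spearman rank correlation on full rankings, each averaged over all pairs of runs. This is not the RSt measure from the paper, whose definition does not appear in this record. The function names and the example rankings are hypothetical; the rankings stand in for the output of one FS algorithm run on slightly perturbed copies of a training set.

```python
from itertools import combinations


def positions(ranking):
    """Map each feature to its rank position (0 = most important)."""
    return {feature: pos for pos, feature in enumerate(ranking)}


def spearman(ranking_a, ranking_b):
    """Spearman rank correlation of two tie-free rankings of the same features."""
    n = len(ranking_a)
    if n < 2:
        return 1.0
    pos_a, pos_b = positions(ranking_a), positions(ranking_b)
    d2 = sum((pos_a[f] - pos_b[f]) ** 2 for f in ranking_a)
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


def jaccard_top_k(ranking_a, ranking_b, k):
    """Jaccard index of the top-k feature subsets of two rankings."""
    top_a, top_b = set(ranking_a[:k]), set(ranking_b[:k])
    union = top_a | top_b
    return len(top_a & top_b) / len(union) if union else 1.0


def mean_pairwise(rankings, similarity):
    """Average a similarity score over all unordered pairs of rankings."""
    pairs = list(combinations(rankings, 2))
    if not pairs:
        raise ValueError("need at least two rankings")
    return sum(similarity(a, b) for a, b in pairs) / len(pairs)


# Hypothetical rankings (best feature first), one per perturbed training set.
rankings = [
    ["f1", "f2", "f3", "f4"],
    ["f1", "f3", "f2", "f4"],
    ["f2", "f1", "f3", "f4"],
]

# Ranking-level stability: sensitive to any reordering of features.
rank_stability = mean_pairwise(rankings, spearman)

# Subset-level stability: only sees which features make the top k.
subset_stability = mean_pairwise(
    rankings, lambda a, b: jaccard_top_k(a, b, k=2)
)

print(f"mean pairwise Spearman: {rank_stability:.3f}")
print(f"mean pairwise Jaccard (k=2): {subset_stability:.3f}")
```

In the first and third example rankings the top-2 subsets are identical even though the order differs, so a subset-based score alone would report perfect agreement for that pair while a ranking-based score would not. This is the kind of gap the abstract describes.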
Pages: 5377-5396
Page count: 20
Related papers
  • [1] A new ranking-based stability measure for feature selection algorithms
    Deepak Kumar Rakesh
    Raj Anwit
    Prasanta K. Jana
    Soft Computing, 2023, 27 : 5377 - 5396
  • [2] Neighborhood Ranking-Based Feature Selection
    Ipkovich, Adam
    Abonyi, Janos
    IEEE ACCESS, 2024, 12 : 20152 - 20168
  • [3] A New Measure of Feature Selection Algorithms' Stability
    Novovicova, Jana
    Somol, Petr
    Pudil, Pavel
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 382 - +
  • [4] Statistical model for reproducibility in ranking-based feature selection
    Urkullu, Ari
    Perez, Aritz
    Calvo, Borja
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (02) : 379 - 410
  • [5] Statistical model for reproducibility in ranking-based feature selection
    Ari Urkullu
    Aritz Pérez
    Borja Calvo
    Knowledge and Information Systems, 2021, 63 : 379 - 410
  • [6] Automatically fast determining of feature number for ranking-based feature selection
    Wang, Z.
    Sun, M.
    Jiang, J.
    ELECTRONICS LETTERS, 2012, 48 (23) : 1462 - 1463
  • [7] Ranking-Based Feature Selection Method for Dynamic Belief Clustering
    Ben Hariz, Sarra
    Elouedi, Zied
    ADAPTIVE AND INTELLIGENT SYSTEMS, 2011, 6943 : 308 - 319
  • [8] Ranking-based Feature Selection for Anomaly Detection in Sensor Networks
    Li, Rui
    Zhao, Jizhong
    Liu, Kebin
    He, Yuan
    AD HOC & SENSOR WIRELESS NETWORKS, 2013, 19 (1-2) : 119 - 139
  • [9] A ranking-based feature selection approach for handwritten character recognition
    Cilia, Nicole Dalia
    De Stefano, Claudio
    Fontanella, Francesco
    di Freca, Alessandra Scotto
    PATTERN RECOGNITION LETTERS, 2019, 121 : 77 - 86
  • [10] Conformal Stability Measure for Feature Selection Algorithms
    Lopez-De-Castro, Marcos
    Garcia-Galindo, Alberto
    Armananzas, Ruben
    13TH SYMPOSIUM ON CONFORMAL AND PROBABILISTIC PREDICTION WITH APPLICATIONS, 2024, 230 : 105 - 119