On the Stability of Feature Selection Algorithms

被引:0
|
作者
Nogueira, Sarah [1 ]
Sechidis, Konstantinos [1 ]
Brown, Gavin [1 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
基金
英国工程与自然科学研究理事会;
关键词
stability; feature selection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature Selection is central to modern data science, from exploratory data analysis to predictive model-building. The "stability" of a feature selection algorithm refers to the robustness of its feature preferences, with respect to data sampling and to its stochastic nature. An algorithm is 'unstable' if a small change in data leads to large changes in the chosen feature subset. Whilst the idea is simple, quantifying this has proven more challenging we note numerous proposals in the literature, each with different motivation and justification. We present a rigorous statistical treatment for this issue. In particular, with this work we consolidate the literature and provide (1) a deeper understanding of existing work based on a small set of properties, and (2) a clearly justified statistical approach with several novel benefits. This approach serves to identify a stability measure obeying all desirable properties, and (for the first time in the literature) allowing confidence intervals and hypothesis tests on the stability, enabling rigorous experimental comparison of feature selection algorithms.
引用
收藏
页数:54
相关论文
共 50 条
  • [1] Stability of feature selection algorithms
    Kalousis, A
    Prados, J
    Hilario, M
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 218 - 225
  • [2] A New Measure of Feature Selection Algorithms' Stability
    Novovicova, Jana
    Somol, Petr
    Pudil, Pavel
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 382 - +
  • [3] Conformal Stability Measure for Feature Selection Algorithms
    Lopez-De-Castro, Marcos
    Garcia-Galindo, Alberto
    Armananzas, Ruben
    13TH SYMPOSIUM ON CONFORMAL AND PROBABILISTIC PREDICTION WITH APPLICATIONS, 2024, 230 : 105 - 119
  • [4] A Study on Selection Stability Measures for Various Feature Selection Algorithms
    Chelvan, Mohana P.
    Perumal, K.
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH, 2016, : 121 - 124
  • [5] A Comparative Study of the Stability of Filter based Feature Selection Algorithms
    Sen, Rikta
    Mandal, Ashis Kumar
    Goswami, Saptarsi
    Chakraborty, Basabi
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 192 - 197
  • [6] Stability of feature selection algorithms: a study on high-dimensional spaces
    Kalousis, Alexandros
    Prados, Julien
    Hilario, Melanie
    KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (01) : 95 - 116
  • [7] Stability of feature selection algorithms: a study on high-dimensional spaces
    Alexandros Kalousis
    Julien Prados
    Melanie Hilario
    Knowledge and Information Systems, 2007, 12 : 95 - 116
  • [8] An information theoretic approach to quantify the stability of feature selection and ranking algorithms
    Alaiz-Rodriguez, Rocio
    Parnell, Andrew C.
    KNOWLEDGE-BASED SYSTEMS, 2020, 195 (195)
  • [9] A new ranking-based stability measure for feature selection algorithms
    Rakesh, Deepak Kumar
    Anwit, Raj
    Jana, Prasanta K.
    SOFT COMPUTING, 2023, 27 (09) : 5377 - 5396
  • [10] A new ranking-based stability measure for feature selection algorithms
    Deepak Kumar Rakesh
    Raj Anwit
    Prasanta K. Jana
    Soft Computing, 2023, 27 : 5377 - 5396