Invariant optimal feature selection: A distance discriminant and feature ranking based solution

被引:71
|
作者
Liang, Jianning [1 ]
Yang, Su [1 ]
Winstanley, Adam [2 ]
机构
[1] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Dept Comp Sci & Engn, Shanghai 200433, Peoples R China
[2] Natl Univ Ireland, Natl Ctr Geocomputat, Dept Comp Sci, Maynooth, Kildare, Ireland
基金
中国国家自然科学基金;
关键词
optimal feature selection; distance discriminant; feature ranking;
D O I
10.1016/j.patcog.2007.10.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of feature selection is to find the optimal subset consisting of m features chosen from the total it features. One critical problem for many feature selection methods is that an exhaustive search strategy has to be applied to seek the best subset among all the possible ((n)(m)) feature subsets, which usually results in a considerably high computational complexity. The alternative suboptimal feature selection methods provide more practical solutions in terms of computational complexity but they cannot promise that the finally selected feature subset is globally optimal. We propose a new feature selection algorithm based on a distance discriminant (FSDD), which not only solves the problem of the high computational costs but also overcomes the drawbacks of the suboptimal methods. The proposed method is able to find the optimal feature subset without exhaustive search or Branch and Bound algorithm. The most difficult problem for optimal feature selection, the search problem, is converted into a feature ranking problem following rigorous theoretical proof such that the computational complexity can be greatly reduced. The proposed method is invariant to the linear transformation of data when a diagonal transformation matrix is applied. FSDD was compared with ReliefF and mrmrMID based on mutual information on 8 data sets. The experiment results show that FSDD outperforms the other two methods and is highly efficient. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1429 / 1439
页数:11
相关论文
共 50 条
  • [1] AN OPTIMAL FEATURE SUBSET SELECTION METHOD BASED ON DISTANCE DISCRIMINANT AND DISTRIBUTION OVERLAPPING
    Liang, Jianning
    Yang, Su
    Wang, Yuanyuan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2009, 23 (08) : 1577 - 1597
  • [2] Unsupervised Optimal Discriminant Vector Based Feature Selection Method
    Cao, Su-Qun
    Manton, Jonathan H.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [3] Discriminant Feature Extraction Based on Center Distance
    Yan, Hui
    Yang, Wankou
    Yang, Jian
    Yang, Jingyu
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1249 - 1252
  • [4] Feature ranking based consensus clustering for feature subset selection
    Rani, D. Sandhya
    Rani, T. Sobha
    Bhavani, S. Durga
    Krishna, G. Bala
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8154 - 8169
  • [5] Euclidean distance based feature ranking and subset selection for bearing fault diagnosis
    Patel, Sachin P.
    Upadhyay, S. H.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 154
  • [6] Feature selection based on kernel discriminant analysis
    Ashihara, Masamichi
    Abe, Shigeo
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 282 - 291
  • [7] Kernel discriminant analysis based feature selection
    Ishii, Tsuneyoshi
    Ashihara, Masamichi
    Abe, Shigeo
    NEUROCOMPUTING, 2008, 71 (13-15) : 2544 - 2552
  • [8] Ranking a random feature for variable and feature selection
    Stoppiglia, Hervé
    Dreyfus, Gérard
    Dubois, Rémi
    Oussar, Yacine
    Journal of Machine Learning Research, 2003, 3 : 1399 - 1414
  • [9] UNSUPERVISED FEATURE RANKING AND SELECTION BASED ON AUTOENCODERS
    Sharifipour, Sasan
    Fayyazi, Hossein
    Sabokrou, Mohammad
    Adeli, Ehsan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3172 - 3176
  • [10] Optimal Feature Selection in High-Dimensional Discriminant Analysis
    Kolar, Mladen
    Liu, Han
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (02) : 1063 - 1083