A novel filter feature selection algorithm based on relief

被引:36
作者
Cui, Xueting [1 ,2 ]
Li, Ying [1 ,2 ]
Fan, Jiahao [1 ,2 ]
Wang, Tan [3 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
[3] Space Technol Jilin Ltd Co, Jilin, Jilin, Peoples R China
关键词
Relief; ReliefF; Neighbor search; Feature selection; Classification; NEURAL-NETWORK; OPTIMIZATION; INFORMATION; MACHINE;
D O I
10.1007/s10489-021-02659-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Relief algorithm is a feature selection algorithm that uses the nearest neighbor to weight attributes. However, Relief only considers the correlation between features, which leads to a low classification accuracy on noisy datasets whose interaction effect is weak. To overcome the weaknesses of Relief, a novel feature selection algorithm, named Multidirectional Relief (MRelief), is proposed. The MRelief algorithm includes four improvements. First, the multidirectional neighbor search method, which finds all neighbors within a distance threshold from different orientations, is included to obtain regularly distributed neighbors. Therefore, the weights provided by MRelief are more accurate than those provided by Relief. Second, a novel objective function that incorporates the instances' force coefficients is introduced to reduce the influence of noise. Thus, the new objective function improves the classification accuracy of MRelief. Third, subset generation is introduced to the MRelief algorithm and combined with the maximum Pearson maximum distance (MPMD) to generate a promising candidate subset for feature selection. Finally, a novel multiclass margin definition is proposed and introduced to the MRelief algorithm to handle multiclass data. As demonstrated by extensive experiments on eleven UCI datasets and eleven real-world gene expression benchmarking datasets, MRelief is significantly better than other algorithms including LPLIR, ReliefF, LLH-Relief, MultiSURF, MSLIR-NN, MRMR, MPMD and STIR in our study.
引用
收藏
页码:5063 / 5081
页数:19
相关论文
共 42 条
  • [1] A novel Whale Optimization Algorithm integrated with Nelder-Mead simplex for multi-objective optimization problems
    Abdel-Basset, Mohamed
    Mohamed, Reda
    Mirjalili, Seyedali
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 212
  • [2] Fostering interpretability of data mining models through data perturbation
    Belkoura, Seddik
    Zanin, Massimiliano
    LaTorre, Antonio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 137 : 191 - 201
  • [3] TRAINING A 3-NODE NEURAL NETWORK IS NP-COMPLETE
    BLUM, AL
    RIVEST, RL
    [J]. NEURAL NETWORKS, 1992, 5 (01) : 117 - 127
  • [4] A whale optimization algorithm with chaos mechanism based on quasi-opposition for global optimization problems
    Chen, Hui
    Li, Weide
    Yang, Xuan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158
  • [5] A novel bacterial foraging optimization algorithm for feature selection
    Chen, Yu-Peng
    Li, Ying
    Wang, Gang
    Zheng, Yue-Feng
    Xu, Qian
    Fan, Jia-Hao
    Cui, Xue-Ting
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 83 : 1 - 17
  • [6] Rapid building detection using machine learning
    Cohen, Joseph Paul
    Ding, Wei
    Kuhlman, Caitlin
    Chen, Aijun
    Di, Liping
    [J]. APPLIED INTELLIGENCE, 2016, 45 (02) : 443 - 457
  • [7] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297
  • [8] Learning to construct knowledge bases from the World Wide Web
    Craven, M
    DiPasquo, D
    Freitag, D
    McCallum, A
    Mitchell, T
    Nigam, K
    Slattery, S
    [J]. ARTIFICIAL INTELLIGENCE, 2000, 118 (1-2) : 69 - 113
  • [9] A Hybrid Improved Dragonfly Algorithm for Feature Selection
    Cui, Xueting
    Li, Ying
    Fan, Jiahao
    Wang, Tan
    Zheng, Yuefeng
    [J]. IEEE ACCESS, 2020, 8 : 155619 - 155629
  • [10] Demsar J, 2006, J MACH LEARN RES, V7, P1