A novel hybrid feature selection method considering feature interaction in neighborhood rough set

被引:73
作者
Wan, Jihong [1 ,2 ]
Chen, Hongmei [1 ,2 ]
Yuan, Zhong [1 ,2 ]
Li, Tianrui [1 ,2 ]
Yang, Xiaoling [1 ,2 ]
Sang, BinBin [1 ,2 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Sichuan, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu 611756, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Neighborhood rough set; Interaction feature selection; Feature correlations; Multi-neighborhood calculation; Uncertainty measures; Hybrid data; MUTUAL INFORMATION; ATTRIBUTE REDUCTION; UNCERTAINTY MEASURES; MAX-RELEVANCE; ALGORITHM; ENTROPY; CLASSIFICATION; DEPENDENCY;
D O I
10.1016/j.knosys.2021.107167
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interaction between features can provide essential information that affects the performances of learning models. Nevertheless, most feature selection methods do not take interaction into account in feature correlations calculation. In this work, to solve the problem of dimensional reduction in hybrid data with uncertainty and noise, a novel feature selection method is proposed considering the characteristic of interaction in the neighborhood rough set. First of all, the multi-neighborhood radii set for hybrid data is obtained according to the distribution characteristics of features. Then, considering the ubiquity of interactive features, the feature correlations are redefined via employing various neighborhood information uncertainty measures. Furthermore, a new objective evaluation function of the interactive selection of hybrid features is developed, which is called the Max-Relevance min Redundancy Max-Interaction (MRmRMI). Finally, a novel interaction feature selection algorithm based on neighborhood conditional mutual information (NCMI_IFS) is designed. To evaluate the performance of the proposed algorithm, we compare it with other eight representative feature selection algorithms on twenty public datasets. Experimental results on four different classifiers show that the NCMI_IFS algorithm has higher classification performance and is significantly effective. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 55 条
  • [1] USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING
    BATTITI, R
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04): : 537 - 550
  • [2] Feature selection using Joint Mutual Information Maximisation
    Bennasar, Mohamed
    Hicks, Yulia
    Setchi, Rossitza
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8520 - 8532
  • [3] Comparative metabolomics of estrogen receptor positive and estrogen receptor negative breast cancer: alterations in glutamine and beta-alanine metabolism
    Budczies, Jan
    Brockmoeller, Scarlet F.
    Mueller, Berit M.
    Barupal, Dinesh K.
    Richter-Ehrenstein, Christiane
    Kleine-Tebbe, Anke
    Griffin, Julian L.
    Oresic, Matej
    Dietel, Manfred
    Denkert, Carsten
    Fiehn, Oliver
    [J]. JOURNAL OF PROTEOMICS, 2013, 94 : 279 - 288
  • [4] A novel filter feature selection method using rough set for short text data
    Cekik, Rasim
    Uysal, Alper Kursat
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [5] Parallel attribute reduction in dominance-based neighborhood rough set
    Chen, Hongmei
    Li, Tianrui
    Cai, Yong
    Luo, Chuan
    Fujita, Hamido
    [J]. INFORMATION SCIENCES, 2016, 373 : 351 - 368
  • [6] Measures of uncertainty for neighborhood rough sets
    Chen, Yumin
    Xue, Yu
    Ma, Ying
    Xu, Feifei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 120 : 226 - 235
  • [7] Gene selection for tumor classification using neighborhood rough sets and entropy measures
    Chen, Yumin
    Zhang, Zunjun
    Zheng, Jianzhong
    Ma, Ying
    Xue, Yu
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 67 : 59 - 68
  • [8] Feature selection with redundancy-complementariness dispersion
    Chen, Zhijun
    Wu, Chaozhong
    Zhang, Yishi
    Huang, Zhen
    Ran, Bin
    Zhong, Ming
    Lyu, Nengchao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 89 : 203 - 217
  • [9] Novel multi-label feature selection via label symmetric uncertainty correlation learning and feature redundancy evaluation
    Dai, Jianhua
    Chen, Jiaolong
    Liu, Ye
    Hu, Hu
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 207
  • [10] Key energy-consumption feature selection of thermal power systems based on robust attribute reduction with rough sets
    Dong Lianjie
    Chen Degang
    Wang Ningling
    Lu Zhanhui
    [J]. INFORMATION SCIENCES, 2020, 532 : 61 - 71