A novel hybrid feature selection method considering feature interaction in neighborhood rough set

被引:73
作者
Wan, Jihong [1 ,2 ]
Chen, Hongmei [1 ,2 ]
Yuan, Zhong [1 ,2 ]
Li, Tianrui [1 ,2 ]
Yang, Xiaoling [1 ,2 ]
Sang, BinBin [1 ,2 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Sichuan, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu 611756, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Neighborhood rough set; Interaction feature selection; Feature correlations; Multi-neighborhood calculation; Uncertainty measures; Hybrid data; MUTUAL INFORMATION; ATTRIBUTE REDUCTION; UNCERTAINTY MEASURES; MAX-RELEVANCE; ALGORITHM; ENTROPY; CLASSIFICATION; DEPENDENCY;
D O I
10.1016/j.knosys.2021.107167
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interaction between features can provide essential information that affects the performances of learning models. Nevertheless, most feature selection methods do not take interaction into account in feature correlations calculation. In this work, to solve the problem of dimensional reduction in hybrid data with uncertainty and noise, a novel feature selection method is proposed considering the characteristic of interaction in the neighborhood rough set. First of all, the multi-neighborhood radii set for hybrid data is obtained according to the distribution characteristics of features. Then, considering the ubiquity of interactive features, the feature correlations are redefined via employing various neighborhood information uncertainty measures. Furthermore, a new objective evaluation function of the interactive selection of hybrid features is developed, which is called the Max-Relevance min Redundancy Max-Interaction (MRmRMI). Finally, a novel interaction feature selection algorithm based on neighborhood conditional mutual information (NCMI_IFS) is designed. To evaluate the performance of the proposed algorithm, we compare it with other eight representative feature selection algorithms on twenty public datasets. Experimental results on four different classifiers show that the NCMI_IFS algorithm has higher classification performance and is significantly effective. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 55 条
  • [21] Multi-label feature selection based on neighborhood mutual information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Chen, Jinkun
    Duan, Jie
    [J]. APPLIED SOFT COMPUTING, 2016, 38 : 244 - 256
  • [22] Online multi-label streaming feature selection based on neighborhood rough set
    Liu, Jinghua
    Lin, Yaojin
    Li, Yuwen
    Weng, Wei
    Wu, Shunxiang
    [J]. PATTERN RECOGNITION, 2018, 84 : 273 - 287
  • [23] Neighborhood mutual information and its application on hyperspectral band selection for classification
    Liu, Yao
    Xie, Hong
    Chen, Yuehua
    Tan, Kezhu
    Wang, Liguo
    Xie, Wu
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2016, 157 : 140 - 151
  • [24] Quick attribute reduct algorithm for neighborhood rough set model
    Liu Yong
    Huang Wenliang
    Jiang Yunliang
    Zeng Zhiyong
    [J]. INFORMATION SCIENCES, 2014, 271 : 65 - 81
  • [25] A hybrid feature selection algorithm for gene expression data classification
    Lu, Huijuan
    Chen, Junying
    Yan, Ke
    Jin, Qun
    Xue, Yu
    Gao, Zhigang
    [J]. NEUROCOMPUTING, 2017, 256 : 56 - 62
  • [26] High-dimensional hybrid feature selection using interaction information-guided search
    Nakariyakul, Songyot
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 145 : 59 - 66
  • [27] Semi-supervised neighborhood discrimination index for feature selection
    Pang, Qing-Qing
    Zhang, Li
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 204
  • [28] Theoretical evaluation of feature selection methods based on mutual information
    Pascoal, Claudia
    Oliveira, M. Rosario
    Pacheco, Antonio
    Valadas, Rui
    [J]. NEUROCOMPUTING, 2017, 226 : 168 - 181
  • [29] Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
    Peng, HC
    Long, FH
    Ding, C
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) : 1226 - 1238
  • [30] Incremental feature selection for dynamic hybrid data using neighborhood rough set
    Shu, Wenhao
    Qian, Wenbin
    Xie, Yonghong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 194