Incremental neighborhood entropy-based feature selection for mixed-type data under the variation of feature set

被引:11
作者
Shu, Wenhao [1 ]
Qian, Wenbin [2 ]
Xie, Yonghong [3 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Jiangxi, Peoples R China
[2] Jiangxi Agr Univ, Sch Software, Nanchang 330045, Jiangxi, Peoples R China
[3] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Incremental algorithm; Dynamic mixed-type data; Neighborhood rough sets; ATTRIBUTE REDUCTION; ROUGH SET; UPDATING APPROXIMATIONS; DISCERNIBILITY MATRIX; DECISION SYSTEMS; GRANULARITY; ACCELERATOR; ALGORITHM; MODEL;
D O I
10.1007/s10489-021-02526-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is to find relevant features and delete redundant features, which provides a basis for classification problems. In many real-world applications, mixed-type data including missing, numerical, and categorical features are ubiquitous in medical treatment, intrusion detection, traffic analysis and so on. Feature selection from mixed-type data has attracted considerable research attention. The neighborhood rough set model has attracted much attention to select a feature subset when handling with mixed-type data. In this study, we focus on the feature selection process for mixed-type data under the variation of feature set by the utilization of neighborhood rough sets. At first, the hybrid relation is given to define the similarity between objects for the mixed-type data without resorting to the discretization process. On this basis, the neighborhood entropy is given to evaluate the uncertainty of the mixed-type data. When new features may appear while old features are deleted, the updated neighborhood entropy is computed incrementally to reflect the significance of mixed-type features, which is an important step in the dynamic feature selection process. Finally, an efficient incremental feature selection algorithm for selecting a new feature subset is developed when deleting and adding a feature set simultaneously. Experimental results over different real-life data sets have verified the feasibility and efficiency of the proposed algorithm from the perspective of the runtime.
引用
收藏
页码:4792 / 4806
页数:15
相关论文
共 58 条
  • [1] Incremental approaches to updating reducts under dynamic covering granularity
    Cai, Mingjie
    Lang, Guangming
    Fujita, Hamido
    Li, Zhenyu
    Yang, Tian
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 172 : 130 - 140
  • [2] Feature selection for imbalanced data based on neighborhood rough sets
    Chen, Hongmei
    Li, Tianrui
    Fan, Xin
    Luo, Chuan
    [J]. INFORMATION SCIENCES, 2019, 483 : 1 - 20
  • [3] Parallel attribute reduction in dominance-based neighborhood rough set
    Chen, Hongmei
    Li, Tianrui
    Cai, Yong
    Luo, Chuan
    Fujita, Hamido
    [J]. INFORMATION SCIENCES, 2016, 373 : 351 - 368
  • [4] An entropy-based uncertainty measurement approach in neighborhood systems
    Chen, Yumin
    Wu, Keshou
    Chen, Xuhui
    Tang, Chaohui
    Zhu, Qingxin
    [J]. INFORMATION SCIENCES, 2014, 279 : 239 - 250
  • [5] Fuzzy time-series model based on rough set rule induction for forecasting stock price
    Cheng, Ching-Hsue
    Yang, Jun-He
    [J]. NEUROCOMPUTING, 2018, 302 : 33 - 45
  • [6] Maximal-Discernibility-Pair-Based Approach to Attribute Reduction in Fuzzy Rough Sets
    Dai, Jianhua
    Hu, Hu
    Wu, Wei-Zhi
    Qian, Yuhua
    Huang, Debiao
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (04) : 2174 - 2187
  • [7] An Uncertainty Measure for Incomplete Decision Tables and Its Applications
    Dai, Jianhua
    Wang, Wentao
    Xu, Qing
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (04) : 1277 - 1289
  • [8] Uncertainty measurement for interval-valued decision systems based on extended conditional entropy
    Dai, Jianhua
    Wang, Wentao
    Xu, Qing
    Tian, Haowei
    [J]. KNOWLEDGE-BASED SYSTEMS, 2012, 27 : 443 - 450
  • [9] Ensemble feature selection using bi-objective genetic algorithm
    Das, Asit K.
    Das, Sunanda
    Ghosh, Arka
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 123 : 116 - 127
  • [10] Consistency-based search in feature selection
    Dash, M
    Liu, HA
    [J]. ARTIFICIAL INTELLIGENCE, 2003, 151 (1-2) : 155 - 176