Feature selection based on maximal neighborhood discernibility

被引:1
作者
Changzhong Wang
Qiang He
Mingwen Shao
Qinghua Hu
机构
[1] Bohai university,Department of Mathematics
[2] Beijing University of Civil Engineering and Architecture,College of Science
[3] Chinese University of Petroleum,College of Computer and Communication Engineering
[4] Tianjin University,School of Computer Science and Technology
来源
International Journal of Machine Learning and Cybernetics | 2018年 / 9卷
关键词
Feature selection; Neighborhood; Rough sets; Discernibility matrix;
D O I
暂无
中图分类号
学科分类号
摘要
Neighborhood rough set has been proven to be an effective tool for feature selection. In this model, the positive region of decision is used to evaluate the classification ability of a subset of candidate features. It is computed by just considering consistent samples. However, the classification ability is not only related to consistent samples, but also to the ability to discriminate samples with different decisions. Hence, the dependency function, constructed by the positive region, cannot reflect the actual classification ability of a feature subset. In this paper, we propose a new feature evaluation function for feature selection by using discernibility matrix. We first introduce the concept of neighborhood discernibility matrix to characterize the classification ability of a feature subset. We then present the relationship between distance matrix and discernibility matrix, and construct a feature evaluation function based on discernibility matrix. It is used to measure the significance of a candidate feature. The proposed model not only maintains the maximal dependency function, but also can select features with the greatest discernibility ability. The experimental results show that the proposed method can be used to deal with heterogeneous data sets. It is able to find effective feature subsets in comparison with some existing algorithms.
引用
收藏
页码:1929 / 1940
页数:11
相关论文
共 171 条
  • [1] Guyon I(2003)An introduction to variable and feature selection J Mach Learn Res 3 1157-1182
  • [2] Elisseeff A(2002)Input feature selection by mutual information based on Parzen window IEEE Trans Pattern Anal Mach Intell 24 1667-1671
  • [3] Kwak N(2010)A distance measure approach to exploring the rough set boundary region for attribute reduction IEEE Trans Knowl Data Eng 22 305-317
  • [4] Choi C-H(2015)Fuzzy granular structure distance IEEE Trans Fuzzy Syst 23 2245-2259
  • [5] Parthalain N(2013)Entropy measures and granularity measures for set-valued information systems Inf Sci 240 72-82
  • [6] Shen Q(1998)Uncertainty measures of rough set prediction Artif Intell 106 109-137
  • [7] ldquo RJ(2006)Fuzzy probabilistic approximation spaces and their information measures IEEE Trans Fuzzy Syst 14 191-201
  • [8] Qian Y(2014)A group incremental approach to feature selection applying rough set technique IEEE Trans Knowl Data Eng 26 294-304
  • [9] Li Y(2017)A novel approach to information fusion in multi-source datasets: a granular computing viewpoint Inf Sci 378 410-423
  • [10] Liang J(2013)Consistency-based search in feature selection Artif Intell 151 155-176