Numerical attribute reduction based on neighborhood granulation and rough approximation

Cited by: 98
Authors
College of Energy Science and Engineering, Harbin Institute of Technology, Harbin 150001, China [1]
Affiliations
[1] College of Energy Science and Engineering, Harbin Institute of Technology
Source
Ruan Jian Xue Bao (Journal of Software) | 2008 / 3 / pp. 640-649
Keywords
Attribute reduction; Feature selection; Granular computing; Neighborhood relation; Numerical feature; Rough set; Variable precision
DOI
10.3724/SP.J.1001.2008.00640
Abstract
To handle numerical features, a neighborhood rough set model is proposed based on the definitions of the δ-neighborhood and neighborhood relations in metric spaces. Each object in the universe is assigned a neighborhood subset, called a neighborhood granule. The family of neighborhood granules forms a concept system that approximates an arbitrary subset of the universe with two unions of neighborhood granules: the lower approximation and the upper approximation. On this basis, the concepts of neighborhood information systems and neighborhood decision tables are introduced, and the properties of the model are discussed. Furthermore, the dependency function is used to evaluate the significance of numerical attributes, and a forward greedy numerical attribute reduction algorithm is constructed. Experimental results on UCI data sets show that the neighborhood model can select a small number of attributes while keeping, or even improving, classification power.
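The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the Euclidean metric, the stopping tolerance `eps`, the tie-breaking rule, the function names, and the toy data are all assumptions made for illustration. The dependency is computed here as the fraction of objects whose δ-neighborhood contains only objects of their own decision class, which is one common reading of the dependency function for neighborhood decision tables.

```python
import numpy as np

def neighborhoods(X, delta):
    """Boolean matrix N where N[i, j] is True when object j lies in the
    delta-neighborhood of object i (Euclidean distance <= delta; assumed metric)."""
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    return dist <= delta

def approximations(X, target, delta):
    """Lower and upper approximation of a subset given as a boolean mask.
    Lower: neighborhood fully inside the subset; upper: neighborhood meets it."""
    N = neighborhoods(X, delta)
    lower = np.array([target[N[i]].all() for i in range(len(X))])
    upper = np.array([target[N[i]].any() for i in range(len(X))])
    return lower, upper

def dependency(X, y, attrs, delta):
    """Fraction of objects whose neighborhood on `attrs` is pure in its class."""
    if not attrs:
        return 0.0
    N = neighborhoods(X[:, attrs], delta)
    consistent = [np.all(y[N[i]] == y[i]) for i in range(len(y))]
    return float(np.mean(consistent))

def forward_greedy_reduct(X, y, delta, eps=1e-9):
    """Add the attribute with the largest dependency at each step and stop
    when no attribute raises the dependency by more than eps (assumed rule)."""
    selected, best = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        scores = [(dependency(X, y, selected + [a], delta), a) for a in remaining]
        # Ties are broken in favor of the lower attribute index.
        score, attr = max(scores, key=lambda t: (t[0], -t[1]))
        if score - best <= eps:
            break
        selected.append(attr)
        remaining.remove(attr)
        best = score
    return selected, best

# Toy data (hypothetical): attribute 0 separates the classes, attribute 1 does not.
X = np.array([[0.10, 0.50], [0.20, 0.10], [0.15, 0.90],
              [0.80, 0.45], [0.90, 0.15], [0.85, 0.95]])
y = np.array([0, 0, 0, 1, 1, 1])
print(forward_greedy_reduct(X, y, delta=0.2))
```

On this toy table the sketch keeps only attribute 0, since it alone already yields a dependency of 1.0. In practice the attributes would typically be scaled to a common range before a single δ is applied to all of them; that preprocessing step is likewise an assumption here.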
Pages: 640-649
Page count: 9
Related papers
24 in total
  • [1] Pawlak Z., Rough Sets: Theoretical Aspects of Reasoning about Data, (1991)
  • [2] Wang J., Wang R., Miao D.Q., Data enriching based on rough set theory, Chinese Journal of Computers, 21, 5, pp. 393-400, (1998)
  • [3] Chang L.Y., Wang G.Y., Wu Y., An approach for attribute reduction and rule generation based on rough set theory, Journal of Software, 10, 11, pp. 1207-1211, (1999)
  • [4] Shi Y., Sun Y.F., Zuo C., Spatial data classification based on rough set, Journal of Software, 11, 5, pp. 673-678, (2000)
  • [5] Yu D.R., Hu Q.H., Bao W., Combining rough set methodology and fuzzy clustering for knowledge discovery from quantitative data, Proc. of the Chinese Society for Electrical Engineering, 24, 6, pp. 205-210, (2004)
  • [6] Zhu Y.L., Wu L.Z., Li X.Y., Synthesized diagnosis on transformer faults based on Bayesian classifier and rough set, Proc. of the Chinese Society for Electrical Engineering, 25, 10, pp. 159-165, (2005)
  • [7] Wang Y.Q., Lu F.C., Li H.M., Synthetic fault diagnosis method of power transformer based on rough set theory and Bayesian network, Proc. of the Chinese Society for Electrical Engineering, 26, 8, pp. 137-141, (2006)
  • [8] Sun Q.Y., Zhang H.G., Fault diagnose algorithm of distribution system by continuous signals based on rough sets, Proc. of the Chinese Society for Electrical Engineering, 26, 11, pp. 156-161, (2006)
  • [9] Xie H., Cheng H.Z., Niu D.X., Discretization of continuous attributes in rough set theory based on information entropy, Chinese Journal of Computers, 28, 9, pp. 1570-1574, (2005)
  • [10] Jensen R., Shen Q., Semantics-preserving dimensionality reduction: Rough and fuzzy-rough-based approaches, IEEE Trans. on Knowledge and Data Engineering, 16, 12, pp. 1457-1471, (2004)