A Rough Hypercuboid Approach for Feature Selection in Approximation Spaces

Cited by: 56
Authors
Maji, Pradipta [1]
Affiliations
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700108, W Bengal, India
Keywords
Pattern recognition; data mining; feature selection; rough sets; rough hypercuboid approach; REDUCTION; SETS;
DOI
10.1109/TKDE.2012.242
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The selection of relevant and significant features is an important problem, particularly for data sets with a large number of features. In this regard, a new feature selection algorithm is presented based on a rough hypercuboid approach. It selects a set of features from a data set by maximizing the relevance, dependency, and significance of the selected features. By introducing the concept of the hypercuboid equivalence partition matrix, a novel representation of the degree of dependency of sample categories on features is proposed to measure the relevance, dependency, and significance of features in approximation spaces. The equivalence partition matrix also offers an efficient way to calculate many more quantitative measures to describe the inexactness of approximate classification. Several quantitative indices are introduced based on the rough hypercuboid approach for evaluating the performance of the proposed method. The superiority of the proposed method over other feature selection methods, in terms of computational complexity and classification accuracy, is established extensively on various real-life data sets of different sizes and dimensions.
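The abstract does not spell out how the hypercuboid equivalence partition matrix is built, so the following is only a minimal sketch under stated assumptions, not the paper's implementation. It assumes that each class defines a per-feature interval [min, max] over its own samples, that a matrix entry is 1 when a sample's value falls inside that interval, that matrices of several features combine by element-wise AND, and that dependency is the fraction of samples covered by at most one class. All function names and the toy data are hypothetical.

```python
from typing import List, Sequence


def partition_matrix(values: Sequence[float], labels: Sequence[int]) -> List[List[int]]:
    """Assumed construction: one row per class, one column per sample.
    An entry is 1 when the sample's value lies inside the [min, max]
    interval spanned by that class's own samples on this feature."""
    matrix = []
    for c in sorted(set(labels)):
        member_vals = [v for v, y in zip(values, labels) if y == c]
        lo, hi = min(member_vals), max(member_vals)
        matrix.append([1 if lo <= v <= hi else 0 for v in values])
    return matrix


def combine(a: List[List[int]], b: List[List[int]]) -> List[List[int]]:
    """Assumed rule for a feature pair: element-wise AND of the two matrices."""
    return [[x & y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def dependency(matrix: List[List[int]]) -> float:
    """Assumed dependency: 1 minus the fraction of samples that fall
    inside the interval of more than one class (ambiguous samples)."""
    n = len(matrix[0])
    ambiguous = sum(1 for j in range(n) if sum(row[j] for row in matrix) > 1)
    return 1 - ambiguous / n


# Hypothetical toy data: six samples, two classes, two features.
labels = [0, 0, 0, 1, 1, 1]
f1 = [1, 2, 3, 3, 4, 5]
f2 = [1, 4, 2, 3, 5, 6]

m1 = partition_matrix(f1, labels)
m2 = partition_matrix(f2, labels)

gamma_f1 = dependency(m1)                  # relevance of f1 alone
gamma_pair = dependency(combine(m1, m2))   # dependency on {f1, f2}
significance_f2 = gamma_pair - gamma_f1    # gain from adding f2 to f1
```

On this toy data, f1 alone leaves two of six samples ambiguous (dependency 4/6), while the pair leaves one (5/6), so adding f2 contributes 1/6. A greedy selector in this spirit would repeatedly add the feature with the best such gain, though the exact criterion used in the paper is not given in the abstract.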
Pages: 16-29
Number of pages: 14