A rough set approach to feature selection based on ant colony optimization

Times cited: 191
Authors
Chen, Yumin [1]
Miao, Duoqian [1]
Wang, Ruizhi [1]
Affiliations
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
Funding
Specialized Research Fund for the Doctoral Program of Higher Education; National Natural Science Foundation of China
Keywords
Rough sets; Ant colony optimization; Feature selection; Mutual information; Data mining; Dimensionality reduction; Algorithms
DOI
10.1016/j.patrec.2009.10.013
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Rough set theory is one of the effective methods for feature selection, and it preserves the meaning of the features. The essence of the rough set approach to feature selection is to find a subset of the original features. Since finding a minimal subset of the features is an NP-hard problem, it is necessary to investigate effective and efficient heuristic algorithms. Ant colony optimization (ACO) has been successfully applied to many difficult combinatorial problems such as quadratic assignment, traveling salesman, and scheduling. It is particularly attractive for feature selection because no heuristic information can guide the search to the optimal minimal subset every time, whereas ants can discover the best feature combinations as they traverse the graph. In this paper, we propose a new rough set approach to feature selection based on ACO, which adopts mutual-information-based feature significance as heuristic information. A novel feature selection algorithm is also given. Jensen and Shen proposed an ACO-based feature selection approach that starts from a random feature. Our approach starts from the feature core, which reduces the complete graph to a smaller one. To verify the efficiency of our algorithm, experiments are carried out on some standard UCI datasets. The results demonstrate that our algorithm can provide an efficient solution for finding a minimal subset of the features. (C) 2009 Elsevier B.V. All rights reserved.
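The abstract names mutual-information-based feature significance as the heuristic but gives no formula. As a rough illustration only, the sketch below assumes one common definition from the rough set literature: the significance of a candidate feature a, given already selected features R and decision attribute D, is the drop in conditional entropy H(D | R) - H(D | R ∪ {a}). The function names, the row-as-dict data layout, and the choice of base-2 logarithms are assumptions made for this example, not details taken from the paper.

```python
from collections import Counter
from math import log2


def conditional_entropy(rows, features, decision):
    """Return H(decision | features) in bits for a list of dict rows."""
    n = len(rows)
    # Counts of (feature-value tuple, decision value) and of feature-value tuples alone.
    joint = Counter((tuple(r[f] for f in features), r[decision]) for r in rows)
    cond = Counter(tuple(r[f] for f in features) for r in rows)
    return -sum(c / n * log2(c / cond[key]) for (key, _), c in joint.items())


def significance(rows, candidate, selected, decision):
    """Assumed heuristic: entropy of the decision removed by adding `candidate`."""
    before = conditional_entropy(rows, list(selected), decision)
    after = conditional_entropy(rows, list(selected) + [candidate], decision)
    return before - after
```

Under this assumed definition, a feature that fully determines the decision on a balanced two-class table scores 1 bit, and a feature independent of the decision scores 0. In an ACO setting, such a score could serve as the heuristic desirability of moving to a candidate feature.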
Pages: 226-233
Page count: 8
Related papers
36 in total
  • [1] [Anonymous], ROUGH SETS KNOWLEDGE
  • [2] [Anonymous], THESIS U EDINBURGH
  • [3] [Anonymous], 1999, Swarm Intelligence
  • [4] [Anonymous], 1998, Feature Extraction, Construction and Selection: A Data Mining Perspective
  • [5] Dash M., 1997, Intelligent Data Analysis, V1
  • [6] Consistency-based search in feature selection
    Dash, M
    Liu, HA
    [J]. ARTIFICIAL INTELLIGENCE, 2003, 151 (1-2) : 155 - 176
  • [7] Deogun J., 1998, Journal of ASIS, V49, P403
  • [8] Ant system: Optimization by a colony of cooperating agents
    Dorigo, M
    Maniezzo, V
    Colorni, A
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1996, 26 (01) : 29 - 41
  • [9] Dorigo M., 1999, P C EV COMP
  • [10] Duan Q., 2007, J COMPUTATIONAL INFO, V3, P1067