Framework for efficient feature selection in genetic algorithm based data mining

被引:78
作者
Sikora, Riyaz
Piramuthu, Selwyn
机构
[1] Univ Texas, Dept Informat Syst, Arlington, TX 76019 USA
[2] Univ Florida, Dept Informat & Decis Sci, Gainesville, FL 32611 USA
关键词
genetic algorithms; rule learning; knowledge discover; data mining; evolutionary algorithms;
D O I
10.1016/j.ejor.2006.02.040
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present the design of more effective and efficient genetic algorithm based data mining techniques that use the concepts of feature selection. Explicit feature selection is traditionally done as a wrapper approach where every candidate feature subset is evaluated by executing the data mining algorithm on that subset. In this article we present a GA for doing both the tasks of mining and feature selection simultaneously by evolving a binary code along side the chromosome structure used for evolving the rules. We then present a wrapper approach to feature selection based on Hausdorff distance measure. Results from applying the above techniques to a real world data mining problem show that combining both the feature selection methods provides the best performance in terms of prediction accuracy and computational efficiency. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:723 / 737
页数:15
相关论文
共 31 条
  • [1] AHMED A, 2001, P INT S SIGN PROC IT, P477
  • [2] Alt H., 1991, P 7 ANN S COMP GEOM, P186
  • [3] Angeline P.J., 1995, IEEE PROCEDEENG COMP, P152
  • [4] [Anonymous], MACH LEARN P
  • [5] BABCOCK C, 1994, COMPUTER WORLD 0906
  • [6] USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING
    BATTITI, R
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04): : 537 - 550
  • [7] BENITEZ JM, 2001, P IFSA WORLD C 20 NA, V2, P1003
  • [8] BRADLEY PS, 1998, INFORMS J COMPUTING, V10
  • [9] CHAMBLESS B, 2001, P INT JOINT C NEUR N, V2, P1443
  • [10] Feature selection in web applications by ROC inflections and powerset pruning
    Coetzee, FM
    Glover, E
    Lawrence, S
    Giles, CL
    [J]. 2001 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2001, : 5 - 14