AIFSA: A New Approach for Feature Selection and Weighting

被引:0
作者
Fouad, Walid [1 ]
Badr, Amr [1 ]
Farag, Ibrahim [1 ]
机构
[1] Cairo Univ, Fac Comp & Informat, Dept Comp Sci, Cairo, Egypt
来源
INFORMATICS ENGINEERING AND INFORMATION SCIENCE, PT II | 2011年 / 252卷
关键词
Data Mining; Text Classification; Artificial Immune Systems; Clonal Selection; Wrapper Feature Selection; Feature Weighting; TEXT CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a typical search problem where each state in the search space represents a subset of features candidate for selection. Out of n features, 2n subsets can be constructed, hence, an exhaustive search of all subsets becomes infeasible when n is relatively large. Therefore. Feature selection is done by employing a heuristic search algorithm that tries to reach the optimal feature subset. Here, we propose a new wrapper feature selection and weighting algorithm called Artificial Immune Feature Selection Algorithm (AIFSA); the algorithm is based on the metaphors of the Clonal Selection Algorithm (CSA). AIFSA, by itself, is not a classification algorithm, rather it utilizes well-known classifiers to evaluate and promote candidate feature subset. Experiments were performed on textual datasets like WebKB and Syskill&Webert web page ratings. Experimental results showed AIFSA competitive performance over traditional well-known filter feature selection approaches as well as some wrapper approaches existing in literature.
引用
收藏
页码:596 / 609
页数:14
相关论文
共 22 条
[11]   Learning and revising user profiles: The identification of interesting Web sites [J].
Pazzani, M ;
Billsus, D .
MACHINE LEARNING, 1997, 27 (03) :313-331
[12]   Web Page Classification: Features and Algorithms [J].
Qi, Xiaoguang ;
Davison, Brian D. .
ACM COMPUTING SURVEYS, 2009, 41 (02)
[13]   LEARNING LOGICAL DEFINITIONS FROM RELATIONS [J].
QUINLAN, JR .
MACHINE LEARNING, 1990, 5 (03) :239-266
[14]  
Rajalakshmi R, 2011, COMM COM INF SC, V147, P323
[15]  
Schneider KM, 2005, LECT NOTES ARTIF INT, V3721, P252
[16]  
Singh SR, 2010, JMLR WORKSH CONF PRO, V10, P76
[17]  
Slattery S., 2000, PROC ICML00, P895
[18]  
Sun A., 2002, P 4 INT WORKSHOP WEB, P96, DOI DOI 10.1145/584931.584952
[19]  
Twycross J, 2003, ADV SOFT COMP, P33
[20]  
Weber Ingmar, 2009, P 18 INT C WORLD WID