An Exponential Monte-Carlo algorithm for feature selection problems

被引:23
作者
Abdullah, Salwani [1 ]
Sabar, Nasser R. [2 ]
Nazri, Mohd Zakree Ahmad [1 ]
Ayob, Masri [1 ]
机构
[1] Univ Kebangsaan Malaysia, CAIT, Data Min & Optimizat Res Grp DMO, Ukm Bangi 43600, Selangor, Malaysia
[2] Univ Nottingham, Semenyih 43500, Selangor, Malaysia
关键词
Feature selection; Exponential Monte-Carlo; Local search; GREAT DELUGE ALGORITHM; ATTRIBUTE REDUCTION; ROUGH;
D O I
10.1016/j.cie.2013.10.009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Feature selection problems (FS) can be defined as the process of eliminating redundant features while avoiding information loss. Due to that fact that FS is an NP-hard problem, heuristic and meta-heuristic approaches have been widely used by researchers. In this work, we proposed an Exponential Monte-Carlo algorithm (EMC-FS) for the feature selection problem. EMC-FS is a meta-heuristic approach which is quite similar to a simulated annealing algorithm. The difference is that no cooling schedule is required. Improved solutions are accepted and worse solutions are adaptively accepted based on the quality of the trial solution, the search time and the number of consecutive non-improving iterations. We have evaluated our approach against the latest methodologies in the literature on standard benchmark problems. The quality of the obtained subset of features has also been evaluated in terms of the number of generated rules (descriptive patterns) and classification accuracy. Our research demonstrates that our approach produces some of the best known results. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:160 / 167
页数:8
相关论文
共 29 条
[1]  
Abdullah S., 2010, Proceedings 10th International Conference on Intelligent Systems Design and Applications (ISDA 2010), P1032, DOI 10.1109/ISDA.2010.5687052
[2]  
Abdullah S., 2005, Proceedings of MISTA 2005: The 2nd Multidisciplinary Conference on Scheduling: Theory and Applications . 18-21 July, P413
[3]  
Abdullah S, 2010, COMM COM INF SC, V118, P189
[4]  
Abdullah S, 2007, OPER RES COMPUT SCI, V39, P153
[5]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[6]  
Arajy Yahya Z., 2010, Proceedings 10th International Conference on Intelligent Systems Design and Applications (ISDA 2010), P1015, DOI 10.1109/ISDA.2010.5687053
[7]  
Ayob M., 2003, P INT C INT TECHN IN, P17
[8]  
Blake C. L., 1998, Uci repository of machine learning databases
[9]   Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods [J].
Borra, Simone ;
Di Ciaccio, Agostino .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) :2976-2989
[10]   Tabu search for attribute reduction in rough set theory [J].
Hedar, Abdel-Rahman ;
Wang, Jue ;
Fukushima, Masao .
SOFT COMPUTING, 2008, 12 (09) :909-918