An adaptive memetic algorithm for feature selection using proximity graphs

被引:10
作者
Abu Zaher, Amer [1 ]
Berretta, Regina [1 ]
Noman, Nasimul [1 ]
Moscato, Pablo [1 ]
机构
[1] Univ Newcastle, Sch Elect Engn & Comp, Univ Dr, Callaghan, NSW 2300, Australia
基金
澳大利亚研究理事会;
关键词
evolutionary algorithm; feature selection; memetic algorithm; minimum spanning tree; proximity graph; CLASSIFICATION; PREDICTION; SEARCH; TUMOR;
D O I
10.1111/coin.12196
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a multivariate feature selection method that uses proximity graphs for assessing the quality of feature subsets. Initially, a complete graph is built, where nodes are the samples, and edge weights are calculated considering only the selected features. Next, a proximity graph is constructed on the basis of these weights and different fitness functions, calculated over the proximity graph, to evaluate the quality of the selected feature set. We propose an iterative methodology on the basis of a memetic algorithm for exploring the space of possible feature subsets aimed at maximizing a quality score. We designed multiple local search strategies, and we used an adaptive strategy for automatic balancing between the global and local search components of the memetic algorithm. The computational experiments were carried out using four well-known data sets. We investigate the suitability of three different proximity graphs (minimum spanning tree, k-nearest neighbors, and relative neighborhood graph) for the proposed approach. The selected features have been evaluated using a total of 49 classification methods from an open-source data mining and machine learning package (WEKA). The computational results show that the proposed adaptive memetic algorithm can perform better than traditional genetic algorithms in finding more useful feature sets. Finally, we establish the competitiveness of our approach by comparing it with other well-known feature selection methods.
引用
收藏
页码:156 / 183
页数:28
相关论文
共 40 条
[1]   A New Feature Selection Technique for Load and Price Forecast of Electrical Power Systems [J].
Abedinia, Oveis ;
Amjady, Nima ;
Zareipour, Hamidreza .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2017, 32 (01) :62-74
[2]  
Abu Zaher A, 2015, P 13 AUSTR DAT MIN C
[3]  
Abu Zaher A, 2016, P APPL INF TECHN INN
[4]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[5]  
[Anonymous], P 20 INT JOINT C ART
[6]   An Information Theoretic Clustering Approach for Unveiling Authorship Affinities in Shakespearean Era Plays and Poems [J].
Arefin, Ahmed Shamsul ;
Vimieiro, Renato ;
Riveros, Carlos ;
Craig, Hugh ;
Moscato, Pablo .
PLOS ONE, 2014, 9 (10)
[7]  
Berretta Regina, 2008, V453, P363, DOI 10.1007/978-1-60327-429-6_19
[8]   PROXIMITY GRAPHS: E, δ, Δ, χ AND ω [J].
Bose, Prosenjit ;
Dujmovic, Vida ;
Hurtado, Ferran ;
Iacono, John ;
Langerman, Stefan ;
Meijer, Henk ;
Sacristan, Vera ;
Saumell, Maria ;
Wood, David R. .
INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2012, 22 (05) :439-469
[9]  
Carreira-Perpinan MiguelA., 2005, ADV NEURAL INFORM PR, V17, P225
[10]  
Cotta C, 2004, LECT NOTES COMPUT SC, V3005, P21