TAGA: Tabu Asexual Genetic Algorithm embedded in a filter/filter feature selection approach for high-dimensional data

被引:34
作者
Salesi, Sadegh [1 ]
Cosma, Georgina [2 ]
Mavrovouniotis, Michalis [3 ]
机构
[1] Nottingham Trent Univ, Sch Sci & Technol, Dept Comp, Nottingham, England
[2] Loughborough Univ, Sch Sci, Dept Comp Sci, Loughborough, Leics, England
[3] Univ Cyprus, Dept Elect & Comp Engn, KIOS Res & Innovat Ctr Excellence, Nicosia, Cyprus
关键词
INFORMATION; RELEVANCE;
D O I
10.1016/j.ins.2021.01.020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is the process of selecting an optimal subset of features required for maintaining or improving the performance of data mining models. Recently, hybrid filter/wrapper feature selection methods have shown promising results for high dimensional data. However, filter/wrapper methods lack of generalisation power, which enables the selected features to be trainable over different classifiers without having to repeat the feature selection process. To address the generalisation power problem, this paper proposes a novel evolutionary-based filter feature selection algorithm that is sequentially hybridised with the Fisher score filter algorithm in a new hybrid framework called filter/filter. The proposed algorithm is based on a long-term memory Tabu Search combined with an Asexual (i.e. mutation-based) Genetic Algorithm (TAGA). TAGA benefits from a new integer-encoded solution representation, a novel mutation operator, a new tabu list encoding scheme, and uses a minimum redundancy maximum relevance information theory-based criterion as the fitness function. Experiments were carried out on various high-dimensional datasets including image, text, and biological data. The goodness of the selected subsets was evaluated using different classifiers and the experimental results demonstrate that TAGA outperforms other conventional and state-of-the-art feature selection algorithms. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:105 / 127
页数:23
相关论文
共 47 条
[1]   An effective asexual genetic algorithm for solving the job shop scheduling problem [J].
Amirghasemi, Mehrdad ;
Zamani, Reza .
COMPUTERS & INDUSTRIAL ENGINEERING, 2015, 83 :123-138
[2]  
[Anonymous], 2011, Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
[3]  
Asuncion Arthur, 2007, Uci machine learning repository
[4]   A review of microarray datasets and applied feature selection methods [J].
Bolon-Canedo, V. ;
Sanchez-Marono, N. ;
Alonso-Betanzos, A. ;
Benitez, J. M. ;
Herrera, F. .
INFORMATION SCIENCES, 2014, 282 :111-135
[5]   Hybrid Framework Using Multiple-Filters and an Embedded Approach for an Efficient Selection and Classification of Microarray Data [J].
Bonilla-Huerta, Edmundo ;
Hernandez-Montiel, Alberto ;
Morales-Caporal, Roberto ;
Arjona-Lopez, Marco .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) :12-26
[6]  
Bonnlander B. V., 1994, 1994 International Symposium on Artificial Neural Networks. ISANN '94. Proceedings, P42
[7]   A simple algorithm for optimization and model fitting: AGA (asexual genetic algorithm) [J].
Canto, J. ;
Curiel, S. ;
Martinez-Gomez, E. .
ASTRONOMY & ASTROPHYSICS, 2009, 501 (03) :1259-1268
[8]   ITERATIVE FEATURE PERTURBATION AS A GENE SELECTOR FOR MICROARRAY DATA [J].
Canul-Reich, Juana ;
Hall, Lawrence O. ;
Goldgof, Dmitry B. ;
Korecki, John N. ;
Eschrich, Steven .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (05)
[9]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[10]  
Cui Y., 2015, Journal of Information Hiding and Multimedia Signal Processing, P154