Feature selection based on rough set approach, wrapper approach, and binary whale optimization algorithm

Cited by: 77
Authors
Tawhid, Mohamed A. [1]
Ibrahim, Abdelmonem M. [1, 2]
Affiliations
[1] Thompson Rivers Univ, Fac Sci, Dept Math, Stat, Kamloops, BC V2C 0C8, Canada
[2] Al Azhar Univ, Fac Sci, Dept Math, Assiut Branch, Assiut, Egypt
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Feature selection; Classification; Whale optimization algorithm; Rough set theory; Wrapper approach; Logistic regression; ATTRIBUTE REDUCTION; DIFFERENTIAL EVOLUTION; SEARCH; CLASSIFICATION; CANCER; SVM;
DOI
10.1007/s13042-019-00996-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Any approach to the feature selection problem aims to find a subset of the original features. Since finding a minimal subset of the features is an NP-hard problem, practical and efficient heuristic algorithms are needed. The whale optimization algorithm is a recently developed nature-inspired meta-heuristic that imitates the hunting behavior of humpback whales to solve continuous optimization problems. In this paper, we propose a novel binary whale optimization algorithm (BWOA) to solve the feature selection problem. BWOA is especially appealing for feature selection when no heuristic information is available to guide the search toward the optimal minimal subset. Nonetheless, the whales can find the best features as they hunt their prey. Rough set theory (RST) is one of the effective approaches to feature selection. In the first experiment, we use RST with BWOA; in the second experiment, we use a wrapper approach with BWOA on three different classifiers for feature selection. We verify the performance and effectiveness of the proposed algorithm on 32 datasets from the UCI machine learning repository and compare it with several powerful existing algorithms from the literature. Furthermore, we employ two nonparametric statistical tests, the Wilcoxon signed-rank test and the Friedman test, at the 5% significance level. Our results show that the proposed algorithm provides an efficient tool for finding a minimal subset of the features.
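As an illustration of how a rough-set criterion can score a candidate feature subset encoded as a binary vector, the sketch below computes the standard rough-set dependency degree, i.e. the fraction of objects whose equivalence class under the selected features has a single decision value. The abstract does not state the fitness function used with BWOA, so this is only an assumed example of an RST-based score; the function name `dependency_degree` and the toy decision table are hypothetical.

```python
from collections import defaultdict


def dependency_degree(rows, decisions, mask):
    """Rough-set dependency degree: |positive region| / |universe|.

    rows      -- sequence of discrete feature tuples (assumed non-empty)
    decisions -- decision (class) label for each row
    mask      -- binary vector; 1 keeps the feature, 0 drops it
    """
    selected = [i for i, bit in enumerate(mask) if bit]

    # Group objects into equivalence classes by their values
    # on the selected features only.
    classes = defaultdict(list)
    for row, label in zip(rows, decisions):
        key = tuple(row[i] for i in selected)
        classes[key].append(label)

    # A class belongs to the positive region when every object
    # in it carries the same decision label.
    positive = sum(
        len(labels) for labels in classes.values() if len(set(labels)) == 1
    )
    return positive / len(rows)


# Hypothetical toy decision table: the second feature alone
# determines the decision.
rows = [
    (0, 0, 1),
    (0, 1, 1),
    (1, 0, 0),
    (1, 1, 0),
    (0, 0, 0),
]
decisions = [0, 1, 0, 1, 0]

for mask in [(1, 1, 1), (0, 1, 0), (1, 0, 0)]:
    print(mask, sum(mask), dependency_degree(rows, decisions, mask))
```

A binary optimizer would typically prefer masks that keep the dependency degree of the full feature set while selecting as few features as possible; how the paper balances those two goals is not given in the abstract.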
Pages: 573-602
Number of pages: 30