An efficient binary social spider algorithm for feature selection problem

被引:43
作者
Bas, Emine [1 ]
Ulker, Erkan [2 ]
机构
[1] Selcuk Univ, Kulu Vocat Sch, TR-42075 Konya, Turkey
[2] Konya Tech Univ, Fac Engn & Nat Sci, Dept Comp Engn, TR-42075 Konya, Turkey
关键词
Social spider algorithm; Feature selection; Classifiers; PARTICLE SWARM OPTIMIZATION; FEATURE SUBSET-SELECTION; GENETIC ALGORITHM; CLASSIFICATION; REDUCTION;
D O I
10.1016/j.eswa.2020.113185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The social spider algorithm (SSA) is a heuristic algorithm created on spider behaviors to solve continuous problems. In this paper, firstly a binary version of the social spider algorithm called binary social spider algorithm (BinSSA) is proposed. Currently, there is insufficient focus on the binary version of SSA in the literature. The main part of the binary version is the transfer function. The transfer function is responsible for mapping continuous search space to binary search space. In this study, eight of the transfer functions divided into two families, S-shaped and V-shaped, are evaluated. BinSSA is obtained from SSA, by transforming constant search space to binary search space with eight different transfer functions (S-Shapes and V-Shaped). Thus, eight different variations of BinSSA are formed as BinSSA1, BinSSA2, BinSSA3, BinSSA4, BinSSA5, BinSSA6, BinSSA7, and BinSSA8. For increasing, exploration and exploitation capacity of BinSSA, a crossover operator is added as BinSSA-CR. In secondly, the performances of BinSSA variations are tested on feature selection task. The optimal subset of features is a challenging problem in the process of feature selection. In this paper, according to different comparison criteria (mean of fitness values, the standard deviation of fitness values, the best of fitness values, the worst of fitness values, accuracy values, the mean number of the selected features, CPU time), the best BinSSA variation is detected. In the feature selection problem, the K-nearest neighbor (K-NN) and support vector machines (SVM) are used as classifiers. A detailed study is performed for the fixed parameter values used in the fitness function. BinSSA is evaluated on low-scaled, middle-scaled and large-scaled twenty-one well-known UCI datasets and obtained results are compared with state-of-art algorithms in the literature. Obtained results have shown that BinSSA and BinSSA-CR show superior performance and offer quality and stable solutions. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:25
相关论文
共 69 条
[1]  
Acilar A.M., 2013, THESIS
[2]  
[Anonymous], 2019, Uci machine learning repository: Data sets
[3]   A comparison of feature selection models utilizing binary particle swarm optimization and genetic algorithm in determining coronary artery disease using support vector machine [J].
Babaoglu, Ismail ;
Findik, Oguz ;
Ulker, Erkan .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) :3177-3183
[4]   A new optimization algorithm for solving wind turbine placement problem: Binary artificial algae algorithm [J].
Beskirli, Mehmet ;
Koc, Ismail ;
Hakli, Huseyin ;
Kodaz, Halife .
RENEWABLE ENERGY, 2018, 121 :301-308
[5]   Improved binary PSO for feature selection using gene expression data [J].
Chuang, Li-Yeh ;
Chang, Hsueh-Wei ;
Tu, Chung-Jui ;
Yang, Cheng-Hong .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (01) :29-38
[6]   A swarm optimization algorithm inspired in the behavior of the social-spider [J].
Cuevas, Erik ;
Cienfuegos, Miguel ;
Zaldivar, Daniel ;
Perez-Cisneros, Marco .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) :6374-6384
[7]  
Dash M., 1997, Intelligent Data Analysis, V1
[8]  
DORIGO M, 1990, THESIS
[9]   Social spider algorithm for solving the transmission expansion planning problem [J].
El-bages, M. S. ;
Elsayed, W. T. .
ELECTRIC POWER SYSTEMS RESEARCH, 2017, 143 :235-243
[10]   Modified social spider algorithm for solving the economic dispatch problem [J].
Elsayed, W. T. ;
Hegazy, Y. G. ;
Bendary, F. M. ;
El-bages, M. S. .
ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2016, 19 (04) :1672-1681