A parallel hybrid krill herd algorithm for feature selection

被引:30
作者
Abualigah, Laith [1 ]
Alsalibi, Bisan [2 ]
Shehab, Mohammad [3 ]
Alshinwan, Mohammad [1 ]
Khasawneh, Ahmad M. [1 ]
Alabool, Hamzeh [4 ]
机构
[1] Amman Arab Univ, Fac Comp Sci & Informat, Amman 11953, Jordan
[2] Univ Sains Malaysia, Sch Comp Sci, George Town, Malaysia
[3] Aqaba Univ Technol, Comp Sci Dept, Aqaba, Jordan
[4] Saudi Elect Univ, Coll Comp & Informat, Abha, Saudi Arabia
关键词
Feature selection; Document clustering; Parallel membrane computing; Krill herd algorithm; Local search; Optimization problem; WHALE OPTIMIZATION ALGORITHM; TEXT FEATURE-SELECTION; ARTIFICIAL BEE COLONY; DIMENSION REDUCTION; COMBINATION; STRATEGY;
D O I
10.1007/s13042-020-01202-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel feature selection method is introduced to tackle the problem of high-dimensional features in the text clustering application. Text clustering is a prevailing direction in big text mining; in this manner, documents are grouped into cohesive groups by using neatly selected informative features. Swarm-based optimization techniques have been widely used to select the relevant text features and shown promising results on multi-sized datasets. The performance of traditional optimization algorithms tends to fail miserably when using large-scale datasets. A novel parallel membrane-inspired framework is proposed to enhance the performance of the krill herd algorithm combined with the swap mutation strategy (MHKHA). In which the krill herd algorithm is hybridized the swap mutation strategy and incorporated within the parallel membrane framework. Finally, the k-means technique is employed based on the results of feature selection-based Krill Herd Algorithm to cluster the documents. Seven benchmark datasets of various characterizations are used. The results revealed that the proposed MHKHA produced superior results compared to other optimization methods. This paper presents an alternative method for the text mining community through cohesive and informative features.
引用
收藏
页码:783 / 806
页数:24
相关论文
共 46 条
[1]   RETRACTED: A hybrid whale optimization algorithm based on local search strategy for the permutation flow shop scheduling problem (Retracted article. See vol. 128, pg. 567, 2022) [J].
Abdel-Basset, Mohamed ;
Manogaran, Gunasekaran ;
El-Shahat, Doaa ;
Mirjalili, Seyedali .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 85 :129-145
[2]  
Abualigah L, 2018, INNOVATIVE COMPUTING, DOI [10.1007/978-3-319-66984-7_18, DOI 10.1007/978-3-319-66984-7_18]
[3]  
Abualigah L.M., 2016, 2016 7 INT C COMP SC, P1
[4]   A novel hybrid antlion optimization algorithm for multi-objective task scheduling problems in cloud computing environments [J].
Abualigah, Laith ;
Diabat, Ali .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (01) :205-223
[5]  
Abualigah L, 2020, NEURAL COMPUT APPL, V32, P12381, DOI [10.1007/s00521-020-04839-1, 10.1007/s00521-020-05107-y]
[6]  
Abualigah LM, 2019, EAI SPRINGER INNOVAT, P205, DOI 10.1007/978-3-319-96451-5_9
[7]   A combination of objective functions and hybrid Krill herd algorithm for text document clustering analysis [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 73 :111-125
[8]   A new feature selection method to improve the document clustering using particle swarm optimization algorithm [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 25 :456-466
[9]   Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin .
JOURNAL OF SUPERCOMPUTING, 2017, 73 (11) :4773-4795
[10]   Feature Selection with β-Hill climbing Search for Text Clustering Application [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Al-Betar, Mohammed Azmi ;
Alyasseri, Zaid Abdi Alkareem ;
Alomari, Osama Ahmad ;
Hanandeh, Essam Said .
2017 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT), 2017, :22-27