Nature-Inspired Optimization Algorithms for Text Document Clustering-A Comprehensive Analysis

被引:58
作者
Abualigah, Laith [1 ]
Gandomi, Amir H. [2 ]
Elaziz, Mohamed Abd [3 ]
Hussien, Abdelazim G. [4 ]
Khasawneh, Ahmad M. [1 ]
Alshinwan, Mohammad [1 ]
Houssein, Essam H. [5 ]
机构
[1] Amman Arab Univ, Fac Comp Sci & Informat, Amman 11953, Jordan
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ultimo, NSW 2007, Australia
[3] Zagazig Univ, Dept Math, Fac Sci, Zagazig 44519, Egypt
[4] Fayoum Univ, Fac Sci, Faiyum 63514, Egypt
[5] Minia Univ, Fac Comp & Informat, Al Minya 61519, Egypt
关键词
nature-inspired; optimization algorithms; machine learning; optimization problems; text clustering applications; ANT COLONY OPTIMIZATION; ARTIFICIAL NEURAL-NETWORKS; FEATURE-SELECTION METHOD; KRILL HERD ALGORITHM; DIMENSION REDUCTION; FEATURE-EXTRACTION; MULTIVARIATE DATA; SEARCH ALGORITHM; BEE COLONY; CLASSIFICATION;
D O I
10.3390/a13120345
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text clustering is one of the efficient unsupervised learning techniques used to partition a huge number of text documents into a subset of clusters. In which, each cluster contains similar documents and the clusters contain dissimilar text documents. Nature-inspired optimization algorithms have been successfully used to solve various optimization problems, including text document clustering problems. In this paper, a comprehensive review is presented to show the most related nature-inspired algorithms that have been used in solving the text clustering problem. Moreover, comprehensive experiments are conducted and analyzed to show the performance of the common well-know nature-inspired optimization algorithms in solving the text document clustering problems including Harmony Search (HS) Algorithm, Genetic Algorithm (GA), Particle Swarm Optimization (PSO) Algorithm, Ant Colony Optimization (ACO), Krill Herd Algorithm (KHA), Cuckoo Search (CS) Algorithm, Gray Wolf Optimizer (GWO), and Bat-inspired Algorithm (BA). Seven text benchmark datasets are used to validate the performance of the tested algorithms. The results showed that the performance of the well-known nurture-inspired optimization algorithms almost the same with slight differences. For improvement purposes, new modified versions of the tested algorithms can be proposed and tested to tackle the text clustering problems.
引用
收藏
页数:32
相关论文
共 109 条
  • [1] Abualigah L., 2020, Swarm Intelligence for Cloud Computing, P127, DOI [10.1201/9780429020582, DOI 10.1201/9780429020582]
  • [2] Abualigah L.M., 2016, 2016 7 INT C COMPUTE, P12, DOI [DOI 10.1109/CSIT.2016.7549456, DOI 10.1109/CSIT.2016.7549464, 10.1109/CSIT.2016.7549456]
  • [3] Abualigah L.M., 2016, P 1 EAI INT C COMP S
  • [4] Abualigah L.M., 2016, P 2016 IEEE 7 INT C
  • [5] The Arithmetic Optimization Algorithm
    Abualigah, Laith
    Diabat, Ali
    Mirjalili, Seyedali
    Elaziz, Mohamed Abd
    Gandomi, Amir H.
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 376
  • [6] Lightning search algorithm: a comprehensive survey
    Abualigah, Laith
    Abd Elaziz, Mohamed
    Hussien, Abdelazim G.
    Alsalibi, Bisan
    Jalali, Seyed Mohammad Jafar
    Gandomi, Amir H.
    [J]. APPLIED INTELLIGENCE, 2021, 51 (04) : 2353 - 2376
  • [7] A Comprehensive Survey of the Harmony Search Algorithm in Clustering Applications
    Abualigah, Laith
    Diabat, Ali
    Geem, Zong Woo
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [8] Ant Lion Optimizer: A Comprehensive Survey of Its Variants and Applications
    Abualigah, Laith
    Shehab, Mohammad
    Alshinwan, Mohammad
    Mirjalili, Seyedali
    Abd Elaziz, Mohamed
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2021, 28 (03) : 1397 - 1416
  • [9] Multi-verse optimizer algorithm: a comprehensive survey of its results, variants, and applications
    Abualigah, Laith
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16) : 12381 - 12401
  • [10] Salp swarm algorithm: a comprehensive survey
    Abualigah, Laith
    Shehab, Mohammad
    Alshinwan, Mohammad
    Alabool, Hamzeh
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (15) : 11195 - 11215