Automatic clustering using nature-inspired metaheuristics: A survey

被引:156
作者
Jose-Garcia, Adan [1 ]
Gomez-Flores, Wilfrido [1 ]
机构
[1] Natl Polytech Inst, Ctr Res & Adv Studies, Informat Technol Lab, Ciudad Victoria, Tamaulipas, Mexico
关键词
Cluster analysis; Automatic clustering; Nature-inspired metaheuristics; Single-objective and multiobjective; metaheuristics; GENETIC ALGORITHM; DIFFERENTIAL EVOLUTION; OPTIMIZATION ALGORITHM; PIXEL CLASSIFICATION; VALIDITY MEASURE; TABU SEARCH; PERFORMANCE; INDEXES;
D O I
10.1016/j.asoc.2015.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In cluster analysis, a fundamental problem is to determine the best estimate of the number of clusters; this is known as the automatic clustering problem. Because of lack of prior domain knowledge, it is difficult to choose an appropriate number of clusters, especially when the data have many dimensions, when clusters differ widely in shape, size, and density, and when overlapping exists among groups. In the late 1990s, the automatic clustering problem gave rise to a new era in cluster analysis with the application of nature-inspired metaheuristics. Since then, researchers have developed several new algorithms in this field. This paper presents an up-to-date review of all major nature-inspired metaheuristic algorithms used thus far for automatic clustering. Also, the main components involved during the formulation of metaheuristics for automatic clustering are presented, such as encoding schemes, validity indices, and proximity measures. A total of 65 automatic clustering approaches are reviewed, which are based on single-solution, single-objective, and multiobjective metaheuristics, whose usage percentages are 3%, 69%, and 28%, respectively. Single-objective clustering algorithms are adequate to efficiently group linearly separable clusters. However, a strong tendency in using multiobjective algorithms is found nowadays to address non-linearly separable problems. Finally, a discussion and some emerging research directions are presented. (C) 2016 Published by Elsevier B.V.
引用
收藏
页码:192 / 213
页数:22
相关论文
共 167 条
[1]  
Abubaker A, 2015, PLOS ONE, V10, DOI [10.1371/journal.pone.0130995, 10.1371/journal.pone.0135641]
[2]   Research on particle swarm optimization based clustering: A systematic review of literature and techniques [J].
Alam, Shafiq ;
Dobbie, Gillian ;
Koh, Yun Sing ;
Riddle, Patricia ;
Rehman, Saeed Ur .
SWARM AND EVOLUTIONARY COMPUTATION, 2014, 17 :1-13
[3]   Application of shuffled frog-leaping algorithm on clustering [J].
Amiri, Babak ;
Fathian, Mohammad ;
Maroosi, Ali .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2009, 45 (1-2) :199-209
[4]  
[Anonymous], 2005, Fundamentals of Computational Swarm Intelligence
[5]  
[Anonymous], 1966, Artificial_Intelligence_Through_Simulated Evolution
[6]  
[Anonymous], IEEE J SEL TOP APPL
[7]  
[Anonymous], 2013, INT J COMPUT SCI ENG
[8]   Evolution strategies – A comprehensive introduction [J].
Hans-Georg Beyer ;
Hans-Paul Schwefel .
Natural Computing, 2002, 1 (1) :3-52
[9]   Simulated annealing using a Reversible Jump Markov Chain Monte Carlo algorithm for fuzzy clustering [J].
Bandyopadhyay, S .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) :479-490
[10]   Genetic clustering for automatic evolution of clusters and application to image classification [J].
Bandyopadhyay, S ;
Maulik, U .
PATTERN RECOGNITION, 2002, 35 (06) :1197-1208