Automatic clustering using nature-inspired metaheuristics: A survey

被引:156
作者
Jose-Garcia, Adan [1 ]
Gomez-Flores, Wilfrido [1 ]
机构
[1] Natl Polytech Inst, Ctr Res & Adv Studies, Informat Technol Lab, Ciudad Victoria, Tamaulipas, Mexico
关键词
Cluster analysis; Automatic clustering; Nature-inspired metaheuristics; Single-objective and multiobjective; metaheuristics; GENETIC ALGORITHM; DIFFERENTIAL EVOLUTION; OPTIMIZATION ALGORITHM; PIXEL CLASSIFICATION; VALIDITY MEASURE; TABU SEARCH; PERFORMANCE; INDEXES;
D O I
10.1016/j.asoc.2015.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In cluster analysis, a fundamental problem is to determine the best estimate of the number of clusters; this is known as the automatic clustering problem. Because of lack of prior domain knowledge, it is difficult to choose an appropriate number of clusters, especially when the data have many dimensions, when clusters differ widely in shape, size, and density, and when overlapping exists among groups. In the late 1990s, the automatic clustering problem gave rise to a new era in cluster analysis with the application of nature-inspired metaheuristics. Since then, researchers have developed several new algorithms in this field. This paper presents an up-to-date review of all major nature-inspired metaheuristic algorithms used thus far for automatic clustering. Also, the main components involved during the formulation of metaheuristics for automatic clustering are presented, such as encoding schemes, validity indices, and proximity measures. A total of 65 automatic clustering approaches are reviewed, which are based on single-solution, single-objective, and multiobjective metaheuristics, whose usage percentages are 3%, 69%, and 28%, respectively. Single-objective clustering algorithms are adequate to efficiently group linearly separable clusters. However, a strong tendency in using multiobjective algorithms is found nowadays to address non-linearly separable problems. Finally, a discussion and some emerging research directions are presented. (C) 2016 Published by Elsevier B.V.
引用
收藏
页码:192 / 213
页数:22
相关论文
共 167 条
[71]  
Hruschka ER, 2004, LECT NOTES ARTIF INT, V3315, P861
[72]   Data clustering: 50 years beyond K-means [J].
Jain, Anil K. .
PATTERN RECOGNITION LETTERS, 2010, 31 (08) :651-666
[73]  
Kanade PM, 2003, NAFIPS'2003: 22ND INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS PROCEEDINGS, P227
[74]   Automatic clustering for generalised cell formation using a hybrid particle swarm optimisation [J].
Kao, Yucheng ;
Chen, Chien-Chih .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2014, 52 (12) :3466-3484
[75]  
Karaboga D, 2008, APPL SOFT COMPUT, V8, P687, DOI 10.1016/j.asoc.2007.05.007
[76]  
Karaboga D., 2005, TR06 ERC U COMP ENG, V32, P459
[77]   A novel clustering approach: Artificial Bee Colony (ABC) algorithm [J].
Karaboga, Dervis ;
Ozturk, Celal .
APPLIED SOFT COMPUTING, 2011, 11 (01) :652-657
[78]   A comparative study of Artificial Bee Colony algorithm [J].
Karaboga, Dervis ;
Akay, Bahriye .
APPLIED MATHEMATICS AND COMPUTATION, 2009, 214 (01) :108-132
[79]   An efficient approach for unsupervised fuzzy clustering based on grouping evolution strategies [J].
Kashan, Ali Husseinzadeh ;
Rezaee, Babak ;
Karimiyan, Somayyeh .
PATTERN RECOGNITION, 2013, 46 (05) :1240-1254
[80]  
Kennedy J, 1995, 1995 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS PROCEEDINGS, VOLS 1-6, P1942, DOI 10.1109/icnn.1995.488968