A Survey of Evolutionary Algorithms for Clustering

被引:481
作者
Hruschka, Eduardo Raul [1 ]
Campello, Ricardo J. G. B. [1 ]
Freitas, Alex A. [2 ]
de Carvalho, Andre C. Ponce Leon F. [1 ]
机构
[1] Univ Sao Paulo, Dept Comp Sci, BR-13560970 Sao Carlos, SP, Brazil
[2] Univ Kent, Dept Comp Sci, Canterbury CT2 7NZ, Kent, England
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS | 2009年 / 39卷 / 02期
基金
巴西圣保罗研究基金会;
关键词
Applications; clustering; evolutionary algorithms; GENE-EXPRESSION DATA; MISSING VALUE ESTIMATION; K-MEANS; VALIDITY; SEARCH; CLASSIFICATION; NUMBER;
D O I
10.1109/TSMCC.2008.2007252
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a survey of evolutionary algorithms designed for clustering tasks. It tries to reflect the profile of this area by focusing more on those subjects that have been given more importance in the literature. In this context, most of the paper is devoted to partitional algorithms that look for hard clusterings of data, though overlapping (i.e., soft and fuzzy) approaches are also covered in the paper. The paper is original in what concerns two main aspects. Firsts it provides an up-to-date overview that is fully devoted to evolutionary algorithms for clustering, is not limited to any particular kind of evolutionary approach, and comprises advanced topics like multiobjective and ensemble-based evolutionary clustering. Second, it provides a taxonomy that highlights some very important aspects in the context of evolutionary data clustering, namely, fixed or variable number of clusters, cluster-oriented or nonoriented operators, context-sensitive or context-insensitive operators, guided or unguided operators, binary, integer, or real encodings, centroid-based, medoid-based, label-based, tree-based, or graph-based representations, among others. A number of references are provided that describe applications of evolutionary algorithms for clustering in different domains, such as image processing, computer security, and bioinformatics. The paper ends by addressing some important issues and open questions that can be subject of future research.
引用
收藏
页码:133 / 155
页数:23
相关论文
共 134 条
  • [1] Alves VS, 2007, IEEE INT CONF FUZZY, P375
  • [2] ALVES VS, 2006, P IEEE C EV COMP, P6240
  • [3] [Anonymous], SERIES PROBABILITY M
  • [4] [Anonymous], P 25 INT C SYST SCI
  • [5] [Anonymous], 2007, 10 INT C INFORM TECH, DOI DOI 10.1109/ICIT.2007.13
  • [6] [Anonymous], 1998, INT SER INTELL TECHN
  • [7] [Anonymous], 2000, Evolutionary computation
  • [8] [Anonymous], THESIS U W AUSTR PER
  • [9] [Anonymous], SOFT COMPUTING KNOWL
  • [10] [Anonymous], PATTERN RECOGNITION