A Multi-Objective Genetic Graph-based Clustering Algorithm with Memory Optimization

被引:0
作者
Menendez, Hector D. [1 ]
Barrero, David F.
Camacho, David [1 ,2 ]
机构
[1] Univ Autonoma Madrid, Dept Comp Sci, E-28049 Madrid, Spain
[2] Univ Alcala de Henares, Dept Automat, E-28871 Alcala De Henares, Spain
来源
2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2013年
关键词
NETWORK;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Clustering is one of the most versatile tools for data analysis. Over the last few years, clustering that seeks the continuity of data (in opposition to classical centroid-based approaches) has attracted an increasing research interest. It is a challenging problem with a remarkable practical interest. The most popular continuity clustering method is the Spectral Clustering algorithm, which is based on graph cut: it initially generates a Similarity Graph using a distance measure and then uses its Graph Spectrum to find the best cut. Memory consuption is a serious limitation in that algorithm: The Similarity Graph representation usually requires a very large matrix with a high memory cost. This work proposes a new algorithm, based on a previous implementation named Genetic Graph-based Clustering (GGC), that improves the memory usage while maintaining the quality of the solution. The new algorithm, called Multi-Objective Genetic Graph-based Clustering (MOGGC), uses an evolutionary approach introducing a Multi-Objective Genetic Algorithm to manage a reduced version of the Similarity Graph. The experimental validation shows that MOGGC increases the memory efficiency, maintaining and improving the GGC results in the synthetic and real datasets used in the experiments. An experimental comparison with several classical clustering methods (EM, SC and K-means) has been included to show the efficiency of the proposed algorithm.
引用
收藏
页码:3174 / 3181
页数:8
相关论文
共 32 条
[1]  
[Anonymous], 2004, ICML
[2]  
[Anonymous], 2005, Discovering Knowledge in Data: An Introduction to Data Mining
[3]  
[Anonymous], 2007, ACM Transactions on Knowledge Discovery from Data, DOI [DOI 10.1145/1217299.1217303, 10.1145/1217299.1217303]
[4]  
Bello-Orgaz G., 2012, P 4 INT WORKSH WEB I, P4
[5]  
Carroll SusanRovezzi., 2002, STAT MADE SIMPLE SCH
[6]   Robust path-based spectral clustering [J].
Chang, Hong ;
Yeung, Dit-Yan .
PATTERN RECOGNITION, 2008, 41 (01) :191-203
[7]  
Coello C.A.C., 2007, Evolutionary Algorithms for Solving Multi-Objective Problems, V5, DOI DOI 10.1007/978-0-387-36797-2
[8]  
Coley, 1999, INTRO GENETIC ALGORI
[9]  
Corne D. W., 2000, Parallel Problem Solving from Nature PPSN VI. 6th International Conference. Proceedings (Lecture Notes in Computer Science Vol.1917), P839
[10]   Modeling wine preferences by data mining from physicochemical properties [J].
Cortez, Paulo ;
Cerdeira, Antonio ;
Almeida, Fernando ;
Matos, Telmo ;
Reis, Jose .
DECISION SUPPORT SYSTEMS, 2009, 47 (04) :547-553