An efficient k′-means clustering algorithm

被引:273
作者
Zalik, Krista Rizman [1 ]
机构
[1] Univ Maribor, Fac Nat Sci & Math, Dept Math & Comp Sci, SLO-2000 Maribor, Slovenia
关键词
clustering analysis; k-means; cluster number; cost-function; rival penalized;
D O I
10.1016/j.patrec.2008.02.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces k-means algorithm that performs correct clustering without pre-assigning the exact number of clusters. This is achieved by minimizing a suggested cost-function. The cost-function extends the mean-square-error cost-function of k-means. The algorithm consists of two separate steps. The first is a pre-processing procedure that performs initial clustering and assigns at least one seed point to each cluster. During the second step, the seed-points are adjusted to minimize the cost-function. The algorithm automatically penalizes any possible winning chances for all rival seed-points in subsequent iterations. When the cost-function reaches a global minimum, the correct number of clusters is determined and the remaining seed points are located near the centres of actual clusters. The simulated experiments described in this paper confirm good performance of the proposed algorithm. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1385 / 1391
页数:7
相关论文
共 19 条
[1]   COMPETITIVE LEARNING ALGORITHMS FOR VECTOR QUANTIZATION [J].
AHALT, SC ;
KRISHNAMURTHY, AK ;
CHEN, PK ;
MELTON, DE .
NEURAL NETWORKS, 1990, 3 (03) :277-290
[2]  
Akaike H., 1992, Selected papers of hirotugu akaike, P610, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[3]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[6]   On rival penalization controlled competitive learning for clustering with automatic cluster number selection [J].
Cheung, YM .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (11) :1583-1588
[7]   Cluster center initialization algorithm for K-means clustering [J].
Khan, SS ;
Ahmad, A .
PATTERN RECOGNITION LETTERS, 2004, 25 (11) :1293-1302
[8]   A genetic algorithm that exchanges neighboring centers for k-means clustering [J].
Laszlo, Michael ;
Mukherjee, Sumitra .
PATTERN RECOGNITION LETTERS, 2007, 28 (16) :2359-2366
[9]  
LAW LT, 2003, P 2003 INT JOINT C N, P20
[10]  
Ma JW, 2006, LECT NOTES COMPUT SC, V3971, P442