Improving the Initial Centroids of k-means Clustering Algorithm to Generalize its Applicability

被引:22
作者
Goyal M. [1 ]
Kumar S. [1 ]
机构
[1] Faculty of Engineering and Technology, Manav Rachna International University, Faridabad
关键词
Clustering; Initial centroids; k-means algorithm; Partitional clustering algorithm;
D O I
10.1007/s40031-014-0106-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
k-means is one of the most widely used partition based clustering algorithm. But the initial centroids generated randomly by the k-means algorithm cause the algorithm to converge at the local optimum. So to make k-means algorithm globally optimum, the initial centroids must be selected carefully rather than randomly. Though many researchers have already been carried out for the enhancement of k-means algorithm, they have their own limitations. In this paper a new method to formulate the initial centroids is proposed which results in better clusters equally for uniform and non-uniform data sets. © 2014, The Institution of Engineers (India).
引用
收藏
页码:345 / 350
页数:5
相关论文
共 12 条
[1]  
Rauf S.A., Mahfooz S., Khusro S., Javed H., Enhanced k-mean clustering algorithm to reduce number of iterations and time complexity, Middle-East J. Sci. Res., 12, 7, pp. 959-963, (2012)
[2]  
Li C.S., Cluster center initialization method for k-means algorithm over data sets with two clusters, in Proceedings of International Conference on Advances in Engineering, pp, 324–328, (2011)
[3]  
Zhang C., Fang Z., An improved k-means clustering algorithm, J. Inf. Comput. Sci., 10, 1, pp. 193-199, (2013)
[4]  
Yuan F., A new algorithm to get the initial centroids, in Proceedings of the Third IEEE International Conference on Machine Learning and Cybernetics, Shanghai, pp. 1191-1193, (2004)
[5]  
Nazeer K.A.A., Sebastian M.P., Improving the accuracy and efficiency of the k-means clustering algorithm, Proceedings of the World Congress on Engineering, (2009)
[6]  
Arai K., Barakbah A.R., Hierarchical k-means: an algorithm for centroids initialization for k-means, Rep. Fac. Sci. Eng. Saga Univ., 36, 1, pp. 25-31, (2007)
[7]  
Agha M.E., Ashour W.M., Efficient and fast initialization algorithm for k-means clustering, Int. J. Intell. Syst. Appl., 1, pp. 21-31, (2012)
[8]  
Celebi M.E., Kingravi H.A., Vela P.A., A comparative study of efficient initialization methods for the k-means clustering algorithm, J. Exp. Syst. Appl., 40, 1, pp. 200-210, (2013)
[9]  
Yedla M., Rao Pathakota S., Srinivasa T.M., Enhancing k-means clustering algorithm with improved initial center, IJCSIT, 1, 2, pp. 121-125, (2010)
[10]  
Aldahdooh R.T., Ashour W., DIMK means: distance based initialization method for k-means clustering algorithm, Int. J. Intell. Syst. Appl., 5, 2, pp. 41-51, (2013)