Extending the k-means Clustering Algorithm to Improve the Compactness of the Clusters

被引:1
作者
Nasiakou, Antonia [1 ]
Alamaniotis, Miltiadis [1 ]
Tsoukalas, Lefteri H. [1 ]
机构
[1] Purdue Univ, Sch Nucl Engn, 400 Cent Dr, W Lafayette, IN 47906 USA
来源
JOURNAL OF PATTERN RECOGNITION RESEARCH | 2016年 / 11卷 / 01期
基金
美国国家科学基金会;
关键词
k-means; Clustering algorithm; Center initialization; Validation measures;
D O I
10.13176/11.745
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Clustering is a popular method essentially applied to data analysis, data mining, vector quantization and data compression. The most widely used clustering algorithm, which belongs to the group of partitioning algorithms, is the k-means. In this paper, we propose an extended version of k-means where the initial cluster centers are selected based on a heuristic data based formula, in contrast to random selection adopted by the traditional k-means algorithm. In particular, a new formula for selecting the initial cluster centers, before applying the k-means algorithm for clustering of a data set, is introduced. The new extended k-means algorithm is tested on clustering a set of 2-D data points. The obtained results exhibit superiority with respect to clustering compactness of the proposed algorithm as compared to traditional k-means. The validity of the extended algorithm is assessed through a set of clustering measures (Silhouette, Davies-Bouldin), with the most prominent being the Davies-Bouldin measure, that identify how compactness and well-separated the clusters are.
引用
收藏
页码:61 / 73
页数:13
相关论文
共 19 条
  • [1] [Anonymous], P 15 INT C MACH LEAR
  • [2] Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
  • [3] Online Adaptive Hierarchical Clustering in a Decision Tree Framework
    Basak, Jayanta
    [J]. JOURNAL OF PATTERN RECOGNITION RESEARCH, 2011, 6 (02): : 201 - 229
  • [4] A new algorithm for initial cluster centers in k-means algorithm
    Erisoglu, Murat
    Calis, Nazif
    Sakallioglu, Sadullah
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (14) : 1701 - 1705
  • [5] Fahim A.M, 2006, JOF ZHEJIANG U, V7
  • [6] Jain A.K., 2010, INT J PATTERN RECOGN, V31
  • [7] Kathiresan V., 2012, P 2012 INT C COMP CO, P1
  • [8] Komarasamy G., 2013, INT J ENG TECHN, V5, P3951
  • [9] Mahmud M. S., 2012, 2012 7th International Conference on Electrical & Computer Engineering (ICECE), P647, DOI 10.1109/ICECE.2012.6471633
  • [10] Manoharan J.J., 2016, ARPN J ENG APPL SCI, V11, P1086