K-means algorithm with a novel distance measure

被引:3
作者
Abudalfa, Shadi I. [1 ]
Mikki, Mohammad [1 ]
机构
[1] Islamic Univ Gaza, Dept Comp Engn, Gaza City, Israel
关键词
Data clustering; distance measure; point symmetry; kd-tree; k-means;
D O I
10.3906/elk-1010-869
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we describe an essential problem in data clustering and present some solutions for it. We investigated using distance measures other than Euclidean type for improving the performance of clustering. We also developed an improved point symmetry-based distance measure and proved its efficiency. We developed a k-means algorithm with a novel distance measure that improves the performance of the classical k-means algorithm. The proposed algorithm does not have the worst-case bound on running time that exists in many similar algorithms in the literature. Experimental results shown in this paper demonstrate the effectiveness of the proposed algorithm. We compared the proposed algorithm with the classical k-means algorithm. We presented the proposed algorithm and their performance results in detail along with avenues of future research.
引用
收藏
页码:1665 / 1684
页数:20
相关论文
共 19 条
  • [1] [Anonymous], 2010, INT J COMPUTER SCI E
  • [2] [Anonymous], P KDD
  • [3] [Anonymous], ICML
  • [4] [Anonymous], KNOWLEDGE DATA ENG
  • [5] [Anonymous], 2 WSEAS INT C SCI CO
  • [6] ATTNEAVE F, 1955, Am J Psychol, V68, P209, DOI 10.2307/1418892
  • [7] GAPS: A clustering method using a new point symmetry-based distance measure
    Bandyopadhyay, Sanghamitra
    Saha, Sriparna
    [J]. PATTERN RECOGNITION, 2007, 40 (12) : 3430 - 3451
  • [8] Blake C. L., 1998, Uci repository of machine learning databases
  • [9] Bouckaert R. R., 2010, WEKA MANUAL VERSION, V327
  • [10] Gan G, 2007, ASA SIAM SER STAT AP, V20, P1, DOI 10.1137/1.9780898718348