An Improved K-means Clustering Algorithm Based on Dissimilarity

被引：0

作者：

Wang Shunye ^{[1
]}

机构：

[1] Langfang Teachers Coll, Dept Comp Sci & Technol, Langfang, Peoples R China

来源：

PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC) | 2013年

关键词：

k-means; initial centriods; Huffman tree; dissimilarity;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

K-means clustering algorithm is one of the most widely used clustering algorithms and has been applied in many fields of science and technology. A major problem of the original k-means clustering algorithm is that the cluster results depend on the initial centroids which choose at random. At the same time, the similarity measure on the algorithm based on distance is not suitable for big high-dimensional dataset. They all lead to severe degradation in performance. In this paper, an improved k-means clustering algorithm based on dissimilarity is proposed. It selects the initial centriods using the Huffman tree which uses dissimilarity matrix to construct. Many experiments confirm that the proposed algorithm is an efficient algorithm with better clustering accuracy on the same algorithm time complexity.

引用

页码：2629 / 2633

页数：5

共 19 条

[1]

Abubaker Mohamed, 2013, International Journal of Intelligent Systems and Applications, V5, P37, DOI 10.5815/ijisa.2013.03.04

[2]

[Anonymous], 2010, INTRO DATA MINING

[3]

[Anonymous], 2009, P WORLD C ENG

[4]

[Anonymous], SEL TOP SIGNAL PROCE

[5]

Chen Qimai, 2009, MICROCOMPUTER INFORM, V25, P27

[6]

Chen Qimai, 2009, MICROCOMPUTER INFORM, V25, P198

[7]

Fu De-sheng, 2011, Journal of Computer Applications, V31, P432, DOI 10.3724/SP.J.1087.2011.00432

[8]

Han Ling-bo, 2010, Computer Engineering and Applications, V46, P150, DOI 10.3778/j.issn.1002-8331.2010.17.042

[9]

Huang Maida, 2009, MICROCOMPUTER INFORM, V25, P187

[10]

Indira Priya P., 2012, INT J COMPUTER APPL, P12

← 1 2 →