An Improved K-means Clustering Method based on Data Field

被引：0

作者：

Xu, Cui ^{[1
]}

Liu, Yuhua ^{[1
]}

Xu, Ke ^{[1
]}

机构：

[1] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China

来源：

INTERNATIONAL CONFERENCE ON CONTROL SYSTEM AND AUTOMATION (CSA 2013) | 2013年

关键词：

Clustering analysis; k-means; data field; splitting clusters; merging clusters;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Clustering is useful for discovering groups and identifying interesting distributions in the underlying data. At present, k-means algorithm as a method of clustering based on the partition has more applications. By analyzing the problem of k-means, we find the traditional k-means algorithm suffers from some shortcomings, such as requiring the user to give out the number of clusters k in advance, being sensitive to the initial cluster centers, being sensitive to the noise and isolated data, only being applied to the type found in globular clusters, and being easily trapped into a local solution et cetera. This improved algorithm uses the potential of data field to find the center data and eliminate the noise data. It decomposes big or extended cluster into several small clusters, then merges adjacent small clusters into a big cluster using the information provided by the Safety Area. Experimental results demonstrate that the improved k-means algorithm can determine the number of clusters, distinguish irregular cluster to a certain extent, decrease the dependence on the initial cluster centers, eliminate the effects of the noise data and get a better clustering accuracy.

引用

页码：454 / 459

页数：6

共 10 条

[1]

Akaike H., 1973, 2 INTERNAT SYMPOS IN, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0, 10.1007/978-1-4612-0919-5_38]

[2]

Brownlee K., 1967, STAT THEORY METHODOL

[3]

Dai Xiaojun, 2004, J COMPUTER ENG APPL, V26

[4] A new algorithm for initial cluster centers in k-means algorithm [J].

Erisoglu, Murat ;

Calis, Nazif ;

Sakallioglu, Sadullah .

PATTERN RECOGNITION LETTERS, 2011, 32 (14) :1701-1705

[5]

Gan Wen-yan, 2006, Acta Electronica Sinica, V34, P258

[6] Data clustering: 50 years beyond K-means [J].

Jain, Anil K. .

PATTERN RECOGNITION LETTERS, 2010, 31 (08) :651-666

[7] Efficient clustering algorithm based on local optimality of K-means [J].

Lei, Xiao-Feng ;

Xie, Kun-Qing ;

Lin, Fan ;

Xia, Zheng-Yi .

Ruan Jian Xue Bao/Journal of Software, 2008, 19 (07) :1683-1692

[8] An Improved Parameter less Data Clustering Technique based on Maximum Distance of Data and Lioyd k-means Algorithm [J].

Mohd, Wan Maseri Binti Wan ;

Beg, A. H. ;

Herawan, Tutut ;

Rabbi, K. F. .

FIRST WORLD CONFERENCE ON INNOVATION AND COMPUTER SCIENCES (INSODE 2011), 2012, 1 :367-371

[9] Clustering algorithms research [J].

Sun, Ji-Gui ;

Liu, Jie ;

Zhao, Lian-Yu .

Ruan Jian Xue Bao/Journal of Software, 2008, 19 (01) :48-61

[10] Survey of clustering algorithms [J].

Xu, R ;

Wunsch, D .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03) :645-678

← 1 →