An efficient K-means clustering algorithm based on influence factors

被引：0

作者：

Leng, Mingwei ^{[1
]}

Tang, Haitao ^{[1
]}

Chen, Xiaoyun ^{[2
]}

机构：

[1] Shangrao Normal Coll, Dept Math & Comp, Jiangxi Shangrao 334000, Peoples R China

[2] Lanzhou Univ, Sch Informat Sci Engn, Lanzhou 730000, Peoples R China

来源：

SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS | 2007年

关键词：

clustering; k-means; initial points; influence factor;

D O I：

10.1109/SNPD.2007.279

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Clustering has been one of the most widely studied topics in data mining and pattern recognition, k-means clustering has been one of the popular, simple and faster clustering algorithms. but the right value of k is unkwown and selecting effectively initial points is also difficult. In view of this, a lot of work has been done on various versions of k-means, which refines initial points and detects the number of clusters. In this paper, we present a new algorithm, called an efficient k-means clustering based on influence factors,which is divided into two stages and can automatically achieve the actual value of k and select the right initial points based on the datasets characters. Propose influence factor to measure similarity of two clusters, using it to determine whether the two clusers should be merged into one. In order to obtain a faster algorithm, a theorem is proposed and proofed, using it to accelerate the algorithm. Experimental results from Gaussian datasets were generated as in Pelleg and Moore (2000)[11] show the algorithm has high quality and obtains a satisfying result.

引用

页码：815 / +

页数：2

共 50 条

[31] A Credits Based Scheduling Algorithm with K-means Clustering
Sharma, Vrajesh
Bala, Manju
2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 82 - 86
[32] An Improved K-means Clustering Algorithm Based on Dissimilarity
Wang Shunye
PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 2629 - 2633
[33] A MapReduce-based K-means clustering algorithm
Mao, YiMin
Gan, DeJin
Mwakapesa, D. S.
Nanehkaran, Y. A.
Tao, Tao
Huang, XueYu
JOURNAL OF SUPERCOMPUTING, 2022, 78 (04) : 5181 - 5202
[34] Chinese text clustering algorithm based k-means
Yao, Mingyu
Pi, Dechang
Cong, Xiangxiang
2012 INTERNATIONAL CONFERENCE ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING (ICMPBE2012), 2012, 33 : 301 - 307
[35] Efficient disk-based K-means clustering for relational databases
Ordonez, C
Omiecinski, E
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (08) : 909 - 921
[36] The global k-means clustering algorithm
Likas, A
Vlassis, N
Verbeek, JJ
PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
[37] Improved K-means clustering algorithm
Zhang, Zhe
Zhang, Junxi
Xue, Huifeng
CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
[38] A Clustering Algorithm Based on K-means for Wireless Indoor Monitoring System
Li, Xiao Hui
Fang, Kang Ling
Zhang, Liang
He, Jian
2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 488 - +
[39] Distributed Algorithm for Text Documents Clustering Based on k-Means Approach
Sarnovsky, Martin
Carnoka, Noema
INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, ISAT 2015, PT II, 2016, 430 : 165 - 174
[40] A K-means Text Clustering Algorithm Based on Subject Feature Vector
Duo, Ji
Zhang, Peng
Hao, Liu
JOURNAL OF WEB ENGINEERING, 2021, 20 (06): : 1935 - 1946

← 1 2 3 4 5 →