An Improved K-means text clustering algorithm By Optimizing initial cluster centers

Cited by: 0
Authors
Xiong, Caiquan [1]
Hua, Zhen [1]
Lv, Ke [1]
Li, Xuan [1]
Affiliations
[1] Hubei Univ Technol, Sch Comp Sci, Wuhan, Hubei, Peoples R China
Source
2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD) | 2016
Funding
National Natural Science Foundation of China
Keywords
K-means algorithm; initial cluster centers; text clustering
DOI
10.1109/CCBD.2016.29
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
K-means is an influential clustering algorithm in data mining. The traditional K-means algorithm is sensitive to its initial cluster centers, so the clustering result depends excessively on how those centers are chosen. To overcome this shortcoming, this paper proposes an improved K-means text clustering algorithm that optimizes the initial cluster centers. The algorithm first calculates the density of each data object in the data set and then determines which data objects are isolated points. After all isolated points are removed, a set of high-density data objects is obtained. The algorithm then chooses, as the initial cluster centers, k high-density data objects that are as far apart from one another as possible. Experimental results show that the improved K-means algorithm improves the stability and accuracy of text clustering.
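The abstract does not give the density formula, the isolated-point test, or the exact rule for picking distant centers, so the following is only a rough sketch of the general idea, not the authors' method. Everything specific in it is an assumption made for illustration: the function name `init_centers`, the parameters `radius_scale` and `isolated_ratio`, Euclidean distance on generic feature vectors (text data would more likely use TF-IDF vectors and a cosine-based distance), density as a neighbor count within a radius, and a greedy farthest-point selection.

```python
import numpy as np

def init_centers(X, k, radius_scale=1.0, isolated_ratio=0.5):
    """Pick k initial centers from dense, mutually distant points (sketch)."""
    X = np.asarray(X, dtype=float)
    n = len(X)
    # Pairwise Euclidean distances, shape (n, n).
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    # Assumed radius: mean distance over distinct pairs, scaled.
    radius = radius_scale * dist.sum() / (n * (n - 1))
    # Assumed density: number of other points within the radius.
    density = (dist <= radius).sum(axis=1) - 1
    # Assumed isolation rule: density well below the average density.
    keep = density >= isolated_ratio * density.mean()
    # Assumed high-density set: kept points at or above the kept-point mean.
    dense = np.flatnonzero(keep & (density >= density[keep].mean()))
    if len(dense) < k:
        raise ValueError("fewer than k high-density points; loosen the thresholds")
    # Greedy selection: start from the densest candidate, then repeatedly add
    # the candidate whose nearest already-chosen center is farthest away.
    chosen = [dense[np.argmax(density[dense])]]
    while len(chosen) < k:
        nearest = dist[np.ix_(dense, chosen)].min(axis=1)
        chosen.append(dense[np.argmax(nearest)])
    return X[chosen]
```

The returned array has shape (k, n_features) and could be handed to any K-means implementation that accepts explicit starting centers, for example through the `init` argument of scikit-learn's `KMeans` with `n_init=1`. The greedy step only approximates "largest distance between the chosen objects"; an exhaustive search over all k-subsets would be exact but is rarely practical.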
Pages: 265-268
Number of pages: 4
Related papers
50 in total
  • [21] An Improved K-means Clustering Algorithm
    Wang Yintong
    Li Wanlong
    Gao Rujia
    2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [22] Improved K-means clustering algorithm
    Zhang, Zhe
    Zhang, Junxi
    Xue, Huifeng
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 169 - 172
  • [23] An improved K-means clustering algorithm
    Huang, Xiuchang
    Su, Wei
    Journal of Networks, 2014, 9 (01) : 161 - 167
  • [24] Improved Algorithm for the k-means Clustering
    Zhang, Sheng
    Wang, Shouqiang
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 4717 - 4720
  • [25] An improved K-Means text clustering algorithm based on Local Search
    Liu, Xiangwei
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11578 - 11581
  • [26] Improved K-means Clustering Algorithm Based on the Optimized Initial Centriods
    Wang, Shunye
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 450 - 453
  • [27] Improved Initial Clustering Center Selection Method for k-means Algorithm
    Xie, Qingqing
    Jiang, He
    Han, Bing
    Wang, Dongyuan
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1092 - 1095
  • [28] An Improved Swarm Based Hybrid K-Means Clustering for Optimal Cluster Centers
    Nayak, Janmenjoy
    Naik, Bighnaraj
    Kanungo, D. P.
    Behera, H. S.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, 2015, 339 : 545 - 553
  • [29] An Improved K-means Clustering Algorithm Based on Meliorated Initial Centre
    Li, Xiang
    Wei, Zhenwei
    Li, Lingling
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRIAL ENGINEERING (AIIE 2016), 2016, 133 : 73 - 76
  • [30] A Method for selecting initial centers of K-means clustering
    Xiong, Zhibin
    Mou, Jinjun
    Du, Hongyan
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 147 - 148