A K-means Clustering with Optimized Initial Center Based on Hadoop Platform

被引:0
|
作者
Lin, Kunhui [1 ]
Li, Xiang [1 ]
Zhang, Zhongnan [1 ]
Chen, Jiahong [1 ]
机构
[1] Xiamen Univ, Software Sch, Xiamen, Peoples R China
来源
2014 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2014) | 2014年
关键词
MapReduce; K-means clustering; Initial center; Density; MAPREDUCE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the explosive growth of data, the traditional clustering algorithms running on separate servers can not meet the demand. To solve the problem, more and more researchers implement the traditional clustering algorithms on the cloud computing platforms, especially for K-means clustering. But, few researchers pay attention to the K-means clustering structure, and most of researchers optimized the model of the cloud computing platform to raise the computing speed of K-means clustering. However the problem of instability caused by the random initial centers still exists. In this paper, we propose a K-means clustering algorithm with optimized initial centers based on data dimensional density. This method avoids the deficiency of the random initial centers and improves the stability of the Kmeans clustering. The experimental results show that the approach achieves a good performance on K-means, and improves the accuracy of K-means clustering on the test set.
引用
收藏
页码:263 / 266
页数:4
相关论文
共 50 条
  • [21] Improved Initial Clustering Center Selection Method for k-means Algorithm
    Xie, Qingqing
    Jiang, He
    Han, Bing
    Wang, Dongyuan
    2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 1092 - 1095
  • [22] K-Means Clustering Efficient Algorithm with Initial Class Center Selection
    Huang Suyu
    Hu Pingfang
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL WORKSHOP ON MATERIALS ENGINEERING AND COMPUTER SCIENCES (IWMECS 2018), 2018, 78 : 301 - 305
  • [23] Optimized K-Means Clustering Model based on Gap Statistic
    El-Mandouh, Amira M.
    Mahmoud, Hamdi A.
    Abd-Elmegid, Laila A.
    Haggag, Mohamed H.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (01) : 183 - 188
  • [24] A parallel k-means clustering initial center selection and dynamic center correction on GPU
    Kakooei, Mohammad
    Shahhoseini, Hadi Shahriar
    2014 22nd Iranian Conference on Electrical Engineering (ICEE), 2014, : 20 - 25
  • [25] An Optimized Version of the K-Means Clustering Algorithm
    Poteras, Cosmin Marian
    Mihaescu, Marian Cristian
    Mocanu, Mihai
    FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 695 - 699
  • [26] Cloud Computing K-Means Text Clustering Filtering Algorithm based on Hadoop
    Huang Suyu
    Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications, 2016, 71 : 1516 - 1521
  • [27] Analysis and Research of K-means Algorithm in Soil Fertility Based on Hadoop Platform
    Chen, Guifen
    Yang, Yuqin
    Guo, Hongliang
    Sun, Xionghui
    Chen, Hang
    Cai, Lixia
    Computer and Computing Technologies in Agriculture VIII, 2015, 452 : 304 - 312
  • [28] An Analysis of Financial Product Purchase Behavior Based K-means on Hadoop Platform
    Pang, Shuangyu
    APPLICATIONS OF DECISION SCIENCE IN MANAGEMENT, ICDSM 2022, 2023, 260 : 471 - 479
  • [29] Implementation of hadoop optimization K-means parallel clustering algorithm
    Huang, Suyu
    Tan, Lingli
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 160 - 160
  • [30] Study of a Reasonable Initial Center Selection Method Applied to a K-Means Clustering
    Lee, WonHee
    Lee, Samuel Sangkon
    An, Dong-Un
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08) : 1727 - 1733