An Improved K-Means Algorithm Based on Kurtosis Test

被引:3
|
作者
Wang, Tingxuan [1 ]
Gao, Junyao [1 ]
机构
[1] Beijing Inst Technol, 5 South Zhongguancun St, Beijing, Peoples R China
来源
2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019) | 2019年 / 1267卷
关键词
D O I
10.1088/1742-6596/1267/1/012027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a process of classifying data into different classes and has become an important tool in data mining. Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simplicity and high efficiency. However, the traditional K-means algorithm can only find spherical clusters, and is also susceptible to noise points and isolated points, which makes the clustering results affected. To solve these problems, this paper proposes an improved K-means algorithm based on kurtosis test. The improved algorithm can improve the adaptability of clustering algorithm to complex shape datasets while reducing the impact of outlier data on clustering results, so that the algorithm results can be more accurate. The method used in our study is known as kurtosis test and Monte Carlo method. We validate our theoretical results in experiments on a variety of datasets. The experimental results show that the proposed algorithm has larger external indicators of clustering performance metrics, which means that the accuracy of clustering results is significantly improved.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Video Classification Based On the Improved K-Means Clustering Algorithm
    Peng, Taile
    Zhang, Zhen
    Shen, Ke
    Jiang, Tao
    2019 5TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2020, 440
  • [32] K-means Clustering Algorithm based on Improved Density Peak
    Wei, Debin
    Zhang, Zhenxing
    ACM International Conference Proceeding Series, 2023, : 105 - 109
  • [33] Improved SLIC imagine segmentation algorithm based on K-means
    Han, Chun-yan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (02): : 1017 - 1023
  • [34] Load Forecasting Based on Improved K-means Clustering Algorithm
    Wang Yanbo
    Liu Li
    Pang Xinfu
    Fan Enpeng
    2018 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2018, : 2751 - 2755
  • [35] Improved C4.5 algorithm based on k-means
    Li, Honghui
    Xi, Yikun
    Lu, Hailiang
    Fu, Xueliang
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2020, 20 (01) : 177 - 189
  • [36] A name Disambiguation Approach Based on Improved K-Means Algorithm
    Wang, Ying-shuai
    Li, Pei-feng
    Yang, Xin-xin
    Zhu, Qiao-ming
    11TH CHINESE LEXICAL SEMANTICS WORKSHOP (CKSW2010), 2010, : 472 - 476
  • [37] Improved SLIC imagine segmentation algorithm based on K-means
    Chun-yan Han
    Cluster Computing, 2017, 20 : 1017 - 1023
  • [38] An Improved k-means Algorithm based on Average Diameter Method
    Zhao, Yang
    Zeng, Bi
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [39] An Improved K-means Clustering Algorithm Based on Hadoop Platform
    Hou, Xiangru
    CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 1101 - 1109
  • [40] An Improved K-Means Clustering Algorithm Based on Spectral Method
    Tian, Shengwen
    Yang, Hongyong
    Wang, Yilei
    Li, Ali
    ADVANCES IN COMPUTATION AND INTELLIGENCE, PROCEEDINGS, 2008, 5370 : 530 - 536