An Improved K-Means Algorithm Based on Kurtosis Test

被引:3
|
作者
Wang, Tingxuan [1 ]
Gao, Junyao [1 ]
机构
[1] Beijing Inst Technol, 5 South Zhongguancun St, Beijing, Peoples R China
来源
2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019) | 2019年 / 1267卷
关键词
D O I
10.1088/1742-6596/1267/1/012027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a process of classifying data into different classes and has become an important tool in data mining. Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simplicity and high efficiency. However, the traditional K-means algorithm can only find spherical clusters, and is also susceptible to noise points and isolated points, which makes the clustering results affected. To solve these problems, this paper proposes an improved K-means algorithm based on kurtosis test. The improved algorithm can improve the adaptability of clustering algorithm to complex shape datasets while reducing the impact of outlier data on clustering results, so that the algorithm results can be more accurate. The method used in our study is known as kurtosis test and Monte Carlo method. We validate our theoretical results in experiments on a variety of datasets. The experimental results show that the proposed algorithm has larger external indicators of clustering performance metrics, which means that the accuracy of clustering results is significantly improved.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] An Improved K-means Algorithm for Test Case Optimization
    Tan, Tian-Tian
    Wang, Bao-Sheng
    Tang, Yong
    Zhou, Xu
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 169 - 172
  • [2] Analysis of Test Scores of Insurance Salesman Based on Improved K-means Algorithm
    Bai, Wei
    Liu, Jianhua
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 1192 - 1201
  • [3] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [4] A Clustering K-means Algorithm Based on Improved PSO Algorithm
    Tan, Long
    2015 FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT2015), 2015, : 940 - 944
  • [5] An Improved Algorithm of K-means Based on Evolutionary Computation
    Wang, Yunlong
    Luo, Xiong
    Zhang, Jing
    Zhao, Zhigang
    Zhang, Jun
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2020, 26 (05): : 961 - 971
  • [6] Tobacco Distribution Based on Improved K-means Algorithm
    Zheng, Bin
    Tang, Fa-zhe
    Yang, Hua-long
    PROCEEDINGS OF 2009 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATION, LOGISTICS AND INFORMATICS, 2009, : 724 - +
  • [7] Improved K-means algorithm based on density Canopy
    Zhang, Geng
    Zhang, Chengchang
    Zhang, Huayu
    KNOWLEDGE-BASED SYSTEMS, 2018, 145 : 289 - 297
  • [8] An Improved K-Means Algorithm Based on Evidence Distance
    Zhu, Ailin
    Hua, Zexi
    Shi, Yu
    Tang, Yongchuan
    Miao, Lingwei
    ENTROPY, 2021, 23 (11)
  • [9] An Improved K-Means Algorithm Based on Contour Similarity
    Zhao, Jing
    Bao, Yanke
    Li, Dongsheng
    Guan, Xinguo
    MATHEMATICS, 2024, 12 (14)
  • [10] An Improved K-means Algorithm based on Mapreduce and Grid
    Ma, Li
    Gu, Lei
    Li, Bo
    Ma, Yue
    Wang, Jin
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2015, 8 (01): : 189 - 199