An Improved K-Means Algorithm Based on Evidence Distance

Cited by: 11
Authors
Zhu, Ailin [1]
Hua, Zexi [1]
Shi, Yu [2]
Tang, Yongchuan [3]
Miao, Lingwei [2, 4]
Affiliations
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Sch Elect Engn, Chengdu 611756, Peoples R China
[3] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[4] Qianghua Times Chengdu Technol Co Ltd, Chengdu 610095, Peoples R China
Keywords
k-means clustering; evidence distance; cluster analysis; evidence theory; CLUSTERING-ALGORITHM; MEANS-PLUS;
DOI
10.3390/e23111550
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
The clustering quality of the k-means algorithm depends mainly on the selection of the initial cluster centers and on the distance measure between sample points. The traditional k-means algorithm measures the distance between sample points with the Euclidean distance; as a result, it discriminates poorly between the attributes of sample points and is prone to local optima. To address this problem, this paper proposes an improved k-means algorithm based on evidence distance. First, the attribute values of the sample points are modelled as the basic probability assignment (BPA) of each sample point. Then, the evidence distance replaces the traditional Euclidean distance for measuring the distance between sample points. Finally, k-means clustering is carried out on UCI data sets. The method is compared experimentally with the traditional k-means algorithm, the k-means algorithm based on the aggregation distance parameter, and the Gaussian mixture model. The experimental results show that the proposed algorithm achieves a better clustering effect and better convergence.
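The pipeline described in the abstract (attribute values to BPA, evidence distance in place of Euclidean distance, then k-means) might be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify how the BPA is constructed or which evidence distance is used, so the sketch assumes a simple row normalization (`to_bpa`, a hypothetical helper that requires positive attribute values) and the Jousselme distance, a commonly used evidence distance, d(m1, m2) = sqrt(0.5 (m1 - m2)^T D (m1 - m2)) with D[i, j] = |A_i ∩ A_j| / |A_i ∪ A_j|. All function names are illustrative.

```python
import numpy as np

def jaccard_matrix(focal_sets):
    """D[i, j] = |A_i & A_j| / |A_i | A_j| over the given focal elements."""
    n = len(focal_sets)
    D = np.empty((n, n))
    for i, a in enumerate(focal_sets):
        for j, b in enumerate(focal_sets):
            D[i, j] = len(a & b) / len(a | b)
    return D

def evidence_distance(m1, m2, D):
    """Jousselme distance between two mass vectors (assumed choice of evidence distance)."""
    diff = np.asarray(m1, dtype=float) - np.asarray(m2, dtype=float)
    return float(np.sqrt(max(0.5 * diff @ D @ diff, 0.0)))

def to_bpa(X):
    """Hypothetical BPA model: scale each row of positive attributes to sum to 1."""
    X = np.asarray(X, dtype=float)
    return X / X.sum(axis=1, keepdims=True)

def evidence_kmeans(M, k, D, n_iter=100, seed=0):
    """Lloyd-style k-means on BPA rows of M, using the squared evidence distance."""
    rng = np.random.default_rng(seed)
    centers = M[rng.choice(len(M), size=k, replace=False)]
    labels = np.zeros(len(M), dtype=int)
    for _ in range(n_iter):
        diff = M[:, None, :] - centers[None, :, :]           # shape (n, k, f)
        d2 = 0.5 * np.einsum("nkf,fg,nkg->nk", diff, D, diff)
        labels = d2.argmin(axis=1)
        # The mean of BPAs is again a BPA and minimizes the summed squared distance.
        new_centers = np.array([
            M[labels == c].mean(axis=0) if np.any(labels == c) else centers[c]
            for c in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers

# Toy usage: two attributes, each treated as a singleton focal element.
X = [[9, 1], [8, 2], [1, 9], [2, 8]]
M = to_bpa(X)
D = jaccard_matrix([frozenset("a"), frozenset("b")])
labels, centers = evidence_kmeans(M, k=2, D=D)
```

With singleton focal elements only, D is the identity matrix and the evidence distance reduces to a scaled Euclidean distance on the normalized masses; the two measures differ once compound focal elements such as {a, b} carry mass, which is where the Jaccard weighting takes effect.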
Pages: 15