An Initialization Method Based on Hybrid Distance for k-Means Algorithm

Cited by: 11
Authors
Yang, Jie [1]
Ma, Yan [1]
Zhang, Xiangfen [1]
Li, Shunbao [2]
Zhang, Yuping [1]
Affiliations
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
[2] Shanghai Normal Univ, Coll Math & Sci, Shanghai 200234, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1162/neco_a_01014
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The traditional k-means algorithm has been widely used as a simple and efficient clustering method. However, its performance depends heavily on the selection of the initial cluster centers, so the method used to choose them is extremely important. In this letter, we redefine the density of a point according to the number of its neighbors as well as the distances between the point and those neighbors. We also define a new distance measure that considers both Euclidean distance and density. Based on these definitions, we propose an algorithm for selecting initial cluster centers that can dynamically adjust the weighting parameter. Furthermore, we propose a new internal clustering validation measure, the clustering validation index based on the neighbors (CVN), which can be used to select the best result among multiple clustering results. Experimental results show that the proposed algorithm outperforms existing initialization methods on real-world data sets and demonstrate its adaptability to data sets with various characteristics.
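The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of density-aware seeding, not the authors' method. Everything in it is an assumption made for illustration: the density formula (neighbor count within a radius `eps`, discounted by the mean distance to those neighbors), the linear combination used as a "hybrid" score, the fixed weight `w` (the paper adjusts its weighting parameter dynamically; this sketch does not), and all function names.

```python
import math


def euclidean(p, q):
    """Plain Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def densities(points, eps):
    """Assumed density: number of neighbors within eps, divided by
    (1 + mean distance to those neighbors). Not the paper's formula."""
    result = []
    for i, p in enumerate(points):
        near = [d for j, q in enumerate(points) if j != i
                for d in [euclidean(p, q)] if d <= eps]
        mean_dist = sum(near) / len(near) if near else 0.0
        result.append(len(near) / (1.0 + mean_dist))
    return result


def normalize(values):
    """Scale values into [0, 1] by dividing by the maximum."""
    hi = max(values)
    return [v / hi if hi > 0 else 0.0 for v in values]


def select_initial_centers(points, k, eps, w=0.5):
    """Pick k seeds: the densest point first, then repeatedly the point
    maximizing w * (distance to nearest seed) + (1 - w) * density.
    The fixed weight w is an assumption of this sketch."""
    dens = normalize(densities(points, eps))
    chosen = [max(range(len(points)), key=lambda i: dens[i])]
    while len(chosen) < k:
        nearest = normalize(
            [min(euclidean(p, points[c]) for c in chosen) for p in points]
        )
        score = [w * nearest[i] + (1 - w) * dens[i]
                 for i in range(len(points))]
        for c in chosen:
            score[c] = -1.0  # never pick the same point twice
        chosen.append(max(range(len(points)), key=lambda i: score[i]))
    return [points[i] for i in chosen]


if __name__ == "__main__":
    # Two tight groups plus one isolated point.
    data = [(0, 0), (0, 1), (1, 0), (1, 1),
            (10, 10), (10, 11), (11, 10), (11, 11),
            (30, 30)]
    print(select_initial_centers(data, k=2, eps=1.5))
```

The point of combining the two terms is that a purely distance-based rule would tend to pick the isolated point as the second seed, while the density term steers the choice toward a point inside the second group.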
Pages: 3094-3117
Number of pages: 24
Related papers
50 in total
  • [31] An Initialization Scheme for Supervized K-means
    Lemaire, Vincent
    Ismaili, Oumaima Alaoui
    Cornuejols, Antoine
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [32] K-Means and Fuzzy based Hybrid Clustering Algorithm for WSN
    Angadi, Basavaraj M.
    Kakkasageri, Mahabaleshwar S.
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2023, 69 (04) : 793 - 801
  • [33] Importance of Initialization in K-Means Clustering
    Gupta, Anubhav
    Tomer, Antriksh
    Dahiya, Sonika
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [34] AN EFFICIENT INITIALIZATION METHOD FOR K-MEANS CLUSTERING OF HYPERSPECTRAL DATA
    Naeini, A. Alizade
    Jamshidzadeh, A.
    Saadatseresht, M.
    Homayouni, S.
    1ST ISPRS INTERNATIONAL CONFERENCE ON GEOSPATIAL INFORMATION RESEARCH, 2014, 40 (2/W3) : 35 - 39
  • [35] A modified version of the K-means algorithm based on the shape similarity distance
    Li, Dan
    Li, Xinbao
    FRONTIERS OF MECHANICAL ENGINEERING AND MATERIALS ENGINEERING II, PTS 1 AND 2, 2014, 457-458 : 1064 - 1068
  • [36] ck-means and fck-means: Two Deterministic Initialization Procedures for k-means Algorithm Using a Modified Crowding Distance
    Layeb, Abdesslem
    ACTA INFORMATICA PRAGENSIA, 2023, 12 (02) : 379 - 399
  • [37] A modified version of the K-means algorithm with a distance based on cluster symmetry
    Su, MS
    Chou, CH
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (06) : 674 - 680
  • [38] A robust algorithm for cluster initialization using uniform effect of k-Means
    Peng, Liuqing
    Zhang, Junying
    Xu, Jin
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2010, 38 (08) : 73 - 76
  • [39] Cluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters
    Li, Chun Sheng
    INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING 2011, 2011, 24 : 324 - 328
  • [40] A comparative study of efficient initialization methods for the k-means clustering algorithm
    Celebi, M. Emre
    Kingravi, Hassan A.
    Vela, Patricio A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (01) : 200 - 210