An Initialization Method Based on Hybrid Distance for k-Means Algorithm

被引:11
|
作者
Yang, Jie [1 ]
Ma, Yan [1 ]
Zhang, Xiangfen [1 ]
Li, Shunbao [2 ]
Zhang, Yuping [1 ]
机构
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
[2] Shanghai Normal Univ, Coll Math & Sci, Shanghai 200234, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1162/neco_a_01014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traditional k-means algorithm has been widely used as a simple and efficient clustering method. However, the performance of this algorithm is highly dependent on the selection of initial cluster centers. Therefore, the method adopted for choosing initial cluster centers is extremely important. In this letter, we redefine the density of points according to the number of its neighbors, as well as the distance between points and their neighbors. In addition, we define a new distance measure that considers both Euclidean distance and density. Based on that, we propose an algorithm for selecting initial cluster centers that can dynamically adjust the weighting parameter. Furthermore, we propose a new internal clustering validation measure, the clustering validation index based on the neighbors (CVN), which can be exploited to select the optimal result among multiple clustering results. Experimental results show that the proposed algorithm outperforms existing initialization methods on real-world data sets and demonstrates the adaptability of the proposed algorithm to data sets with various characteristics.
引用
收藏
页码:3094 / 3117
页数:24
相关论文
共 50 条
  • [1] Adaptive Initialization Method for K-Means Algorithm
    Yang, Jie
    Wang, Yu-Kai
    Yao, Xin
    Lin, Chin-Teng
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [2] An Improved Initialization Center K-means Clustering Algorithm Based on Distance and Density
    Duan, Yanling
    Liu, Qun
    Xia, Shuyin
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [3] AN INTELLIGENT INITIALIZATION METHOD FOR THE K-MEANS CLUSTERING ALGORITHM
    Sheu, Jyh-Jian
    Chen, Wei-Ming
    Tsai, Wen-Bin
    Chu, Ko-Tsung
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2551 - 2566
  • [4] Density K-means : A New Algorithm for Centers Initialization for K-means
    Lan, Xv
    Li, Qian
    Zheng, Yi
    PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 958 - 961
  • [5] An initialization method of K-means clustering algorithm for mixed data
    Li, Taoying, 1873, ICIC International (10):
  • [6] An initialization method for the K-Means algorithm using neighborhood model
    Cao, Fuyuan
    Liang, Jiye
    Jiang, Guang
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 58 (03) : 474 - 483
  • [7] AN INITIALIZATION METHOD OF K-MEANS CLUSTERING ALGORITHM FOR MIXED DATA
    Li, Taoying
    Jin, Zhihong
    Chen, Yan
    Ebonzo, Angelo Dan Menga
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (05): : 1873 - 1883
  • [8] Spectral method of K-means initialization
    Qian, Xian
    Huang, Xuan-Jing
    Wu, Li-De
    Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (04): : 342 - 346
  • [9] The New K-Means Initialization Method
    Brejna, Bartosz
    Pietranik, Marcin
    Kozierkiewicz, Adrianna
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, ICCCI 2024, 2024, 14810 : 372 - 381
  • [10] A New Projection-based K-Means Initialization Algorithm
    Du, Wei
    Lin, Hu
    Sun, Jianwei
    Yu, Bo
    Yang, Haibo
    2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 2341 - 2345