Variable Weighting in Fuzzy k-Means Clustering to Determine the Number of Clusters

被引:38
作者
Khan, Imran [1 ]
Luo, Zongwei [1 ]
Huang, Joshua Zhexue [2 ]
Shahzad, Waseem [3 ]
机构
[1] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen Key Lab Computat Intelligence, Shenzhen 518055, Guangdong, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
[3] Natl Univ Comp & Emerging Sci, Dept Comp Sci, Islamabad 44000, Pakistan
关键词
Fuzzy k-means; clustering; number of clusters; data mining; variable weighting; MEANS ALGORITHM; DATA SETS; SELECTION; CENTERS; MODEL;
D O I
10.1109/TKDE.2019.2911582
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most significant problems in cluster analysis is to determine the number of clusters in unlabeled data, which is the input for most clustering algorithms. Some methods have been developed to address this problem. However, little attention has been paid on algorithms that are insensitive to the initialization of cluster centers and utilize variable weights to recover the number of clusters. To fill this gap, we extend the standard fuzzy k-means clustering algorithm. It can automatically determine the number of clusters by iteratively calculating the weights of all variables and the membership value of each object in all clusters. Two new steps are added to the fuzzy k-means clustering process. One of them is to introduce a penalty term to make the clustering process insensitive to the initial cluster centers. The other one is to utilize a formula for iterative updating of variable weights in each cluster based on the current partition of data. Experimental results on real-world and synthetic datasets have shown that the proposed algorithm effectively determined the correct number of clusters while initializing the different number of cluster centroids. We also tested the proposed algorithm on gene data to determine a subset of important genes.
引用
收藏
页码:1838 / 1853
页数:16
相关论文
共 50 条
  • [41] K*-Means: An Effective and Efficient K-means Clustering Algorithm
    Qi, Jianpeng
    Yu, Yanwei
    Wang, Lihong
    Liu, Jinglei
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
  • [42] The global k-means clustering algorithm
    Likas, A
    Vlassis, N
    Verbeek, JJ
    PATTERN RECOGNITION, 2003, 36 (02) : 451 - 461
  • [43] An Improved Method for K-Means Clustering
    Cui, Xiaowei
    Wang, Fuxiang
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 756 - 759
  • [44] Dynamic Incremental K-means Clustering
    Aaron, Bryant
    Tamir, Dan E.
    Rishe, Naphtali D.
    Kandel, Abraham
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 308 - 313
  • [45] Inference of number of prototypes with a framework approach to K-means clustering
    Chambers, Simon J.
    Jarman, Ian H.
    Etchells, Terence A.
    Lisboa, Paulo J. G.
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2013, 13 (04) : 323 - 340
  • [46] Automatic detection of outliers and the number of clusters in k-means clustering via Chebyshev-type inequalities
    Olukanmi, Peter
    Nelwamondo, Fulufhelo
    Marwala, Tshilidzi
    Twala, Bhekisipho
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (08) : 5939 - 5958
  • [47] Automatic detection of outliers and the number of clusters in k-means clustering via Chebyshev-type inequalities
    Peter Olukanmi
    Fulufhelo Nelwamondo
    Tshilidzi Marwala
    Bhekisipho Twala
    Neural Computing and Applications, 2022, 34 : 5939 - 5958
  • [48] Optimization of the clusters number of An improved fuzzy C-means clustering algorithm
    Xu Yejun
    10TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2015), 2015, : 931 - 935
  • [49] K-Means Cloning: Adaptive Spherical K-Means Clustering
    Hedar, Abdel-Rahman
    Ibrahim, Abdel-Monem M.
    Abdel-Hakim, Alaa E.
    Sewisy, Adel A.
    ALGORITHMS, 2018, 11 (10):
  • [50] NSS-AKmeans: An Agglomerative Fuzzy K-Means Clustering Method with Automatic Selection of Cluster Number
    Zhang, Yanfeng
    Xu, Xiaofei
    Ye, Yunming
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 2, 2010, : 32 - 38