A size-insensitive integrity-based fuzzy c-means method for data clustering

被引:55
|
作者
Lin, Phen-Lan [1 ]
Huang, Po-Whei [2 ]
Kuo, C. H. [2 ]
Lai, Y. H. [2 ]
机构
[1] Providence Univ, Taichung 43301, Taiwan
[2] Natl Chung Hsing Univ, Taichung 40227, Taiwan
关键词
Fuzzy c-means; Cluster size insensitive; Integrity; Compactness; Purity; Data clustering; IMAGE SEGMENTATION; LOCAL INFORMATION; ALGORITHM; VALIDITY;
D O I
10.1016/j.patcog.2013.11.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy c-means (FCM) is one of the most popular techniques for data clustering. Since FCM tends to balance the number of data points in each cluster, centers of smaller clusters are forced to drift to larger adjacent clusters. For datasets with unbalanced clusters, the partition results of FCM are usually unsatisfactory. Cluster size insensitive FCM (csiFCM) dealt with "cluster-size sensitivity" problem by dynamically adjusting the condition value for the membership of each data point based on cluster size after the defuzzification step in each iterative cycle. However, the performance of csiFCM is sensitive to both the initial positions of cluster centers and the "distance" between adjacent clusters. In this paper, we present a cluster size insensitive integrity-based FCM method called siibFCM to improve the deficiency of csiFCM. The siibFCM method can determine the membership contribution of every data point to each individual cluster by considering cluster's integrity, which is a combination of compactness and purity. "Compactness" represents the distribution of data points within a cluster while "purity" represents how far a cluster is away from its adjacent cluster. We tested our siibFCM method and compared with the traditional FCM and csiFCM methods extensively by using artificially generated datasets with different shapes and data distributions, synthetic images, real images, and Escherichia coli dataset. Experimental results showed that the performance of siibFCM is superior to both traditional FCM and csiFCM in terms of the tolerance for "distance" between adjacent clusters and the flexibility of selecting initial cluster centers when dealing with datasets with unbalanced clusters. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2042 / 2056
页数:15
相关论文
共 50 条
  • [1] Fuzzy C-Means Clustering Algorithm for Image Segmentation Insensitive to Cluster Size
    Zhao Zhanmin
    Zhu Zhanlong
    Liu Yongjun
    Liu Ming
    Zheng Yibo
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (02)
  • [2] A new robust fuzzy c-means clustering method based on adaptive elastic distance
    Gao, Yunlong
    Wang, Zhihao
    Xie, Jiaxin
    Pan, Jinyan
    KNOWLEDGE-BASED SYSTEMS, 2022, 237
  • [3] Unsupervised Binning of Metagenomic Datasets Using Cluster Size Insensitive Fuzzy c-means Method
    Liu, Yu
    Liu, Fu
    Hou, Tao
    Wang, Ke
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3936 - 3939
  • [4] An Outlier Detection Method based on Fuzzy C-Means Clustering
    Li, Qiang
    Zhang, Jianpei
    Feng, Guangsheng
    ADVANCED DESIGN AND MANUFACTURE II, 2010, 419-420 : 165 - 168
  • [5] New fuzzy c-means clustering model based on the data weighted approach
    Tang, Chenglong
    Wang, Shigang
    Xu, Wei
    DATA & KNOWLEDGE ENGINEERING, 2010, 69 (09) : 881 - 900
  • [6] Extended fuzzy c-means: an analyzing data clustering problems
    Ramathilagam, S.
    Devi, R.
    Kannan, S. R.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (03): : 389 - 406
  • [7] Fuzzy c-means clustering based on weights and gene expression programming
    Jiang, Zhaohui
    Li, Tingting
    Min, Wenfang
    Qi, Zhao
    Rao, Yuan
    PATTERN RECOGNITION LETTERS, 2017, 90 : 1 - 7
  • [8] Regularized fuzzy c-means method for brain tissue clustering
    Hou, Z.
    Qian, W.
    Huang, S.
    Hu, Q.
    Nowinski, W. L.
    PATTERN RECOGNITION LETTERS, 2007, 28 (13) : 1788 - 1794
  • [9] On Fuzzy c-Means and Membership Based Clustering
    Torra, Vicenc
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT I (IWANN 2015), 2015, 9094 : 597 - 607
  • [10] A review on suppressed fuzzy c-means clustering models
    Szilagyi, Laszlo
    Lefkovits, Laszlo
    Iclanzan, David
    ACTA UNIVERSITATIS SAPIENTIAE INFORMATICA, 2020, 12 (02) : 302 - 324