A size-insensitive integrity-based fuzzy c-means method for data clustering

被引:55
作者
Lin, Phen-Lan [1 ]
Huang, Po-Whei [2 ]
Kuo, C. H. [2 ]
Lai, Y. H. [2 ]
机构
[1] Providence Univ, Taichung 43301, Taiwan
[2] Natl Chung Hsing Univ, Taichung 40227, Taiwan
关键词
Fuzzy c-means; Cluster size insensitive; Integrity; Compactness; Purity; Data clustering; IMAGE SEGMENTATION; LOCAL INFORMATION; ALGORITHM; VALIDITY;
D O I
10.1016/j.patcog.2013.11.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy c-means (FCM) is one of the most popular techniques for data clustering. Since FCM tends to balance the number of data points in each cluster, centers of smaller clusters are forced to drift to larger adjacent clusters. For datasets with unbalanced clusters, the partition results of FCM are usually unsatisfactory. Cluster size insensitive FCM (csiFCM) dealt with "cluster-size sensitivity" problem by dynamically adjusting the condition value for the membership of each data point based on cluster size after the defuzzification step in each iterative cycle. However, the performance of csiFCM is sensitive to both the initial positions of cluster centers and the "distance" between adjacent clusters. In this paper, we present a cluster size insensitive integrity-based FCM method called siibFCM to improve the deficiency of csiFCM. The siibFCM method can determine the membership contribution of every data point to each individual cluster by considering cluster's integrity, which is a combination of compactness and purity. "Compactness" represents the distribution of data points within a cluster while "purity" represents how far a cluster is away from its adjacent cluster. We tested our siibFCM method and compared with the traditional FCM and csiFCM methods extensively by using artificially generated datasets with different shapes and data distributions, synthetic images, real images, and Escherichia coli dataset. Experimental results showed that the performance of siibFCM is superior to both traditional FCM and csiFCM in terms of the tolerance for "distance" between adjacent clusters and the flexibility of selecting initial cluster centers when dealing with datasets with unbalanced clusters. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2042 / 2056
页数:15
相关论文
共 50 条
  • [21] Relative entropy fuzzy c-means clustering
    Zarinbal, M.
    Zarandi, M. H. Fazel
    Turksen, I. B.
    INFORMATION SCIENCES, 2014, 260 : 74 - 97
  • [22] Effective fuzzy c-means clustering algorithms for data clustering problems
    Kannan, S. R.
    Ramathilagam, S.
    Chung, P. C.
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (07) : 6292 - 6300
  • [23] A New Criterion for Improving Convergence of Fuzzy C-Means Clustering
    Perez-Ortega, Joaquin
    Moreno-Calderon, Carlos Fernando
    Roblero-Aguilar, Sandra Silvia
    Almanza-Ortega, Nelva Nely
    Frausto-Solis, Juan
    Pazos-Rangel, Rodolfo
    Rodriguez-Lelis, Jose Maria
    AXIOMS, 2024, 13 (01)
  • [24] Double fuzzy relaxation local information C-Means clustering
    Gao, Yunlong
    Zheng, Xingshen
    Wu, Qinting
    Zhang, Jiahao
    Cao, Chao
    Pan, Jinyan
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [25] Sparse learning based fuzzy c-means clustering
    Gu, Jing
    Jiao, Licheng
    Yang, Shuyuan
    Zhao, Jiaqi
    KNOWLEDGE-BASED SYSTEMS, 2017, 119 : 113 - 125
  • [26] Image Segmentation Using a Modified Fuzzy C-Means Clustering
    Hajibabaei, Neda
    Firoozbakht, Mohsen
    2015 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2015, : 624 - 630
  • [27] Missing value estimation for microarray data based on fuzzy C-means clustering
    Luo, JiaWei
    Yang, Tao
    Wang, Yan
    Eighth International Conference on High-Performance Computing in Asia-Pacific Region, Proceedings, 2005, : 611 - 616
  • [28] Improved ionospheric clutter classification method based on fuzzy C-means clustering
    Zhou J.
    Wei Y.
    Xu R.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (02): : 35 - 41
  • [29] DATA CLUSTERING BASED ON FUZZY C-MEANS AND CHAOTIC WHALE OPTIMIZATION ALGORITHMS
    Arslan, Hatice
    Toz, Metin
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2019, 37 (04): : 1103 - 1124
  • [30] Ant Colony Based Fuzzy C-Means Clustering for Very Large Data
    Mullick, Dhruv
    Garg, Ayush
    Bajaj, Arpit
    Garg, Ayush
    Aggarwal, Swati
    ADVANCES IN FUZZY LOGIC AND TECHNOLOGY 2017, VOL 2, 2018, 642 : 578 - 591