Sparse Fuzzy C-Means Clustering with Lasso Penalty

被引:1
|
作者
Parveen, Shazia [1 ]
Yang, Miin-Shen [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Appl Math, Taoyuan 32023, Taiwan
来源
SYMMETRY-BASEL | 2024年 / 16卷 / 09期
关键词
clustering; fuzzy c-means (FCM); sparse FCM (S-FCM); lasso; S-FCM-Lasso; evaluation measures; SELECTION; ALGORITHMS;
D O I
10.3390/sym16091208
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is a technique of grouping data into a homogeneous structure according to the similarity or dissimilarity measures between objects. In clustering, the fuzzy c-means (FCM) algorithm is the best-known and most commonly used method and is a fuzzy extension of k-means in which FCM has been widely used in various fields. Although FCM is a good clustering algorithm, it only treats data points with feature components under equal importance and has drawbacks for handling high-dimensional data. The rapid development of social media and data acquisition techniques has led to advanced methods of collecting and processing larger, complex, and high-dimensional data. However, with high-dimensional data, the number of dimensions is typically immaterial or irrelevant. For features to be sparse, the Lasso penalty is capable of being applied to feature weights. A solution for FCM with sparsity is sparse FCM (S-FCM) clustering. In this paper, we propose a new S-FCM, called S-FCM-Lasso, which is a new type of S-FCM based on the Lasso penalty. The irrelevant features can be diminished towards exactly zero and assigned zero weights for unnecessary characteristics by the proposed S-FCM-Lasso. Based on various clustering performance measures, we compare S-FCM-Lasso with the S-FCM and other existing sparse clustering algorithms on several numerical and real-life datasets. Comparisons and experimental results demonstrate that, in terms of these performance measures, the proposed S-FCM-Lasso performs better than S-FCM and existing sparse clustering algorithms. This validates the efficiency and usefulness of the proposed S-FCM-Lasso algorithm for high-dimensional datasets with sparsity.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Sparse possibilistic c-means clustering with Lasso
    Yang, Miin-Shen
    Benjamin, Josephine B. M.
    PATTERN RECOGNITION, 2023, 138
  • [2] Sparse learning based fuzzy c-means clustering
    Gu, Jing
    Jiao, Licheng
    Yang, Shuyuan
    Zhao, Jiaqi
    KNOWLEDGE-BASED SYSTEMS, 2017, 119 : 113 - 125
  • [3] Sparse Regularization in Fuzzy c-Means for High-Dimensional Data Clustering
    Chang, Xiangyu
    Wang, Qingnan
    Liu, Yuewen
    Wang, Yu
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (09) : 2616 - 2627
  • [4] Image segmentation by fuzzy c-means clustering algorithm with a novel penalty term
    Yang, Yong
    Huang, Shuying
    COMPUTING AND INFORMATICS, 2007, 26 (01) : 17 - 31
  • [5] Fuzzy c-means clustering of incomplete data
    Hathaway, RJ
    Bezdek, JC
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2001, 31 (05): : 735 - 744
  • [6] An Accelerated Fuzzy C-Means clustering algorithm
    Hershfinkel, D
    Dinstein, I
    APPLICATIONS OF FUZZY LOGIC TECHNOLOGY III, 1996, 2761 : 41 - 52
  • [7] Fuzzy Double C-Means Clustering Based on Sparse Self-Representation
    Gu, Jing
    Jiao, Licheng
    Yang, Shuyuan
    Liu, Fang
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2018, 26 (02) : 612 - 626
  • [8] Density-Weighted Fuzzy c-Means Clustering
    Hathaway, Richard J.
    Hu, Yingkang
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2009, 17 (01) : 243 - 252
  • [9] Unsupervised Multiview Fuzzy C-Means Clustering Algorithm
    Hussain, Ishtiaq
    Sinaga, Kristina P.
    Yang, Miin-Shen
    ELECTRONICS, 2023, 12 (21)
  • [10] Intuitionistic fuzzy C-means clustering algorithms
    Xu, Zeshui
    Wu, Junjie
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2010, 21 (04) : 580 - 590