Sparse Fuzzy C-Means Clustering with Lasso Penalty

被引:1
|
作者
Parveen, Shazia [1 ]
Yang, Miin-Shen [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Appl Math, Taoyuan 32023, Taiwan
来源
SYMMETRY-BASEL | 2024年 / 16卷 / 09期
关键词
clustering; fuzzy c-means (FCM); sparse FCM (S-FCM); lasso; S-FCM-Lasso; evaluation measures; SELECTION; ALGORITHMS;
D O I
10.3390/sym16091208
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering is a technique of grouping data into a homogeneous structure according to the similarity or dissimilarity measures between objects. In clustering, the fuzzy c-means (FCM) algorithm is the best-known and most commonly used method and is a fuzzy extension of k-means in which FCM has been widely used in various fields. Although FCM is a good clustering algorithm, it only treats data points with feature components under equal importance and has drawbacks for handling high-dimensional data. The rapid development of social media and data acquisition techniques has led to advanced methods of collecting and processing larger, complex, and high-dimensional data. However, with high-dimensional data, the number of dimensions is typically immaterial or irrelevant. For features to be sparse, the Lasso penalty is capable of being applied to feature weights. A solution for FCM with sparsity is sparse FCM (S-FCM) clustering. In this paper, we propose a new S-FCM, called S-FCM-Lasso, which is a new type of S-FCM based on the Lasso penalty. The irrelevant features can be diminished towards exactly zero and assigned zero weights for unnecessary characteristics by the proposed S-FCM-Lasso. Based on various clustering performance measures, we compare S-FCM-Lasso with the S-FCM and other existing sparse clustering algorithms on several numerical and real-life datasets. Comparisons and experimental results demonstrate that, in terms of these performance measures, the proposed S-FCM-Lasso performs better than S-FCM and existing sparse clustering algorithms. This validates the efficiency and usefulness of the proposed S-FCM-Lasso algorithm for high-dimensional datasets with sparsity.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Analytically tractable case of fuzzy c-means clustering
    Pianykh, OS
    PATTERN RECOGNITION, 2006, 39 (01) : 35 - 46
  • [22] A review on suppressed fuzzy c-means clustering models
    Szilagyi, Laszlo
    Lefkovits, Laszlo
    Iclanzan, David
    ACTA UNIVERSITATIS SAPIENTIAE INFORMATICA, 2020, 12 (02) : 302 - 324
  • [23] Fuzzy C-Means Clustering and Sonification of HRV Features
    Borthakur, Debanjan
    Grace, Victoria
    Batchelor, Paul
    Dubey, Harishchandra
    2019 4TH IEEE/ACM INTERNATIONAL CONFERENCE ON CONNECTED HEALTH: APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES (CHASE), 2019, : 53 - 57
  • [24] A Centroid Auto-Fused Hierarchical Fuzzy c-Means Clustering
    Lin, Yunxia
    Chen, Songcan
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (07) : 2006 - 2017
  • [25] Sparsity Fuzzy C-Means Clustering With Principal Component Analysis Embedding
    Chen, Jingwei
    Zhu, Jianyong
    Jiang, Hongyun
    Yang, Hui
    Nie, Feiping
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (07) : 2099 - 2111
  • [26] Image Segmentation Algorithm Based on Context Fuzzy C-Means Clustering
    Xu Jindong
    Zhao Tianyu
    Feng Guozheng
    Ou Shifeng
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2079 - 2086
  • [27] Fuzzy C-means based clustering for linearly and nonlinearly separable data
    Tsai, Du-Ming
    Lin, Chung-Chan
    PATTERN RECOGNITION, 2011, 44 (08) : 1750 - 1760
  • [28] Transformer Condition Assessment Using Fuzzy C-means Clustering Techniques
    Eke, Samuel
    Clerc, Guy
    Aka-Ngnui, Thomas
    Fofana, I.
    IEEE ELECTRICAL INSULATION MAGAZINE, 2019, 35 (02) : 47 - 55
  • [29] k-means and fuzzy c-means fusion for object clustering
    Heni, Ashraf
    Jdey, Imen
    Ltifi, Hela
    2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 177 - 182
  • [30] OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM
    Mehdizadeh, E.
    Sadi-Nezhad, S.
    Tavakkoli-Moghaddam, R.
    IRANIAN JOURNAL OF FUZZY SYSTEMS, 2008, 5 (03): : 1 - 14