Semi-supervised possibilistic c-means clustering algorithm based on feature weights for imbalanced data

被引:10
作者
Yu, Haiyan [1 ]
Xu, Xiaoyu [1 ]
Li, Honglei [1 ]
Wu, Yuting [1 ]
Lei, Bo [1 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Telecommun & Informat Engn, Xian 710121, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Possibilistic c -means clustering (PCM); Semi; -supervised; Feature weight; Imbalanced data; Image segmentation; MAHALANOBIS DISTANCE; FUZZY; ENTROPY;
D O I
10.1016/j.knosys.2024.111388
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The possibilistic c-means clustering (PCM) algorithm improves the robustness of fuzzy c-means clustering (FCM) to noise and outliers by releasing the probabilistic constraint of memberships. The semi-supervised possibilistic cmeans clustering (SSPCM) algorithm improves the clustering effect on datasets with imbalanced sizes by introducing a small amount of label information. However, the traditional semi-supervised algorithm still faces the problem of low utilization of supervision information for datasets with large differences in sample sizes. Moreover, the Euclidean distance, which treats features equally, cannot handle feature-imbalanced data. Therefore, this paper proposes a semi-supervised possibilistic c-means clustering algorithm based on feature weights (FW-SSPCM) by introducing the ideas of supervised centers. First, the algorithm introduces the supervised center into the objective function of the SSPCM to improve the utilization rate of supervision information and thus guide the center iteration of small clusters. Second, the feature weighting strategy is introduced in the objective function to adaptively assign feature weights according to the importance of different features in different clusters, thus improving the adaptability of the algorithm to feature-imbalanced datasets. In addition, to improve the robustness of the antinoise effect and retain additional image details, a new image segmentation algorithm based on FW-SSPCM and local information (LFW-SSPCM) is proposed by introducing local spatial information obtained by bilateral filtering. Finally, through clustering experiments on synthetic data, UCI datasets and on color images characteristic of multiple features, including imbalanced sizes, imbalanced features and strong noise injection, the clustering performances of the proposed FW-SSPCM and LFW-SSPCM proposed in this paper are significantly better than those of several related clustering algorithms.
引用
收藏
页数:37
相关论文
共 50 条
  • [41] NCM: Neutrosophic c-means clustering algorithm
    Guo, Yanhui
    Sengur, Abdulkadir
    PATTERN RECOGNITION, 2015, 48 (08) : 2710 - 2724
  • [42] Possibilistic c-means clustering based on the nearest-neighbour isolation similarity
    Zhang, Yong
    Chen, Tianzhen
    Jiang, Yuqing
    Wang, Jianying
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (02) : 1781 - 1792
  • [43] Generalized entropy based possibilistic fuzzy C-Means for clustering noisy data and its convergence proof
    Askari, S.
    Montazerin, N.
    Zarandi, M. H. Fazel
    Hakimi, E.
    NEUROCOMPUTING, 2017, 219 : 186 - 202
  • [44] Adaptive Semi-Supervised Fuzzy C-Means Method With Local Spatial Information and Pre-Clustering for Image Segmentation
    Chen, Hao-Ran
    Wang, Xiao-Peng
    Wu, Jia-Xin
    Wang, Hai-Zhou
    IEEE ACCESS, 2024, 12 : 196328 - 196346
  • [45] POSSIBILISTIC FUZZY C-MEANS CLUSTERING ON MEDICAL DIAGNOSTIC SYSTEMS
    Simhachalam, B.
    Ganesan, G.
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1125 - 1129
  • [46] A semi-supervised clustering-based classification model for classifying imbalanced data streams in the presence of scarcely labelled data
    Bhowmick K.
    Narvekar M.
    International Journal of Business Intelligence and Data Mining, 2022, 20 (02) : 170 - 191
  • [47] Image Segmentation Algorithm Based on Context Fuzzy C-Means Clustering
    Xu Jindong
    Zhao Tianyu
    Feng Guozheng
    Ou Shifeng
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2079 - 2086
  • [48] Grid-based C-means Clustering Algorithm for Image Segmentation
    Yue, Shihong
    Li, YueFeng
    He, Boyang
    2011 INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND NEURAL COMPUTING (FSNC 2011), VOL I, 2011, : 58 - 61
  • [49] Application of Fuzzy and Possibilistic c-Means Clustering Models in Blind Speaker Clustering
    Gosztolya, Gabor
    Szilagyi, Laszlo
    ACTA POLYTECHNICA HUNGARICA, 2015, 12 (07) : 41 - 56
  • [50] Vague C-means clustering algorithm
    Xu, Chao
    Zhang, Peilin
    Li, Bing
    Wu, Dinghai
    Fan, Hongbo
    PATTERN RECOGNITION LETTERS, 2013, 34 (05) : 505 - 510