Feature selection based on a modified fuzzy C-means algorithm with supervision

被引:36
|
作者
Marcelloni, F [1 ]
机构
[1] Univ Pisa, Dipartimento Ingn Informaz Elettr Informat Teleco, I-56122 Pisa, Italy
关键词
feature selection; fuzzy C-means; k-nearest neighbors; supervised learning;
D O I
10.1016/S0020-0255(02)00402-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we propose a new approach to feature selection based on a modified fuzzy C-means algorithm with supervision (MFCMS). MFCMS completes the unsupervised learning of classical fuzzy C-means with labeled patterns. The labeled patterns allow MFCMS to accurately model the shape of each cluster and consequently to highlight the features which result to be particularly effective to characterize a cluster. These features are distinguished by a low variance of their values for the patterns with a high membership degree to the cluster. If, with respect to these features, the distance between the prototype of the cluster and the prototypes of the other clusters is high, then these features have the property of discriminating between the cluster and the other clusters. To take these two aspects into account, for each cluster and each feature, we introduce a purposely defined index: the higher the value of the index, the higher the discrimination capability of the feature for the cluster. We execute MFCMS on the training set considering all patterns as labeled. Then, we retain the features which are associated, at least for one cluster, with an index larger than a threshold T. We applied MFCMS to several real-world pattern classification benchmarks. We used the well-known k-nearest neighbors as learning algorithm. We show that feature selection performed by MFCMS achieved an improvement in generalization on all data sets. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:201 / 226
页数:26
相关论文
共 50 条
  • [1] Modified fuzzy C-means algorithm for feature selection
    Frosini, Graziano
    Lazzerini, Beatrice
    Marcelloni, Francesco
    Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS, 2000, : 148 - 152
  • [2] A modified fuzzy C-means algorithm for feature selection
    Frosini, G
    Lazzerini, B
    Marcelloni, F
    PEACHFUZZ 2000 : 19TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 2000, : 148 - 152
  • [3] Optimization of Fuzzy C-Means Algorithm Using Feature Selection Strategies
    Maheshwari, Kanika
    Sharma, Vivek
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 368 - 379
  • [4] Clonal Selection based Fuzzy C-Means Algorithm for Clustering
    Ludwig, Simone A.
    GECCO'14: PROCEEDINGS OF THE 2014 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2014, : 105 - 112
  • [5] Combining Fuzzy C-Means Clustering with Fuzzy Rough Feature Selection
    Zhao, Ruonan
    Gu, Lize
    Zhu, Xiaoning
    APPLIED SCIENCES-BASEL, 2019, 9 (04):
  • [6] A modified fuzzy C-Means algorithm based on gravity and cluster merging
    Zhong, Jiang
    Liu, Longhai
    Chen, Qiang
    Chen, Xue
    Zhou, Ying
    Journal of Information and Computational Science, 2010, 7 (13): : 2699 - 2706
  • [7] Image retrieval based on modified fuzzy C-means clustering algorithm
    Zhang, PZ
    Fu, P
    Xiao, J
    Meng, D
    Proceedings of the Eighth IASTED International Conference on Internet and Multimedia Systems and Applications, 2004, : 103 - 107
  • [8] Fuzzy C-Means Cluster Segmentation Algorithm Based on Modified Membership
    Li, Yanling
    Li, Gang
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 135 - +
  • [9] A Modified Possibilistic Fuzzy c-Means Clustering Algorithm
    Qu, Fuheng
    Hu, Yating
    Xue, Yaohong
    Yang, Yong
    2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 858 - 862
  • [10] Fuzzy C-Means Based Feature Selection Mechanism for Wireless Intrusion Detection
    Tseng, Chinyang Henry
    Tsaur, Woei-Jiunn
    Mujiono
    2021 INTERNATIONAL CONFERENCE ON SECURITY AND INFORMATION TECHNOLOGIES WITH AI, INTERNET COMPUTING AND BIG-DATA APPLICATIONS, 2023, 314 : 143 - 152