Principal Component Analysis based Feature Selection for clustering

被引:6
|
作者
Xu, Jun-Ling [1 ]
Xu, Bao-Wen [1 ,2 ]
Zhang, Wei-Feng [3 ]
Cui, Zi-Feng [1 ]
机构
[1] Southeast Univ, Sch Engn & Comp Sci, Nanjing 211189, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Dept Comp, Nanjing 210003, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Dept Comp, Nanjing 210003, Peoples R China
来源
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2008年
基金
中国国家自然科学基金;
关键词
feature selection; Principal Component Analysis; clustering;
D O I
10.1109/ICMLC.2008.4620449
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Feature Extraction (FE) methods have been proved to be very effective for dimension reduction, but the features attained are meaningless. In order to exploit the effectiveness of FE methods to support Feature Selection (FS), this paper proposed a new FS approach for clustering based on Principal Component Analysis (PCA) called PS. It first uses PCA to transform the data from original feature space into a new feature space whose features are linear combination of the original ones, and then evaluates the importance of the original features based on the newly generated features and the feature importance measure proposed in this paper, finally selects features incrementally according to their importance to improve the performance of the clustering algorithm. Experiment is carried out on several popular data sets and the results show the advantages of the proposed approach.
引用
收藏
页码:460 / +
页数:2
相关论文
共 50 条
  • [31] A Principal Component Analysis and Clustering based Load Balancing Strategy for Cloud Computing
    Xue, Law Siew
    Abd Majid, NazatulAini
    Sundararajan, Elankovan A.
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2020, 9 (01): : 93 - 100
  • [32] Feature selection of postural summary statistic scores based on principal component analysis in Parkinson's disease
    Rocchi, L
    Chiari, L
    Cappello, A
    Horak, FB
    MODELLING IN MEDICINE AND BIOLOGY VI, 2005, 8 : 533 - 543
  • [33] Channel Feature Extraction and Modeling Based on Principal Component Analysis
    Yao, Biyuan
    Yin, Jianhua
    Li, Hui
    Zhou, Hui
    Wu, Wei
    EMBEDDED SYSTEMS TECHNOLOGY, ESTC 2017, 2018, 857 : 193 - 209
  • [34] A modified genetic algorithm and weighted principal component analysis based feature selection and extraction strategy in agriculture
    Shastry, K. Aditya
    Sanjay, H. A.
    KNOWLEDGE-BASED SYSTEMS, 2021, 232
  • [35] Clustering Based Analysis of Spirometric Data Using Principal Component Analysis and Self Organizing Map
    Asaithambi, Mythili
    Manoharan, Sujatha C.
    Subramanian, Srinivasan
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT II (SEMCCO 2013), 2013, 8298 : 523 - +
  • [36] IMPROVING CLUSTERING OF WEB BOT AND HUMAN SESSIONS BY APPLYING PRINCIPAL COMPONENT ANALYSIS
    Suchacka, Grazyna
    PROCEEDINGS OF THE 33RD INTERNATIONAL ECMS CONFERENCE ON MODELLING AND SIMULATION (ECMS 2019), 2019, 33 (01): : 434 - 440
  • [37] A clustering-based feature selection via feature separability
    Jiang, Shengyi
    Wang, Lianxi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (02) : 927 - 937
  • [38] An improved affinity propagation clustering algorithm based on principal component analysis and variation coefficient
    Han, Xuming, 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (07): : 549 - 555
  • [39] Curious Feature Selection-Based Clustering
    Moran M.
    Gordon G.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6146 - 6158
  • [40] Feature Selection for Density-Based Clustering
    Ling, Yun
    Ye, Chongyi
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT UBIQUITOUS COMPUTING AND EDUCATION, 2009, : 226 - 229