Principal Component Analysis based Feature Selection for clustering

Cited by: 6
Authors
Xu, Jun-Ling [1]
Xu, Bao-Wen [1, 2]
Zhang, Wei-Feng [3]
Cui, Zi-Feng [1]
Affiliations
[1] Southeast Univ, Sch Engn & Comp Sci, Nanjing 211189, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Dept Comp, Nanjing 210003, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Dept Comp, Nanjing 210003, Peoples R China
Source
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2008
Funding
National Natural Science Foundation of China
Keywords
feature selection; Principal Component Analysis; clustering
DOI
10.1109/ICMLC.2008.4620449
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Feature Extraction (FE) methods have proven very effective for dimension reduction, but the extracted features themselves are not meaningful. To exploit the effectiveness of FE methods in support of Feature Selection (FS), this paper proposes a new FS approach for clustering based on Principal Component Analysis (PCA), called PS. It first uses PCA to transform the data from the original feature space into a new feature space whose features are linear combinations of the original ones. It then evaluates the importance of the original features using the newly generated features and the feature importance measure proposed in the paper. Finally, it selects features incrementally according to their importance to improve the performance of the clustering algorithm. Experiments on several popular data sets show the advantages of the proposed approach.
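The three steps named in the abstract (PCA transform, scoring of the original features, incremental selection) can be sketched roughly as follows. The abstract does not state the paper's feature importance measure, so the scoring rule here (absolute PCA loadings weighted by each component's share of variance) is an assumed stand-in, not the authors' measure. The function names `pca_feature_scores`, `rank_features`, and `select_incrementally`, and the caller-supplied `evaluate` callback, are hypothetical.

```python
import numpy as np

def pca_feature_scores(X):
    """Score each original feature from PCA loadings (assumed measure, not the paper's)."""
    Xc = X - X.mean(axis=0)                  # center each feature
    cov = np.cov(Xc, rowvar=False)           # feature-by-feature covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)   # symmetric eigendecomposition
    eigvals = np.clip(eigvals, 0.0, None)    # guard against tiny negative values
    weights = eigvals / eigvals.sum()        # share of variance per component
    # Rows of eigvecs index original features, columns index components.
    return np.abs(eigvecs) @ weights

def rank_features(X):
    """Return feature indices sorted from most to least important, plus the scores."""
    scores = pca_feature_scores(X)
    return np.argsort(scores)[::-1], scores

def select_incrementally(X, evaluate, max_features=None):
    """Add features in importance order; keep the prefix with the best evaluate() value.

    `evaluate` is a caller-supplied function mapping a data matrix to a
    clustering quality value (higher is better); it is an assumption here.
    """
    order, _ = rank_features(X)
    chosen, best_subset, best_value = [], [], -np.inf
    for j in order[:max_features]:
        chosen.append(int(j))
        value = evaluate(X[:, chosen])
        if value > best_value:
            best_subset, best_value = list(chosen), value
    return best_subset, best_value
```

In practice `evaluate` might run a clustering algorithm on the candidate subset and return an internal validity index; which algorithm and index the paper uses is not stated in the abstract.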
Pages: 460 / +
Number of pages: 2