An Experimental Study on Unsupervised Clustering-based Feature Selection Methods

被引:1
作者
Covoes, Thiago F. [1 ]
Hruschka, Eduardo R. [1 ]
机构
[1] Univ Sao Paulo, Dept Comp Sci, Sao Carlos, SP, Brazil
来源
2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS | 2009年
关键词
unsupervised feature selection; feature clustering; clustering problems; GENE-EXPRESSION DATA; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ISDA.2009.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
引用
收藏
页码:993 / 1000
页数:8
相关论文
共 50 条
  • [31] Integration of dense subgraph finding with feature clustering for unsupervised feature selection
    Bandyopadhyay, Sanghamitra
    Bhadra, Tapas
    Mitra, Pabitra
    Maulik, Ujjwal
    [J]. PATTERN RECOGNITION LETTERS, 2014, 40 : 104 - 112
  • [32] Unsupervised feature selection via discrete spectral clustering and feature weights
    Shang, Ronghua
    Kong, Jiarui
    Wang, Lujuan
    Zhang, Weitong
    Wang, Chao
    Li, Yangyang
    Jiao, Licheng
    [J]. NEUROCOMPUTING, 2023, 517 : 106 - 117
  • [33] Unsupervised Feature Selection Technique Based on Genetic Algorithm for Improving the Text Clustering
    Abualigah, Laith Mohammad
    Khader, Ahamad Tajudin
    Al-Betar, Mohammed Azmi
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2016,
  • [34] An Improved Fast Clustering-Based Feature Subset Selection Algorithm for Multi Featured dataset
    Sharma, Poonam
    Mathur, Abhisek
    Chaturvedi, Sushil
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ENGINEERING AND TECHNOLOGY RESEARCH (ICAETR), 2014,
  • [35] SONOELASTOMICS FOR BREAST TUMOR CLASSIFICATION: A RADIOMICS APPROACH WITH CLUSTERING-BASED FEATURE SELECTION ON SONOELASTOGRAPHY
    Zhang, Qi
    Xiao, Yang
    Suo, Jingfeng
    Shi, Jun
    Yu, Jinhua
    Guo, Yi
    Wang, Yuanyuan
    Zheng, Hairong
    [J]. ULTRASOUND IN MEDICINE AND BIOLOGY, 2017, 43 (05) : 1058 - 1069
  • [36] Ranking Based Unsupervised Feature Selection Methods: An Empirical Comparative Study in High Dimensional Datasets
    Solorio-Fernandez, Saul
    Ariel Carrasco-Ochoa, J.
    Fco Martinez-Trinidad, Jose
    [J]. ADVANCES IN SOFT COMPUTING, MICAI 2018, PT I, 2018, 11288 : 205 - 218
  • [37] Novel hyperbolic clustering-based band hierarchy (HCBH) for effective unsupervised band selection of hyperspectral images
    Sun, He
    Zhang, Lei
    Ren, Jinchang
    Huang, Hua
    [J]. PATTERN RECOGNITION, 2022, 130
  • [38] Group Based Unsupervised Feature Selection
    Perera, Kushani
    Chan, Jeffrey
    Karunasekera, Shanika
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 805 - 817
  • [39] STUDY ON UNSUPERVISED FEATURE SELECTION METHOD BASED ON EXTENDED ENTROPY
    Sun, Zhanquan
    Li, Feng
    Huang, Huifen
    [J]. COMPUTING AND INFORMATICS, 2019, 38 (01) : 223 - 239
  • [40] Unsupervised feature selection method based on iterative similarity graph factorization and clustering by modularity
    Oliveira, Marcos de S.
    Queiroz, Sergio R. de M.
    de Carvalho, Francisco de A. T.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208