An Experimental Study on Unsupervised Clustering-based Feature Selection Methods

被引:1
作者
Covoes, Thiago F. [1 ]
Hruschka, Eduardo R. [1 ]
机构
[1] Univ Sao Paulo, Dept Comp Sci, Sao Carlos, SP, Brazil
来源
2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS | 2009年
关键词
unsupervised feature selection; feature clustering; clustering problems; GENE-EXPRESSION DATA; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ISDA.2009.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
引用
收藏
页码:993 / 1000
页数:8
相关论文
共 50 条
[21]   A new unsupervised feature selection algorithm using similarity-based feature clustering [J].
Zhu, Xiaoyan ;
Wang, Yu ;
Li, Yingbin ;
Tan, Yonghui ;
Wang, Guangtao ;
Song, Qinbao .
COMPUTATIONAL INTELLIGENCE, 2019, 35 (01) :2-22
[22]   Clustering-based feature subset selection with analysis on the redundancy-complementarity dimension [J].
Chen, Zhijun ;
Chen, Qiushi ;
Zhang, Yishi ;
Zhou, Lei ;
Jiang, Junfeng ;
Wu, Chaozhong ;
Huang, Zhen .
COMPUTER COMMUNICATIONS, 2021, 168 :65-74
[23]   A review of unsupervised feature selection methods [J].
Saúl Solorio-Fernández ;
J. Ariel Carrasco-Ochoa ;
José Fco. Martínez-Trinidad .
Artificial Intelligence Review, 2020, 53 :907-948
[24]   Research on Feature Selection Methods Based on Feature Clustering and Information Theory [J].
Wang, Wenhui ;
Zhou, Changyin .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 :71-82
[25]   Subspace Clustering via Joint Unsupervised Feature Selection [J].
Dong, Wenhua ;
Wu, Xiao-Jun ;
Li, Hui ;
Feng, Zhen-Hua ;
Kittler, Josef .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :3892-3898
[26]   A Clustering-Based Approach to Reduce Feature Redundancy [J].
de Amorim, Renato Cordeiro ;
Mirkin, Boris .
KNOWLEDGE, INFORMATION AND CREATIVITY SUPPORT SYSTEMS: RECENT TRENDS, ADVANCES AND SOLUTIONS, KICSS 2013, 2016, 364 :465-475
[27]   CLUSTERING-BASED FEATURE LEARNING ON VARIABLE STARS [J].
Mackenzie, Cristobal ;
Pichara, Karim ;
Protopapas, Pavlos .
ASTROPHYSICAL JOURNAL, 2016, 820 (02)
[28]   A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data [J].
Song, Qinbao ;
Ni, Jingjie ;
Wang, Guangtao .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) :1-14
[29]   Unsupervised feature selection via discrete spectral clustering and feature weights [J].
Shang, Ronghua ;
Kong, Jiarui ;
Wang, Lujuan ;
Zhang, Weitong ;
Wang, Chao ;
Li, Yangyang ;
Jiao, Licheng .
NEUROCOMPUTING, 2023, 517 :106-117
[30]   A new unsupervised feature selection method for text clustering based on genetic algorithms [J].
Pirooz Shamsinejadbabki ;
Mohammad Saraee .
Journal of Intelligent Information Systems, 2012, 38 :669-684