Feature Subset Selection: A Correlation-Based SVM Filter Approach

Cited by: 13
Authors
Li, Boyang [1]
Wang, Qiangwei [1]
Hu, Jinglu [1]
Affiliations
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka, Japan
Keywords
feature selection; correlation-based clustering; support vector machine; feature ranking
DOI
10.1002/tee.20641
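Chinese Library Classification (CLC) codes
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract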
The central criterion of feature selection is that a good feature set contains features that are highly correlated with the output yet uncorrelated with each other. Based on this criterion, we address feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering groups features into clusters according to their pairwise correlation, so that each feature is highly correlated with the other features in its cluster but uncorrelated with the features in other clusters. From each cluster, we select one feature as the delegate based on its influence on the output, which is measured by the feature's sensitivity in the SVM. The proposed approach identifies relevant features and eliminates redundancy among them effectively. Its effectiveness is demonstrated through comparisons with other methods on real-world data sets of different dimensions. (C) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
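The two stages described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' algorithm: the greedy threshold clustering, the 0.8 threshold, the linear SVM trained by plain subgradient descent, and the use of the absolute weight |w_j| as the "sensitivity" of feature j are all assumptions made for the example, and every function name is hypothetical. The toy data assumes roughly standardized features.

```python
import math
import random


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


def cluster_features(columns, threshold=0.8):
    """Greedy grouping (assumed scheme): a feature joins the first cluster
    whose seed feature it correlates with (|r| >= threshold); otherwise it
    seeds a new cluster."""
    clusters = []
    for j, col in enumerate(columns):
        for cluster in clusters:
            if abs(pearson(col, columns[cluster[0]])) >= threshold:
                cluster.append(j)
                break
        else:
            clusters.append([j])
    return clusters


def train_linear_svm(rows, labels, lam=0.01, epochs=200, lr=0.1):
    """Full-batch subgradient descent on the L2-regularized hinge loss."""
    n, d = len(rows), len(rows[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [lam * wj for wj in w], 0.0
        for x, y in zip(rows, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            if margin < 1:  # only margin violators contribute to the hinge term
                for j in range(d):
                    gw[j] -= y * x[j] / n
                gb -= y / n
        w = [wj - lr * gj for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w, b


def select_features(columns, labels, threshold=0.8):
    """Pick one delegate per cluster, then rank delegates by sensitivity."""
    rows = [list(r) for r in zip(*columns)]
    w, _ = train_linear_svm(rows, labels)
    sens = [abs(wj) for wj in w]  # assumed sensitivity proxy: |w_j|
    clusters = cluster_features(columns, threshold)
    delegates = [max(c, key=lambda j: sens[j]) for c in clusters]
    ranked = sorted(delegates, key=lambda j: sens[j], reverse=True)
    return clusters, ranked


def make_toy_data(n=200, seed=0):
    """x1 is a noisy copy of x0 (redundant); x2 is irrelevant noise;
    the label depends only on x0 and x3."""
    rng = random.Random(seed)
    x0 = [rng.gauss(0, 1) for _ in range(n)]
    x1 = [v + 0.1 * rng.gauss(0, 1) for v in x0]
    x2 = [rng.gauss(0, 1) for _ in range(n)]
    x3 = [rng.gauss(0, 1) for _ in range(n)]
    labels = [1 if a + b > 0 else -1 for a, b in zip(x0, x3)]
    return [x0, x1, x2, x3], labels


columns, labels = make_toy_data()
clusters, ranked = select_features(columns, labels)
print("clusters:", clusters)
print("delegates ranked by sensitivity:", ranked)
```

On this toy data the redundant pair (features 0 and 1) should fall into one cluster and contribute a single delegate, while the irrelevant feature 2 should receive the lowest sensitivity among the delegates. A nonlinear SVM would need a different sensitivity measure than |w_j|.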
Pages: 173-179
Number of pages: 7
Related papers
50 in total
  • [1] A Hybrid Feature Selection Approach by Correlation-based Filters and SVM-RFE
    Zhang, Jing
    Hu, Xuegang
    Li, Peipei
    He, Wei
    Zhang, Yuhong
    Li, Huizong
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3684 - 3689
  • [2] Addressing Low Dimensionality Feature Subset Selection: ReliefF(-k) or Extended Correlation-Based Feature Selection(eCFS)?
    Tallon-Ballesteros, Antonio J.
    Cavique, Luis
    Fong, Simon
    14TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2019), 2020, 950 : 251 - 260
  • [3] Distributed correlation-based feature selection in spark
    Palma-Mendoza, Raul Jose
    de-Marcos, Luis
    Rodriguez, Daniel
    Alonso-Betanzos, Amparo
    INFORMATION SCIENCES, 2019, 496 : 287 - 299
  • [4] A new filter-based Gene selection method based on dragonfly optimization and correlation-based feature selection
    Ghoneimy, Mohamed
    Nabil, Emad
    Badr, Amr
    El-Khamisy, Sherif F.
    BIOSCIENCE RESEARCH, 2019, 16 (03) : 3139 - 3154
  • [5] A Correlation-Based Feature Weighting Filter for Naive Bayes
    Jiang, Liangxiao
    Zhang, Lungan
    Li, Chaoqun
    Wu, Jia
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 201 - 213
  • [6] A Novel Feature Selection Method Based on Correlation-Based Feature Selection in Cancer Recognition
    Lu, Xinguo
    Peng, Xianghua
    Deng, Yong
    Feng, Bingtao
    Liu, Ping
    Liao, Bo
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2014, 11 (02) : 427 - 433
  • [7] Correlation-Based Feature Selection for Enhanced Arrhythmia Classification
    Al Khaldy, Mohammad
    INTELLIGENT AND FUZZY SYSTEMS, VOL 2, INFUS 2024, 2024, 1089 : 355 - 364
  • [8] Distance Correlation-Based Feature Selection in Random Forest
    Ratnasingam, Suthakaran
    Munoz-Lopez, Jose
    ENTROPY, 2023, 25 (09)
  • [9] Heuristically Reducing the Cost of Correlation-based Feature Selection
    Brown, Katherine E.
    Talbert, Douglas A.
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 24 - 30
  • [10] Enhancing Big Data Feature Selection Using a Hybrid Correlation-Based Feature Selection
    Mohamad, Masurah
    Selamat, Ali
    Krejcar, Ondrej
    Crespo, Ruben Gonzalez
    Herrera-Viedma, Enrique
    Fujita, Hamido
    ELECTRONICS, 2021, 10 (23)