Feature Subset Selection: A Correlation-Based SVM Filter Approach

被引:13
作者
Li, Boyang [1 ]
Wang, Qiangwei [1 ]
Hu, Jinglu [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka, Japan
关键词
feature selection; correlation-based clustering; support vector machine; feature ranking;
D O I
10.1002/tee.20641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions. (C) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
引用
收藏
页码:173 / 179
页数:7
相关论文
共 50 条
  • [41] Correlation-based Feature Ranking for Online Classification
    Osman, Hassab Elgawi
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3077 - 3082
  • [42] Cofea: correlation-based feature selection for single-cell chromatin accessibility data
    Li, Keyi
    Chen, Xiaoyang
    Song, Shuang
    Hou, Lin
    Chen, Shengquan
    Jiang, Rui
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (01)
  • [43] A multiclass SVM classifier with teaching learning based feature subset selection for enzyme subclass classification
    Pradhan, Debasmita
    Sahoo, Biswajit
    Misra, Bijan Bihari
    Padhy, Sudarsan
    APPLIED SOFT COMPUTING, 2020, 96
  • [44] A Feature Selection Based Serial SVM Ensemble Classifier
    Cao, Jianjun
    Lv, Guojun
    Chang, Chen
    Li, Hongmei
    IEEE ACCESS, 2019, 7 : 144516 - 144523
  • [45] Feature selection algorithm based on SVM
    Sun Jiongjiong
    Liu Jun
    Wei Xuguang
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4113 - 4116
  • [46] A novel hybrid wrapper-filter approach based on genetic algorithm, particle swarm optimization for feature subset selection
    Moslehi, Fateme
    Haeri, Abdorrahman
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (03) : 1105 - 1127
  • [47] Improved feature size customized fast correlation-based filter for Naive Bayes text classification
    Zhang, Yun
    Zhang, Yude
    He, Wei
    Yu, Shujuan
    Zhao, Shengmei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) : 3117 - 3127
  • [48] Correlation-based feature selection and classification via regression of segmented chromosomes using geometric features
    Tanvi Arora
    Renu Dhir
    Medical & Biological Engineering & Computing, 2017, 55 : 733 - 745
  • [49] Detecting Mild Cognitive Impairment from Spontaneous Speech by Correlation-Based Phonetic Feature Selection
    Gosztolya, Gabor
    Toth, Laszlo
    Grosz, Tamas
    Vincze, Veronika
    Hoffmann, Ildiko
    Szatloczki, Greta
    Pakaski, Magdolna
    Kalman, Janos
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 107 - 111
  • [50] A filter-based feature selection approach in multilabel classification
    Shaikh, Rafia
    Rafi, Muhammad
    Mahoto, Naeem Ahmed
    Sulaiman, Adel
    Shaikh, Asadullah
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):