Feature Subset Selection: A Correlation-Based SVM Filter Approach

Cited by: 13
Authors
Li, Boyang [1]
Wang, Qiangwei [1]
Hu, Jinglu [1]
Affiliations
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka, Japan
Keywords
feature selection; correlation-based clustering; support vector machine; feature ranking;
DOI
10.1002/tee.20641
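Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology];
Discipline classification code
0808; 0809;
Abstract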
The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions. (C) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
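The two-stage idea in the abstract (cluster correlated features, then keep one delegate per cluster) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives neither the clustering rule nor the sensitivity formula, so the greedy threshold clustering, the `threshold=0.8` cutoff, the use of the absolute weight of a linear SVM as the "sensitivity", and all function names are hypothetical stand-ins. It also assumes the features are on comparable scales, since linear SVM weights depend on feature scaling.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    if va == 0 or vb == 0:
        return 0.0
    return cov / math.sqrt(va * vb)

def correlation_clusters(columns, threshold=0.8):
    """Assumed rule: a feature joins the first cluster in which it is
    correlated (|r| >= threshold) with every member; otherwise it starts a new one."""
    clusters = []
    for j, col in enumerate(columns):
        for cluster in clusters:
            if all(abs(pearson(col, columns[k])) >= threshold for k in cluster):
                cluster.append(j)
                break
        else:
            clusters.append([j])
    return clusters

def linear_svm_weights(rows, labels, lam=0.01, epochs=200, lr=0.1):
    """Linear SVM fitted by full-batch subgradient descent on the
    L2-regularised hinge loss; labels must be +1 or -1."""
    n, d = len(rows), len(rows[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [lam * wj for wj in w], 0.0
        for x, y in zip(rows, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            if margin < 1:  # only margin violators contribute to the subgradient
                for j in range(d):
                    gw[j] -= y * x[j] / n
                gb -= y / n
        w = [wj - lr * g for wj, g in zip(w, gw)]
        b -= lr * gb
    return w

def select_features(rows, labels, threshold=0.8):
    """Return one delegate per cluster: the member with the largest |weight|
    (an assumed proxy for the SVM feature sensitivity)."""
    columns = [list(c) for c in zip(*rows)]
    clusters = correlation_clusters(columns, threshold)
    w = linear_svm_weights(rows, labels)
    return sorted(max(cluster, key=lambda j: abs(w[j])) for cluster in clusters)

# Toy data (made up for illustration): feature 1 is a near copy of feature 0,
# feature 2 is weakly correlated with both.
x0 = [-2.0, -1.5, -1.0, -0.5, 0.5, 1.0, 1.5, 2.0]
x1 = [-1.1, -0.7, -0.55, -0.2, 0.3, 0.45, 0.8, 0.95]
x2 = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
rows = [list(r) for r in zip(x0, x1, x2)]
labels = [-1, -1, -1, -1, 1, 1, 1, 1]
clusters = correlation_clusters([x0, x1, x2])
selected = select_features(rows, labels)
```

On this toy data, features 0 and 1 fall into one cluster and feature 2 into another, so the selection keeps one of the redundant pair plus feature 2. With real data, a kernel SVM and a different sensitivity measure may be more appropriate.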
Pages: 173 - 179
Page count: 7
Related papers
50 in total
  • [31] Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training
    Nasir, Inzamam Mashood
    Khan, Muhammad Attique
    Yasmin, Mussarat
    Shah, Jamal Hussain
    Gabryel, Marcin
    Scherer, Rafal
    Damasevicius, Robertas
    SENSORS, 2020, 20 (23) : 1 - 18
  • [32] Feature subset selection Filter-Wrapper based on low quality data
    Cadenas, Jose M.
    Carmen Garrido, M.
    Martinez, Raquel
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) : 6241 - 6252
  • [33] A Hybrid Approach for Feature Selection Based on Correlation Feature Selection and Genetic Algorithm
    Rani, Pooja
    Kumar, Rajneesh
    Jain, Anurag
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2022, 10 (01)
  • [34] A novel hybrid wrapper–filter approach based on genetic algorithm, particle swarm optimization for feature subset selection
    Fateme Moslehi
    Abdorrahman Haeri
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 1105 - 1127
  • [35] A filter approach to feature selection based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 84 - 89
  • [36] A feature subset selection algorithm based on feature activity and improved GA
    Li, Juan
    2015 11TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2015, : 206 - 210
  • [37] Towards a Better Feature Subset Selection Approach
    Shiba, Omar A. A.
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT 5TH INTERNATIONAL CONFERENCE 2010, 2010, : 675 - 678
  • [38] Correlation-based feature selection of single cell transcriptomics data from multiple sources
    Mitic, Nenad S.
    Malkov, Sasa N.
    Ruzicic, Mirjana M. Maljkovic
    Veljkovic, Aleksandar N.
    Cukic, Ivan Lj.
    Lin, Xin
    Lyu, Minjie
    Brusic, Vladimir
    JOURNAL OF BIG DATA, 2025, 12 (01)
  • [39] Investigating the effect of correlation-based feature selection on the performance of neural network in reservoir characterization
    Akande, Kabiru O.
    Owolabi, Taoreed O.
    Olatunji, Sunday O.
    JOURNAL OF NATURAL GAS SCIENCE AND ENGINEERING, 2015, 27 : 98 - 108
  • [40] A robust SVM-based approach with feature selection and outliers detection for classification problems
    Baldomero-Naranjo, Marta
    Martinez-Merino, Luisa I.
    Rodriguez-Chia, Antonio M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 178