Feature clustering based support vector machine recursive feature elimination for gene selection

被引:0
作者
Xiaojuan Huang
Li Zhang
Bangjun Wang
Fanzhang Li
Zhao Zhang
机构
[1] Soochow University Suzhou,School of Computer Science and Technology & Joint International Research Laboratory of Machine Learning and Neuromorphic Computing
来源
Applied Intelligence | 2018年 / 48卷
关键词
Support vector machine; Feature selection; Gene clustering; Recursive feature elimination; Gene relevancy; Gene redundancy;
D O I
暂无
中图分类号
学科分类号
摘要
In a DNA microarray dataset, gene expression data often has a huge number of features(which are referred to as genes) versus a small size of samples. With the development of DNA microarray technology, the number of dimensions increases even faster than before, which could lead to the problem of the curse of dimensionality. To get good classification performance, it is necessary to preprocess the gene expression data. Support vector machine recursive feature elimination (SVM-RFE) is a classical method for gene selection. However, SVM-RFE suffers from high computational complexity. To remedy it, this paper enhances SVM-RFE for gene selection by incorporating feature clustering, called feature clustering SVM-RFE (FCSVM-RFE). The proposed method first performs gene selection roughly and then ranks the selected genes. First, a clustering algorithm is used to cluster genes into gene groups, in each which genes have similar expression profile. Then, a representative gene is found to represent a gene group. By doing so, we can obtain a representative gene set. Then, SVM-RFE is applied to rank these representative genes. FCSVM-RFE can reduce the computational complexity and the redundancy among genes. Experiments on seven public gene expression datasets show that FCSVM-RFE can achieve a better classification performance and lower computational complexity when compared with the state-the-art-of methods, such as SVM-RFE.
引用
收藏
页码:594 / 607
页数:13
相关论文
共 50 条
[41]   Optimal Feature Selection for Support Vector Machine Classifiers [J].
Strub, O. .
2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, :304-308
[42]   Feature Selection of Power System Transient Stability Assessment Based on Random Forest and Recursive Feature Elimination [J].
Zhang, Chun ;
Li, Yansong ;
Yu, Zhihong ;
Tian, Fang .
2016 IEEE PES ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2016, :1264-1268
[43]   Hybrid Classification Model of Correlation-based Feature Selection and Support Vector Machine [J].
Dubey, Vimal Kumar ;
Saxena, Amit Kumar .
2016 IEEE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ADVANCED COMPUTING (ICCTAC), 2016,
[44]   Mass Diagnosis in Mammography with Mutual Information Based Feature Selection and Support Vector Machine [J].
Liu, Xiaoming ;
Li, Bo ;
Liu, Jun ;
Xu, Xin ;
Feng, Zhilin .
INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2012, 2012, 7390 :1-8
[45]   Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods [J].
Polat, Huseyin ;
Mehr, Homay Danaei ;
Cetin, Aydin .
JOURNAL OF MEDICAL SYSTEMS, 2017, 41 (04)
[46]   Differential evolution-based parameters optimisation and feature selection for support vector machine [J].
Li, Jun ;
Ding, Lixin ;
Li, Bo .
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2016, 13 (04) :355-363
[47]   Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods [J].
Huseyin Polat ;
Homay Danaei Mehr ;
Aydin Cetin .
Journal of Medical Systems, 2017, 41
[48]   SOUND EVENT CLASSIFICATION BASED ON FEATURE INTEGRATION, RECURSIVE FEATURE ELIMINATION AND STRUCTURED CLASSIFICATION [J].
Tran, Huy Dat ;
Li, Haizhou .
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :177-180
[49]   The Improved Particle Swarm Optimization for Feature Selection of Support Vector Machine [J].
Wang, Sipeng ;
Ding, Sheng .
PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION SYSTEMS (ICCIS 2017), 2015, :314-317
[50]   Feature selection with kernelized multi-class support vector machine [J].
Guo, Yinan ;
Zhang, Zirui ;
Tang, Fengzhen .
PATTERN RECOGNITION, 2021, 117