A hybrid feature selection scheme for high-dimensional data

被引:27
作者
Ganjei, Mohammad Ahmadi [1 ]
Boostani, Reza [1 ]
机构
[1] Shiraz Univ, Dept Comp Sci Engn & IT, Shiraz, Iran
关键词
Hybrid feature selection; High-dimensional datasets; Wrapper; Classification; Clustering; Microarray; ANT COLONY OPTIMIZATION; FILTER; INFORMATION; ALGORITHMS; DIAGNOSIS;
D O I
10.1016/j.engappai.2022.104894
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a growing interest in developing feature subset selection schemes for high-dimensional datasets by filter, wrapper, embedded, and hybrid manners. In this paper, we propose a new hybrid (filter-wrapper) feature selection approach. At first, in the filter step, we rank input features according to their relevance with the class label. Afterwards, we apply different clustering methods for the classification of the selected features. We perform an inner and outer cluster ranking based on the primary feature ranking in the next step. Then, different search strategies are performed on the best cluster of features in the wrapper phase. Moreover, we add some of them to the feature set based on the classifiers (nearest neighbor, decision tree, support vector machine, and random forests) feedback. Then, the algorithm goes to the next cluster, and this process is continued till all clusters are met. Finally, we compare the results of the proposed method to the state-of-the-art schemes. Comparison results imply the superiority of the proposed method to the counterparts on eight high-dimensional datasets in terms of accuracy and computational complexity.
引用
收藏
页数:12
相关论文
共 52 条
[1]   DB-FFR: A Modified Feature Selection Algorithm to Improve Discrimination Rate Between Bipolar Mood Disorder (BMD) and Schizophrenic Patients [J].
Alimardani, F. ;
Boostani, R. .
IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2018, 42 (03) :251-260
[2]   Presenting a new search strategy to select synchronization values for classifying bipolar mood disorders from schizophrenic patients [J].
Alimardani, F. ;
Boostani, R. ;
Azadehdel, M. ;
Ghanizadeh, A. ;
Rastegar, K. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (02) :913-923
[3]   Lip Feature Extraction and Reduction for HMM-Based Visual Speech Recognition Systems [J].
Alizadeh, S. ;
Boostani, R. ;
Asadpour, V. .
ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, :561-+
[4]  
[Anonymous], 1963, DISTRIBUTION EREE MU
[5]  
[Anonymous], 1995, P 6 INT S MICR HUM S
[6]  
[Anonymous], 1997, COMP STUDY FEATURE S
[7]   Two hybrid wrapper-filter feature selection algorithms applied to high-dimensional microarray experiments [J].
Apolloni, Javier ;
Leguizamon, Guillermo ;
Alba, Enrique .
APPLIED SOFT COMPUTING, 2016, 38 :922-932
[8]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[9]  
Bermejo P., 2008, IPMU 08
[10]   Fast wrapper feature subset selection in high-dimensional datasets by means of filter re-ranking [J].
Bermejo, Pablo ;
de la Ossa, Luis ;
Gamez, Jose A. ;
Puerta, Jose M. .
KNOWLEDGE-BASED SYSTEMS, 2012, 25 (01) :35-44