Efficient feature selection and classification algorithm based on PSO and rough sets

被引:0
作者
Ramesh Kumar Huda
Haider Banka
机构
[1] Indian Institute of Technology (ISM),
来源
Neural Computing and Applications | 2019年 / 31卷
关键词
Feature selection; Rough sets; New quick reduct; Inconsistency handler; Classification; Fitness function; Particle Swarm Optimization;
D O I
暂无
中图分类号
学科分类号
摘要
The high-dimensional data are often characterized by more number of features with less number of instances. Many of the features are irrelevant and redundant. These features may be especially harmful in case of extreme number of features carries the problem of memory usage in order to represent the datasets. On the other hand relatively small training set, where this irrelevancy and redundancy makes harder to evaluate. Hence, in this paper we propose an efficient feature selection and classification method based on Particle Swarm Optimization (PSO) and rough sets. In this study, we propose the inconsistency handler algorithm for handling inconsistency in dataset, new quick reduct algorithm for handling irrelevant/noisy features and fitness function with three parameters, the classification quality of feature subset, remaining features and the accuracy of approximation. The proposed method is compared with two traditional and three fusion of PSO and rough set-based feature selection methods. In this study, Decision Tree and Naive Bayes classifiers are used to calculate the classification accuracy of the selected feature subset on nine benchmark datasets. The result shows that the proposed method can automatically selects small feature subset with better classification accuracy than using all features. The proposed method also outperforms the two traditional and three existing PSO and rough set-based feature selection methods in terms of the classification accuracy, cardinality of feature and stability indices. It is also observed that with increased weight on the classification quality of feature subset of the fitness function, there is a significant reduction in the cardinality of features and also achieve better classification accuracy as well.
引用
收藏
页码:4287 / 4303
页数:16
相关论文
共 72 条
[1]  
Settouti N(2016)Statistical comparisons of the top 10 algorithms in data mining for classification task Int J Interact Multimed Artif Intell Spec Issue Artif Intell 4 46-51
[2]  
Bechar MEA(1997)Feature selection for classification Intell Data Anal 1 131-156
[3]  
Chikh MA(2016)SVM and ANN based classification of plant diseases using feature reduction technique Int J Interact Multimed Artif Intell 3 1-9
[4]  
Dash M(1982)Rough sets Int J Comput Inf Sci 11 341-356
[5]  
Liu H(1997)Rough set approach to knowledge-based decision support Eur J Oper Res 99 48-57
[6]  
Pujari JD(2001)Rough set-aided keyword reduction for text categorization Appl Artif Intell 15 843-873
[7]  
Yakkundimath R(2010)Feature selection with intelligent dynamic swarm and rough set Expert Syst Appl 37 7026-7032
[8]  
Byadgi A(2000)Comparison of algorithms that select features for pattern classifiers Pattern Recogn 33 25-41
[9]  
Pawlak Z(1997)Wrappers for feature subset selection Artif Intell 97 273-324
[10]  
Pawlak Z(2003)An introduction to variable and feature selection J Mach Learn Res 3 1157-1182