A New Binary Particle Swarm Optimisation Algorithm for Feature Selection

被引:12
作者
Xue, Bing [1 ]
Nguyen, Su [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION | 2014年 / 8602卷
关键词
Binary particle swarm optimisation; Feature selection; Classification;
D O I
10.1007/978-3-662-45523-4_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection aims to select a small number of features from a large feature set to achieve similar or better classification performance than using all features. This paper develops a new binary particle swarm optimisation (PSO) algorithm (named PBPSO) based on which a new feature selection approach (PBPSOfs) is developed to reduce the number of features and increase the classification accuracy. The performance of PBPSOfs is compared with a standard binary PSO based feature selection algorithm (BPSOfs) and two traditional feature selection algorithms on 14 benchmark problems of varying difficulty. The results show that PBPSOfs can be successfully used for feature selection to select a small number of features and improve the classification performance over using all features. PBPSOfs further reduces the number of features selected by BPSOfs and simultaneously increases the classification accuracy, especially on datasets with a large number of features. Meanwhile, PBPSOfs achieves better performance than the two traditional feature selection algorithms. In addition, the results also show that PBPSO as a general binary optimisation technique can achieve better performance than standard binary PSO and uses less computational time.
引用
收藏
页码:501 / 513
页数:13
相关论文
共 18 条
[1]  
[Anonymous], 2002, Computational Intelligence an Introduction
[2]  
Caruana R., 1994, P 11 INT C INT C MAC, P28, DOI [10.1016/B978-1-55860-335-6.50012-X, DOI 10.1016/B978-1-55860-335-6.50012-X]
[3]   Improved binary PSO for feature selection using gene expression data [J].
Chuang, Li-Yeh ;
Chang, Hsueh-Wei ;
Tu, Chung-Jui ;
Yang, Cheng-Hong .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (01) :29-38
[4]  
Dash M., 1997, Intelligent Data Analysis, V1
[5]  
Frank A., 2010, UCI machine learning repository, V213
[6]   Large-scale attribute selection using wrappers [J].
Guetlein, Martin ;
Frank, Eibe ;
Hall, Mark ;
Karwath, Andreas .
2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, :332-339
[7]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[8]  
Kennedy J, 1997, IEEE SYS MAN CYBERN, P4104, DOI 10.1109/ICSMC.1997.637339
[9]  
Kennedy J, 1995, 1995 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS PROCEEDINGS, VOLS 1-6, P1942, DOI 10.1109/icnn.1995.488968
[10]  
Khan M. A. A., 2007, Proceedings. IEEE SoutheastCon 2007 (IEEE Cat. No.07CH37882)