BINARY PSO AND ROUGH SET THEORY FOR FEATURE SELECTION: A MULTI-OBJECTIVE FILTER BASED APPROACH

被引:40
作者
Xue, Bing [1 ,2 ]
Cervante, Liam [1 ]
Shang, Lin [2 ]
Browne, Will [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210046, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Feature selection; particle swarm optimization; rough set theory; multi-objective optimization;
D O I
10.1142/S1469026814500096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a multi-objective problem, where the two main objectives are to maximize the classification accuracy and minimize the number of features. However, most of the existing algorithms belong to single objective, wrapper approaches. In this work, we investigate the use of binary particle swarm optimization (BPSO) and probabilistic rough set (PRS) for multi-objective feature selection. We use PRS to propose a new measure for the number of features based on which a new filter based single objective algorithm (PSOPRSE) is developed. Then a new filter-based multi-objective algorithm (MORSE) is proposed, which aims to maximize a measure for the classification performance and minimize the new measure for the number of features. MORSE is examined and compared with PSOPRSE, two existing PSO-based single objective algorithms, two traditional methods, and the only existing BPSO and PRS-based multi-objective algorithm (MORSN). Experiments have been conducted on six commonly used discrete datasets with a relative small number of features and six continuous datasets with a large number of features. The classification performance of the selected feature subsets are evaluated by three classification algorithms (decision trees, Naive Bayes, and k-nearest neighbors). The results show that the proposed algorithms can automatically select a smaller number of features and achieve similar or better classification performance than using all features. PSOPRSE achieves better performance than the other two PSO-based single objective algorithms and the two traditional methods. MORSN and MORSE outperform all these five single objective algorithms in terms of both the classification performance and the number of features. MORSE achieves better classification performance than MORSN. These filter algorithms are general to the three different classification algorithms.
引用
收藏
页数:34
相关论文
共 41 条
[1]   LEARNING BOOLEAN CONCEPTS IN THE PRESENCE OF MANY IRRELEVANT FEATURES [J].
ALMUALLIM, H ;
DIETTERICH, TG .
ARTIFICIAL INTELLIGENCE, 1994, 69 (1-2) :279-305
[2]  
Bache K., 2013, UCI MACHINE LEARNING
[3]   Feature selection with Intelligent Dynamic Swarm and Rough Set [J].
Bae, Changseok ;
Yeh, Wei-Chang ;
Chung, Yuk Ying ;
Liu, Sin-Long .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) :7026-7032
[4]  
Cervante Liam, 2012, AI 2012: Advances in Artificial Intelligence. 25th Australasian Conference. Proceedings, P313, DOI 10.1007/978-3-642-35101-3_27
[5]  
Cervante L., 2012, IEEE C EV COMP CEC 2, P1
[6]  
Cervante L, 2013, 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), P2428
[7]  
Cervante L, 2013, LECT NOTES COMPUT SC, V7832, P25, DOI 10.1007/978-3-642-37198-1_3
[8]  
Chakraborty B, 2002, ISIE 2002: PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, VOLS 1-4, P315, DOI 10.1109/ISIE.2002.1026085
[9]   Feature Subset Selection by Particle Swarm Optimization with Fuzzy Fitness Function [J].
Chakraborty, Basabi .
2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, :1038-1042
[10]   A rough set approach to feature selection based on ant colony optimization [J].
Chen, Yumin ;
Miao, Duoqian ;
Wang, Ruizhi .
PATTERN RECOGNITION LETTERS, 2010, 31 (03) :226-233