The Feature Selection for Classification by Applying the Significant Matrix with SPEA2

被引:0
作者
Chuasuwan, Ekapong [1 ]
Eiamkanitchat, Narissara [1 ]
机构
[1] Chiang Mai Univ, Fac Engn, Dept Comp Engn, Chiang Mai 50000, Thailand
来源
2013 INTERNATIONAL COMPUTER SCIENCE AND ENGINEERING CONFERENCE (ICSEC) | 2013年
关键词
component; Feature Selection; Signigicant Matrix; Decision Tree; Genetic Algotihm;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel application of Genetic Algorithm for the feature selection. The main purpose is to provide proper subset features for decision tree construction in the classification task New method with the use of "Significant Matrix" on genetic algorithm is presented. The main function is to calculate the relationship between the feature and class label assigned to a fitness value for the population. The algorithm presented important features selected by considering the class of the data and number of features for the least amount in the Significant Matrix. The next step will then update the feature number and the record number to repeat the process until a stop condition is met. Classification by decision tree is used to verify the importance of the features selected by the proposed method. The model tested with 11 different datasets. The results show that the method yields high accuracy of the classification and higher satisfaction compared to classification using artificial neural network Experimental results show that the proposed method not only provides a higher accuracy, but also reduce the complexity by using less features of the dataset.
引用
收藏
页码:359 / 364
页数:6
相关论文
共 16 条
[1]  
Bache K., 2013, UCI Machine Learning Repository
[2]  
Chis Monica, 2008, INT J COMPUT INF SCI, V2, P237
[3]  
Gehrke J., 2000, SPRINGER NETHERLANDS, V4
[4]  
Golberg D. E., 1989, GENETIC ALGORITHMS S, V1989, P36
[5]  
Guo Pengfei, 2010, INT C BIOM ENG INF B
[6]  
Guyon I., 2003, J MACH LEARN RES, V3, P1157
[7]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[8]   Shared memory parallelization of data mining algorithms: Techniques, programming interface, and performance [J].
Jin, RM ;
Yang, G ;
Agrawal, G .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (01) :71-89
[9]  
Kriegel Hans-Peter, 2006, DATA MINING CONCEPTS
[10]  
Kumar Pardeep, 2011, WORLD C INF COMM TEC