Wrapper Feature Subset Selection for Dimension Reduction Based on Ensemble Learning Algorithm

被引:76
作者
Panthong, Rattanawadee [1 ]
Srivihok, Anongnart [1 ]
机构
[1] Kasetsart Univ, Dept Comp Sci, Bangkok 10900, Thailand
来源
THIRD INFORMATION SYSTEMS INTERNATIONAL CONFERENCE 2015 | 2015年 / 72卷
关键词
Feature selection; Ensemble learning algorithm; dimension reduction; classification; data mining;
D O I
10.1016/j.procs.2015.12.117
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is a technique to choose a subset of variables from the multidimensional data which can improve the classification accuracy in diversity datasets. In addition, the best feature subset selection method can reduce the cost of feature measurement. This work focuses on the use of wrapper feature selection. This study use methods of sequential forward selection (SFS), sequential backward selection (SBS) and optimize selection (evolutionary) based on ensemble algorithms namely Bagging and AdaBoost by subset evaluations which are performed using two classifiers; Decision Tree and Naive Bayes. Thirteen datasets containing different numbers of attributes and dimensions are obtained from the UCI Machine Learning Repository. This study shows that the search technique using SFS based on the bagging algorithm using Decision Tree obtained better results in average accuracy (89.60%) than other methods. The benefits of the feature subset selection are an increased accuracy rate and a reduced run-time when searching multimedia data consisting of a large number of multidimensional datasets. (C) 2015 Published by Elsevier B.V.
引用
收藏
页码:162 / 169
页数:8
相关论文
共 19 条
[1]  
[Anonymous], IETE TECHNICAL REV
[2]  
[Anonymous], SMART COMPUTING REV
[3]  
Beniwal S., 2012, International Journal of Engineering Research Technology (IJERT), V1, P1
[4]   Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets [J].
Bryll, R ;
Gutierrez-Osuna, R ;
Quek, F .
PATTERN RECOGNITION, 2003, 36 (06) :1291-1302
[5]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28
[6]   Hybrid decision tree and naive Bayes classifiers for multi-class classification tasks [J].
Farid, Dewan Md. ;
Zhang, Li ;
Rahman, Chowdhury Mofizur ;
Hossain, M. A. ;
Strachan, Rebecca .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) :1937-1946
[7]  
Frank A., 2010, UCI MACHINE LEARNING
[8]   Pattern Analysis for Machine Olfaction: A Review [J].
Gutierrez-Osuna, Ricardo .
IEEE SENSORS JOURNAL, 2002, 2 (03) :189-202
[9]  
Janecek A., 2008, Journal of Machine Learning Research: Workshop and Conference Proceedings, P90, DOI DOI 10.1023/A:1010933404324
[10]  
Jantawan B, 2014, INT J INNOVATIVE RES