Feature subset selection for irrelevant data removal using Decision Tree Algorithm

被引:0
作者
Evangeline, D. Preetha [1 ]
Sandhiya, C. [2 ]
Anandhakumar, P. [1 ]
Raj, G. Deepti [2 ]
Rajendran, T. [2 ]
机构
[1] Anna Univ, Dept Comp Technol, Chennai 600044, Tamil Nadu, India
[2] Chettinad Coll Engg & Technol, Dept Comp Sci & Engn, Karur 639114, India
来源
2013 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC) | 2013年
关键词
Feature subset selection; Decision tree; feature clustering;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Feature subset selection is an effective way for reducing dimensionality, removmg irrelevant data, and improving result accuracy. Feature subset selection can be viewed as the process of identifying and removing as many irrelevant and redundant features as possible. This is because 1) irrelevant features do not contribute to the predictive accuracy and 2) redundant features do not redound to getting a better predictor for that they provide mostly information which is already present in other feature(s). Irrelevant features, along with redundant features, severely affect the accuracy of the learning machines. In this paper, exceptional vigilance is made on characteristic assortment for classification with data. Here an algorithm is utilized that plans attributes founded on their significance. Then, the organized attributes can be utilized as input one easy algorithm for building decision tree (Oblivious Tree). Outcomes show that this decision tree uses boasted chosen by suggested algorithm outperformed conclusion tree without feature selection. From the experimental outcomes, it is observed that, this procedure develops lesser tree having an agreeable accuracy. The results obtained with decision tree method for selection of datasets has resulted with 85.87% when compared with other techniques.
引用
收藏
页码:268 / 274
页数:7
相关论文
共 17 条
[1]  
[Anonymous], P 18 INT C MACH LEAR
[2]  
[Anonymous], P IEEE INT C DAT MIN
[3]  
[Anonymous], P 31 ANN M OH STAT U
[4]  
Guyon I., P 9 INT C PATT
[5]   Toward integrating feature selection algorithms for classification and clustering [J].
Liu, H ;
Yu, L .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) :491-502
[6]   A selective sampling approach to active feature selection [J].
Liu, H ;
Motoda, H ;
Yu, L .
ARTIFICIAL INTELLIGENCE, 2004, 159 (1-2) :49-74
[7]  
Mitchell T., 1982, ARTIFICIAL INTELLIGE, V18
[8]  
Modrzejewski Maciej, 1993, P EUR C MACH LEARN
[9]  
Sionim N., 2002, P 19 INT C MACH LEAR
[10]  
Smyth P., 2002, COMM ACM, V45