A Genetic Based Wrapper Feature Selection Approach Using Nearest Neighbour Distance Matrix

Cited by: 0
Authors
Sainin, Mohd Shamrie [1]
Alfred, Rayner [2]
Affiliations
[1] Univ Utara Malaysia, Dept Comp Sci, Coll Arts & Sci, Sintok, Kedah, Malaysia
[2] Univ Malaysia Sabah, Sch Engn & Informat Technol, Kota Kinabalu, Malaysia
Source
2011 3RD CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO) | 2011
Keywords
machine learning; data mining; data mining optimization; nearest neighbour; distance matrix; classification; feature selection; genetic algorithm; ALGORITHMS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Feature selection is in high demand in data mining optimization, especially for data with high-dimensional feature vectors. It selects the best feature, or combination of features, for the data in order to achieve a similar or better classification rate. Feature selection methods currently fall into three types: filter, wrapper, and embedded. This paper describes a genetic-algorithm-based wrapper approach that optimizes the feature selection process embedded in a classification technique called the supervised Nearest Neighbour Distance Matrix (NNDM). The method is implemented and tested on several datasets from the UCI Machine Learning Repository and on other datasets. The results demonstrate that feature selection combined with the supervised NNDM has a significant impact on predictive accuracy when classifying new instances. The approach can therefore be used in other applications that require feature dimension reduction, such as image and bioinformatics classification.
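The wrapper idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not specify how NNDM works, so a plain leave-one-out 1-nearest-neighbour classifier over a pairwise distance matrix stands in for it. The function names (`loo_1nn_accuracy`, `ga_select`, `make_toy`), the GA settings (tournament selection, one-point crossover, bit-flip mutation, elitism), and the toy data are all assumptions chosen for illustration.

```python
import random

def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out 1-NN accuracy using only features where mask[j] == 1.
    Stand-in for the paper's NNDM classifier (an assumption)."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    n = len(X)
    # Pairwise squared Euclidean distance matrix over the selected features.
    dist = [[sum((X[a][j] - X[b][j]) ** 2 for j in cols) for b in range(n)]
            for a in range(n)]
    correct = 0
    for a in range(n):
        nearest = min((b for b in range(n) if b != a), key=lambda b: dist[a][b])
        correct += y[nearest] == y[a]
    return correct / n

def ga_select(X, y, pop_size=20, generations=15, p_mut=0.1, seed=0):
    """Genetic search over binary feature masks (needs at least 2 features)."""
    rng = random.Random(seed)
    n_feat = len(X[0])
    cache = {}

    def fitness(mask):
        key = tuple(mask)
        if key not in cache:
            cache[key] = loo_1nn_accuracy(X, y, mask)
        return cache[key]

    def tournament(pop):
        a, b = rng.sample(pop, 2)
        return a if fitness(a) >= fitness(b) else b

    pop = [[rng.randint(0, 1) for _ in range(n_feat)] for _ in range(pop_size)]
    pop[0] = [1] * n_feat  # seed the full feature set as a baseline individual
    best = max(pop, key=fitness)
    for _ in range(generations):
        children = [best[:]]  # elitism: the best mask always survives
        while len(children) < pop_size:
            p1, p2 = tournament(pop), tournament(pop)
            cut = rng.randrange(1, n_feat)  # one-point crossover
            child = p1[:cut] + p2[cut:]
            child = [1 - g if rng.random() < p_mut else g for g in child]
            children.append(child)
        pop = children
        best = max(pop, key=fitness)
    return best, fitness(best)

def make_toy(n=40, seed=1):
    """Toy data: feature 0 carries the class label, features 1-3 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([label + rng.gauss(0, 0.1)] + [rng.gauss(0, 1) for _ in range(3)])
        y.append(label)
    return X, y

if __name__ == "__main__":
    X, y = make_toy()
    mask, acc = ga_select(X, y)
    print("selected mask:", mask, "LOO accuracy:", acc)
```

Because the full feature set is seeded into the initial population and the best mask is carried over each generation, the returned subset in this sketch never scores below the all-features baseline, which mirrors the abstract's goal of a similar or better classification rate.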
Pages: 237-242
Number of pages: 6