A Discernibility-Based Approach to Feature Selection for Microarray Data

被引:0
作者
Voulgaris, Zacharias [1 ]
Magoulas, George D. [1 ]
机构
[1] Univ London Birkbeck Coll, Sch Comp Sci & Informat Syst, London WC1E 7HX, England
来源
2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2 | 2008年
关键词
classification problems; feature selection; dimensionality reduction; discernibility; microarray data;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection has been used widely for a variety of data, yielding higher speeds and reduced computational cost for the classification process. However, it is in microarray datasets where its advantages become more evident and are more required. In this paper we present a novel approach to accomplish this based on the concept of discernibility that we introduce to depict how separated the classes of a dataset are. We develop and test two independent feature selection methods that follow this approach. The results of oar experiments on four microarray datasets show that discernibility-based feature selection reduces the dimensionality of the datasets involved without compromising the performance of the classifiers.
引用
收藏
页码:818 / 823
页数:6
相关论文
共 17 条
[1]   A robust meta-classification strategy for cancer diagnosis from gene expression data [J].
Alexe, G ;
Bhanot, G ;
Venkataraghavan, B ;
Ramaswamy, R ;
Lepre, J ;
Levine, AJ ;
Stolovitzky, G .
2005 IEEE Computational Systems Bioinformatics Conference, Proceedings, 2005, :322-325
[2]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[3]  
[Anonymous], 1998, ROUGH SETS TUTORIAL
[4]  
AYDIN T, 2006, 14 TURK S ART INT AR, P168
[5]   Saliency analysis of support vector machines for gene selection in tissue classification [J].
Cao, L ;
Seng, CK ;
Gu, Q ;
Lee, HP .
NEURAL COMPUTING & APPLICATIONS, 2003, 11 (3-4) :244-249
[6]  
CHAO S, 2005, 3 AS PAC BIOINF C JA, P211
[7]  
Duda RO, 2006, PATTERN CLASSIFICATI
[8]   Feature selection for support vector machines by means of genetic algorithms [J].
Fröhlich, H ;
Chapelle, O ;
Schölkopf, B .
15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, :142-148
[9]   A FUZZY K-NEAREST NEIGHBOR ALGORITHM [J].
KELLER, JM ;
GRAY, MR ;
GIVENS, JA .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (04) :580-585
[10]  
LECOCKE M, 2005, DEP BIOSTATISTICS WO