Gene ontology driven feature selection from microarray gene expression data

被引:0
作者
Qi, Jianlong [1 ]
Tang, Jian [1 ]
机构
[1] Mem Univ Newfoundland, Dept Comp Sci, St John, NF A1B 3X5, Canada
来源
PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY | 2006年
关键词
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One of the main challenges in the classification of microarray gene expression data is the small sample size compared with the large number of genes, so feature selection is an essential step to remove genes not relevant to class label. Traditional gene selection methods often select the top-ranked genes based on their individual discriminative powers. The problem with these simple ranking models is that they evaluate genes in isolation and this may introduce redundancy among the selected feature subset. Most redundancy based methods solely evaluate gene expression levels. This may decrease the effectiveness of feature selection since some values may not be accurately measured. In this paper, we propose a gene ontology based method for feature selection. The novelty of this model is to detect redundancy between a pair of genes by the convex combination of their expression similarity and semantic similarity in gene ontology. The effectiveness of our method is demonstrated by the experiment in two widely used datasets.
引用
收藏
页码:428 / +
页数:2
相关论文
共 14 条
[1]  
ALON U, 1999, P NATL ACAD SCI
[2]  
BUDANITSKY A, 2001, P WORKSH WORDN OTH L
[3]   SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data [J].
Diehn, M ;
Sherlock, G ;
Binkley, G ;
Jin, H ;
Matese, JC ;
Hernandez-Boussard, T ;
Rees, CA ;
Cherry, JM ;
Botstein, D ;
Brown, PO ;
Alizadeh, AA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :219-223
[4]   Minimum redundancy feature selection from microarray gene expression data [J].
Ding, C ;
Peng, HC .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :523-528
[5]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[6]  
HANCZAR B, 2003, ACM SIGKDD EXPLORATI
[7]  
Jiang J. J., 1998, P INT C RES COMP LIN
[8]  
Lin D., 1998, P 15 INT C MACH LEAR, P296
[9]   QUANTITATIVE MONITORING OF GENE-EXPRESSION PATTERNS WITH A COMPLEMENTARY-DNA MICROARRAY [J].
SCHENA, M ;
SHALON, D ;
DAVIS, RW ;
BROWN, PO .
SCIENCE, 1995, 270 (5235) :467-470
[10]   Improving missing value estimation in microarray data with gene ontology [J].
Tuikkala, J ;
Elo, L ;
Nevalainen, OS ;
Aittokallio, T .
BIOINFORMATICS, 2006, 22 (05) :566-572