Mining gene-sample-time microarray data: a coherent gene cluster discovery approach

Cited by: 9
Authors
Jiang, Daxin
Pei, Jian
Ramanathan, Murali
Lin, Chuan
Tang, Chun
Zhang, Aidong
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Simon Fraser Univ, Burnaby, BC V5A 1S6, Canada
[3] SUNY Buffalo, Buffalo, NY USA
Funding
US National Science Foundation
Keywords
bioinformatics; clustering; microarray data;
DOI
10.1007/s10115-006-0031-9
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Extensive studies have shown that mining microarray data sets is important in bioinformatics research and biomedical applications. In this paper, we explore a novel type of microarray data set, the gene-sample-time data set, which records the expression levels of various genes under a set of samples over a series of time points. In particular, we propose mining coherent gene clusters from such data sets. Each cluster contains a subset of genes and a subset of samples such that the genes are coherent on the samples along the time series. The coherent gene clusters may identify the samples corresponding to some phenotypes (e.g., diseases) and suggest candidate genes correlated with those phenotypes. We present two efficient algorithms, the Sample-Gene Search and the Gene-Sample Search, to mine the complete set of coherent gene clusters. We empirically evaluate the performance of our approaches on both a real microarray data set and synthetic data sets. The test results show that our approaches are both efficient and effective in finding meaningful coherent gene clusters.
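The abstract does not say how coherence is measured, so the following is only a minimal sketch of what checking one candidate cluster might look like. It assumes that a gene is coherent on a set of samples when its time series on every pair of those samples has a Pearson correlation of at least a threshold. The threshold name `delta`, the nested-dictionary layout, and the toy values are all hypothetical. The sketch verifies a given cluster only; it is not the Sample-Gene Search or the Gene-Sample Search.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def gene_is_coherent(data, gene, samples, delta):
    """True if the gene's time series correlate at >= delta for every pair of samples.

    Assumption: pairwise Pearson correlation stands in for the coherence measure.
    """
    return all(
        pearson(data[gene][s1], data[gene][s2]) >= delta
        for s1, s2 in combinations(samples, 2)
    )


def is_coherent_cluster(data, genes, samples, delta=0.9):
    """True if every gene in `genes` is coherent on `samples` (delta is hypothetical)."""
    return all(gene_is_coherent(data, g, samples, delta) for g in genes)


# Toy data: data[gene][sample] is a list of expression levels over time.
data = {
    "g1": {"s1": [1, 2, 3, 4], "s2": [2, 4, 6, 8], "s3": [4, 3, 2, 1]},
    "g2": {"s1": [5, 1, 5, 1], "s2": [6, 2, 6, 2], "s3": [1, 5, 1, 5]},
}

on_two_samples = is_coherent_cluster(data, ["g1", "g2"], ["s1", "s2"])
on_three_samples = is_coherent_cluster(data, ["g1", "g2"], ["s1", "s2", "s3"])
```

In the toy data both genes follow the same temporal pattern on `s1` and `s2` but the opposite pattern on `s3`, so adding `s3` to the sample set breaks coherence under this assumed measure.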
Pages: 305-335
Page count: 31