An ensemble biclustering approach for querying gene expression compendia with experimental lists

被引:9
作者
De Smet, Riet [2 ,3 ]
Marchal, Kathleen [1 ]
机构
[1] Katholieke Univ Leuven, Dept Microbial & Mol Syst, B-3001 Louvain, Belgium
[2] Univ Ghent, Dept Plant Biotechnol & Genet, B-9000 Ghent, Belgium
[3] Univ Ghent, VIB, Dept Plant Syst Biol, B-9000 Ghent, Belgium
关键词
ESCHERICHIA-COLI; TRANSCRIPTION; VISUALIZATION; DISCOVERY; FRAMEWORK; NETWORKS; MODEL;
D O I
10.1093/bioinformatics/btr307
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Query-based biclustering techniques allow interrogating a gene expression compendium with a given gene or gene list. They do so by searching for genes in the compendium that have a profile close to the average expression profile of the genes in this query-list. As it can often not be guaranteed that the genes in a long query-list will all be mutually coexpressed, it is advisable to use each gene separately as a query. This approach, however, leaves the user with a tedious post-processing of partially redundant biclustering results. The fact that for each query-gene multiple parameter settings need to be tested in order to detect the 'most optimal bicluster size' adds to the redundancy problem. Results: To aid with this post-processing, we developed an ensemble approach to be used in combination with query-based biclustering. The method relies on a specifically designed consensus matrix in which the biclustering outcomes for multiple query-genes and for different possible parameter settings are merged in a statistically robust way. Clustering of this matrix results in distinct, non-redundant consensus biclusters that maximally reflect the information contained within the original query-based biclustering results. The usefulness of the developed approach is illustrated on a biological case study in Escherichia coli.
引用
收藏
页码:1948 / 1956
页数:9
相关论文
共 29 条
[1]   Mining for coexpression across hundreds of datasets using novel rank aggregation and visualization methods [J].
Adler, Priit ;
Kolde, Raivo ;
Kull, Meelis ;
Tkachenko, Aleksandr ;
Peterson, Hedi ;
Reimand, Jueri ;
Vilo, Jaak .
GENOME BIOLOGY, 2009, 10 (12)
[2]  
[Anonymous], 2000, A cluster algorithm for graphs, DOI DOI 10.1016/J.COSREV.2007.05.001
[3]   An ensemble framework for clustering protein-protein interaction networks [J].
Asur, Sitaram ;
Ucar, Duygu ;
Parthasarathy, Srinivasan .
BIOINFORMATICS, 2007, 23 (13) :I29-I40
[4]   Modular analysis of gene expression data with R [J].
Csardi, Gabor ;
Kutalik, Zoltan ;
Bergmann, Sven .
BIOINFORMATICS, 2010, 26 (10) :1376-1377
[5]   Query-driven module discovery in microarray data [J].
Dhollander, Thomas ;
Sheng, Qizheng ;
Lemmens, Karen ;
De Moor, Bart ;
Marchal, Kathleen ;
Moreau, Yves .
BIOINFORMATICS, 2007, 23 (19) :2573-2580
[6]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[7]   RegulonDB (version 6.0):: gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation [J].
Gama-Castro, Socorro ;
Jimenez-Jacinto, Veronica ;
Peralta-Gil, Martin ;
Santos-Zavaleta, Alberto ;
Penaloza-Spinola, Monica I. ;
Contreras-Moreira, Bruno ;
Segura-Salazar, Juan ;
Muniz-Rascado, Luis ;
Martinez-Flores, Irma ;
Salgado, Heladia ;
Bonavides-Martinez, Cesar ;
Abreu-Goodger, Cei ;
Rodriguez-Penagos, Carlos ;
Miranda-Rios, Juan ;
Morett, Enrique ;
Merino, Enrique ;
Huerta, Araceli M. ;
Trevino-Quintanilla, Luis ;
Collado-Vides, Julio .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D120-D124
[8]   Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data [J].
Gao, F ;
Foat, BC ;
Bussemaker, HJ .
BMC BIOINFORMATICS, 2004, 5 (1)
[9]   Assessing experimentally derived interactions in a small world [J].
Goldberg, DS ;
Roth, FP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (08) :4372-4376
[10]   Transcription factor distribution in Escherichia coli:: studies with FNR protein [J].
Grainger, David C. ;
Aiba, Hirofumi ;
Hurd, Douglas ;
Browning, Douglas F. ;
Busby, Stephen J. W. .
NUCLEIC ACIDS RESEARCH, 2007, 35 (01) :269-278