Automatic layout and visualization of biclusters

被引:36
作者
Grothaus, Gregory A. [1 ,2 ]
Mufti, Adeel [1 ]
Murali, T. M. [1 ]
机构
[1] Virginia Polytech Inst & State Univ, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Google Inc, Mountain View, CA 94043 USA
关键词
D O I
10.1186/1748-7188-1-15
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biclustering has emerged as a powerful algorithmic tool for analyzing measurements of gene expression. A number of different methods have emerged for computing biclusters in gene expression data. Many of these algorithms may output a very large number of biclusters with varying degrees of overlap. There are no systematic methods that create a two-dimensional layout of the computed biclusters and display overlaps between them. Results: We develop a novel algorithm for laying out biclusters in a two-dimensional matrix whose rows ( respectively, columns) are rows ( respectively, columns) of the original dataset. We display each bicluster as a contiguous submatrix in the layout. We allow the layout to have repeated rows and/or columns from the original matrix as required, but we seek a layout of the smallest size. We also develop a web-based search interface for the user to query the genes and samples of interest and visualise the layout of biclusters matching the queries. Conclusion: We demonstrate the usefulness of our approach on gene expression data for two types of leukaemia and on protein-DNA binding data for two growth conditions in Saccharomyces cerevisiae. The software implementing the layout algorithm is available at http:// bioinformatics.cs.vt.edu/similar to murali/papers/bivoc.
引用
收藏
页数:11
相关论文
共 37 条
[1]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[2]  
Alizadeh F, 1995, J Comput Biol, V2, P159, DOI 10.1089/cmb.1995.2.159
[3]   Computational discovery of gene modules and regulatory networks [J].
Bar-Joseph, Z ;
Gerber, GK ;
Lee, TI ;
Rinaldi, NJ ;
Yoo, JY ;
Robert, F ;
Gordon, DB ;
Fraenkel, E ;
Jaakkola, TS ;
Young, RA ;
Gifford, DK .
NATURE BIOTECHNOLOGY, 2003, 21 (11) :1337-1342
[4]  
BATZOGLOU S, 2000, J DISCRETE ALGORITHM, V1, P51
[5]   The TOR signalling pathway controls nuclear localization of nutrient-regulated transcription factors [J].
Beck, T ;
Hall, MN .
NATURE, 1999, 402 (6762) :689-692
[6]  
BERGER R, 1967, CONTRIBUTIONS U CALI, V3, P1
[7]   Similarities and differences in genome-wide expression data of six organisms [J].
Bergmann, S ;
Ihmels, J ;
Barkai, N .
PLOS BIOLOGY, 2004, 2 (01) :85-93
[8]   TESTING FOR CONSECUTIVE ONES PROPERTY, INTERVAL GRAPHS, AND GRAPH PLANARITY USING PQ-TREE ALGORITHMS [J].
BOOTH, KS ;
LUEKER, GS .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1976, 13 (03) :335-379
[9]  
Califano A, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P75
[10]  
CHENG Y, 2000, P 8 INT C INT SYST M, P93