Kerfuffle: a web tool for multi-species gene colocalization analysis

Cited by: 4
Authors
Aboukhalil, Robert [1]
Fendler, Bernard [1]
Atwal, Gurinder S. [1]
Affiliations
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
Keywords
Genes; Clusters; Colocalization; Conservation; Synteny
DOI
10.1186/1471-2105-14-22
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: The evolutionary pressures that underlie the large-scale functional organization of the genome are not well understood in eukaryotes. Recent evidence suggests that functionally similar genes may colocalize (cluster) in the eukaryotic genome, which points to a role for chromatin-level gene regulation in shaping the physical distribution of coordinated genes. However, few of the bioinformatic tools currently available allow a systematic study of gene colocalization across several evolutionarily distant species. Furthermore, most tools require the user to input manually curated lists of gene positions, DNA sequences, or gene homology relations between species. With the growing number of sequenced genomes, there is a need for new comparative genomics tools that can address the analysis of multi-species gene colocalization.
Results: Kerfuffle is a web tool designed to help discover, visualize, and quantify the physical organization of genomes by identifying significant gene colocalization and conservation across the assembled genomes of available species (currently up to 47, from humans to worms). Kerfuffle only requires the user to specify a list of human genes and the names of other species of interest. Without further input from the user, the software queries the Ensembl BioMart server to obtain positional information and discovers homology relations for all genes and species specified. Using this information, Kerfuffle performs a multi-species clustering analysis, presents downloadable lists of clustered genes, performs Monte Carlo statistical significance calculations, estimates how conserved gene clusters are across species, plots histograms and interactive graphs, allows users to save their queries, and generates a downloadable visualization of the clusters using the Circos software. These analyses may be used to further explore the functional roles of gene clusters by interrogating the enriched molecular pathways associated with each cluster.
Conclusions: Kerfuffle is a new, easy-to-use, and publicly available tool to aid our understanding of functional and comparative genomics. The software allows flexible and quick investigation of a user-defined set of genes, and the results may be saved online for further analysis. Kerfuffle is freely available at http://atwallab.org/kerfuffle, is implemented in JavaScript (using the jQuery and jsCharts libraries) and PHP 5.2, runs on an Apache server, and stores data in flat files and an SQLite database.
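To make the clustering and Monte Carlo significance steps mentioned in the abstract more concrete, the following is a minimal Python sketch. It is not Kerfuffle's implementation: the abstract does not specify the clustering criterion or the null model, so the sketch assumes that a gene counts as "clustered" when it has a same-chromosome neighbour within a fixed distance, and that the null distribution comes from drawing equally sized random gene sets from a background list. The names `count_clustered_genes`, `monte_carlo_p_value`, `max_gap`, and `background` are hypothetical and chosen only for illustration.

```python
import random
from typing import Dict, List, Tuple

Gene = Tuple[str, int]  # (chromosome name, start position in bp)


def count_clustered_genes(genes: List[Gene], max_gap: int) -> int:
    """Count genes with at least one same-chromosome neighbour within max_gap bp.

    Assumption: this nearest-neighbour criterion stands in for whatever
    clustering rule the tool actually applies.
    """
    by_chrom: Dict[str, List[int]] = {}
    for chrom, pos in genes:
        by_chrom.setdefault(chrom, []).append(pos)

    clustered = 0
    for positions in by_chrom.values():
        positions.sort()
        for i, pos in enumerate(positions):
            near_left = i > 0 and pos - positions[i - 1] <= max_gap
            near_right = i + 1 < len(positions) and positions[i + 1] - pos <= max_gap
            if near_left or near_right:
                clustered += 1
    return clustered


def monte_carlo_p_value(
    query: List[Gene],
    background: List[Gene],
    max_gap: int,
    n_trials: int = 1000,
    seed: int = 0,
) -> float:
    """Empirical p-value for observing at least as many clustered genes by chance.

    Assumption: the null model samples len(query) genes uniformly, without
    replacement, from the background gene list.
    """
    rng = random.Random(seed)
    observed = count_clustered_genes(query, max_gap)
    hits = 0
    for _ in range(n_trials):
        sample = rng.sample(background, len(query))
        if count_clustered_genes(sample, max_gap) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_trials + 1)
```

In this sketch, a small p-value would indicate that the query genes sit closer together than equally sized random gene sets drawn from the same background. Repeating the calculation on the orthologous gene positions in each species of interest would give one rough view of how well the colocalization is conserved.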
Pages: 8