Evaluation of phylogenetic footprint discovery for predicting bacterial cis-regulatory elements and revealing their evolution

被引:37
作者
Janky, Rekin's [1 ]
van Helden, Jacques [1 ]
机构
[1] Univ Libre Bruxelles, Lab Bioinformat Genomes & Reseaux, B-1050 Brussels, Belgium
关键词
D O I
10.1186/1471-2105-9-37
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The detection of conserved motifs in promoters of orthologous genes (phylogenetic footprints) has become a common strategy to predict cis-acting regulatory elements. Several software tools are routinely used to raise hypotheses about regulation. However, these tools are generally used as black boxes, with default parameters. A systematic evaluation of optimal parameters for a footprint discovery strategy can bring a sizeable improvement to the predictions. Results: We evaluate the performances of a footprint discovery approach based on the detection of over-represented spaced motifs. This method is particularly suitable for (but not restricted to) Bacteria, since such motifs are typically bound by factors containing a Helix-Turn-Helix domain. We evaluated footprint discovery in 368 Escherichia coli K12 genes with annotated sites, under 40 different combinations of parameters (taxonomical level, background model, organism-specific filtering, operon inference). Motifs are assessed both at the levels of correctness and significance. We further report a detailed analysis of 181 bacterial orthologs of the LexA repressor. Distinct motifs are detected at various taxonomical levels, including the 7 previously characterized taxon-specific motifs. In addition, we highlight a significantly stronger conservation of half-motifs in Actinobacteria, relative to Firmicutes, suggesting an intermediate state in specificity switching between the two Gram-positive phyla, and thereby revealing the on-going evolution of LexA autoregulation. Conclusion: The footprint discovery method proposed here shows excellent results with E. coli and can readily be extended to predict cis-acting regulatory signals and propose testable hypotheses in bacterial genomes for which nothing is known about regulation.
引用
收藏
页数:26
相关论文
共 79 条
[1]   Regulog analysis:: Detection of conserved regulatory networks across bacteria:: Application to Staphylococcus aureus [J].
Alkema, WBL ;
Lenhard, B ;
Wasserman, WW .
GENOME RESEARCH, 2004, 14 (07) :1362-1373
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   Functional determinants of transcription factors in Escherichia coli:: protein families and binding sites [J].
Babu, MM ;
Teichmann, SA .
TRENDS IN GENETICS, 2003, 19 (02) :75-79
[4]  
Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
[5]  
Bailey TL., 1994, P 2 INT C INT SYST M, V2, P28
[6]   GenBank [J].
Benson, DA ;
Karsch-Mizrachi, I ;
Lipman, DJ ;
Ostell, J ;
Rapp, BA ;
Wheeler, DL .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :15-18
[7]   Aligning multiple genomic sequences with the threaded blockset aligner [J].
Blanchette, M ;
Kent, WJ ;
Riemer, C ;
Elnitski, L ;
Smit, AFA ;
Roskin, KM ;
Baertsch, R ;
Rosenbloom, K ;
Clawson, H ;
Green, ED ;
Haussler, D ;
Miller, W .
GENOME RESEARCH, 2004, 14 (04) :708-715
[8]   FootPrinter: a program designed for phylogenetic footprinting [J].
Blanchette, M ;
Tompa, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3840-3842
[9]   Approaches to the automatic discovery of patterns in biosequences [J].
Brazma, A ;
Jonassen, I ;
Eidhammer, I ;
Gilbert, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (02) :279-305
[10]   Predicting gene regulatory elements in silico on a genomic scale [J].
Brazma, A ;
Jonassen, I ;
Vilo, J ;
Ukkonen, E .
GENOME RESEARCH, 1998, 8 (11) :1202-1215