CREME: a framework for identifying cis-regulatory modules in human-mouse conserved segments

被引:77
作者
Sharan, Roded [1 ]
Ovcharenko, Ivan [2 ]
Ben-Hur, Asa [3 ]
Karp, Richard M. [1 ]
机构
[1] Int Comp Sci Inst, Berkeley, CA 94704 USA
[2] Univ Calif Berkeley, Lawrence Berkeley Lab, Genome Sci Dept, Berkeley, CA 94720 USA
[3] Stanford Univ, Dept Biochem, Stanford, CA 94305 USA
关键词
Cis-regulatory module; transcription factor binding site; motif cluster; statistical test;
D O I
10.1093/bioinformatics/btg1039
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The binding of transcription factors to specific regulatory sequence elements is a primary mechanism for controlling gene transcription. Recent findings suggest a modular organization of binding sites for transcription factors that cooperate in the regulation of genes. In this work we establish a framework for finding recurrent cis-regulatory modules in the promoters of a selected set of genes and scoring their statistical significance. Results: Proceeding from a database of identified binding site motifs and their genomic locations we seek motifs whose frequency in the selected promoters is different than in a background promoter set. We present several statistical tests designed for this purpose. We provide a hashing algorithm for detecting combinations of these motifs that co-occur in clusters within the selected promoters. The significance of such co-occurrences is evaluated using novel statistical scores. Our methods are combined in CREME, a suite of software which includes a browser for viewing the pattern of occurrence of selected cis-regulatory modules. We applied our methodology to find modules within human-mouse conserved promoter segments, focusing on cell cycle regulated genes and stress response related genes. To validate the biological significance of the identified modules we tested whether the associated genes tended to be co-expressed or share similar function. In the cell cycle set five of the seven identified sets of genes were coherently expressed. On the stress response data four of the six detected sets fell predominantly into well-defined functional sub-categories.
引用
收藏
页码:i283 / i291
页数:9
相关论文
共 23 条
  • [1] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [2] Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome
    Berman, BP
    Nibu, Y
    Pfeiffer, BD
    Tomancak, P
    Celniker, SE
    Levine, M
    Rubin, GM
    Eisen, MB
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) : 757 - 762
  • [3] Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells
    Elkon, R
    Linhart, C
    Sharan, R
    Shamir, R
    Shiloh, Y
    [J]. GENOME RESEARCH, 2003, 13 (05) : 773 - 780
  • [4] Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences
    Frith, MC
    Spouge, JL
    Hansen, U
    Weng, ZP
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (14) : 3214 - 3224
  • [5] Detection of cis-element clusters in higher eukaryotic DNA
    Frith, MC
    Hansen, U
    Weng, ZP
    [J]. BIOINFORMATICS, 2001, 17 (10) : 878 - 889
  • [6] Identifying target sites for cooperatively binding factors
    GuhaThakurta, D
    Stormo, GD
    [J]. BIOINFORMATICS, 2001, 17 (07) : 608 - 621
  • [7] Halfon MS, 2002, GENOME RES, V12, P1019, DOI 10.1101/gr.228902
  • [8] Predicting transcription factor synergism
    Hannenhalli, S
    Levy, S
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (19) : 4278 - 4284
  • [9] Kel-Margoulis O V, 2002, Pac Symp Biocomput, P187
  • [10] Functional promoter modules can be defected by formal models independent of overall nucleoside sequence similarity
    Klingenhoff, A
    Frech, K
    Quandt, K
    Werner, T
    [J]. BIOINFORMATICS, 1999, 15 (03) : 180 - 186