MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites

被引:44
作者
Schmid, Christoph D. [1 ,2 ,3 ]
Bucher, Philipp [1 ]
机构
[1] Swiss Inst Expt Canc Res GR BUCHER, SV ISREC, Ecole Polytech Fed Lausanne, Swiss Inst Bioinformat, Lausanne, Switzerland
[2] Swiss Trop & Publ Hlth Inst Swiss TPH, Basel, Switzerland
[3] Univ Basel, Basel, Switzerland
来源
PLOS ONE | 2010年 / 5卷 / 07期
关键词
GENOME-WIDE IDENTIFICATION; CHIP-SEQ EXPERIMENTS; DNA-BINDING; ELEMENTS; DATABASE; NETWORK; METHYLATION; ACTIVATION; VERTEBRATE; DISCOVERY;
D O I
10.1371/journal.pone.0011425
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Chromatin immunoprecipitation combined with massively parallel sequencing methods (ChIP-seq) is becoming the standard approach to study interactions of transcription factors (TF) with genomic sequences. At the example of public STAT1 ChIP-seq data sets, we present novel approaches for the interpretation of ChIP-seq data. We compare recently developed approaches to determine STAT1 binding sites from ChIP-seq data. Assessing the content of the established consensus sequence for STAT1 binding sites, we find that the usage of "negative control'' ChIP-seq data fails to provide substantial advantages. We derive a single refined probabilistic model of STAT1 binding sequences from these ChIP-seq data. Contrary to previous claims, we find no evidence that STAT1 binds to multiple distinct motifs upon interferon-gamma stimulation in vivo. While a large majority of genomic sites with high ChIP-seq signal is associated with a nucleotide sequence ressembling a STAT1 binding site, only a very small subset of the over 5 million potential STAT1 binding sites in the human genome is covered by ChIP-seq data. Furthermore a surprisingly large fraction of the ChIP-seq signal (5%) is absorbed by a small family of repetitive sequences (MER41). The observation of the binding of activated STAT1 protein to a specific repetitive element bolsters similar reports concerning p53 and other TFs, and strengthens the notion of an involvement of repeats in gene regulation. Incidentally MER41 are specific to primates, consequently, regulatory mechanisms in the IFN-STAT pathway might fundamentally differ between primates and rodents. On a methodological aspect, the presence of large numbers of nearly identical binding sites in repetitive sequences may lead to wrong conclusions about intrinsic binding preferences of TF as illustrated by the spacing analysis STAT1 tandem motifs. Therefore, ChIP-seq data should be analyzed independently within repetitive and non-repetitive sequences.
引用
收藏
页数:10
相关论文
共 49 条
  • [1] Follow-up of the Swiss Cohort Study on Air Pollution and Lung Diseases in Adults (SAPALDIA 2) 1991-2003:: methods and characterization of participants
    Ackermann-Liebrich, U
    Kuna-Dibbert, B
    Probst-Hensch, NM
    Schindler, C
    Dietrich, DF
    Stutz, EZ
    Bayer-Oglesby, L
    Baum, F
    Brändli, O
    Brutsche, M
    Downs, SH
    Keidel, D
    Gerbase, MW
    Imboden, M
    Keller, R
    Knöpfli, B
    Künzli, N
    Nicod, L
    Pons, M
    Staedele, P
    Tschopp, JM
    Zellweger, JP
    Leuenberger, P
    [J]. SOZIAL-UND PRAVENTIVMEDIZIN, 2005, 50 (04): : 245 - 263
  • [2] Diversity and Complexity in DNA Recognition by Transcription Factors
    Badis, Gwenael
    Berger, Michael F.
    Philippakis, Anthony A.
    Talukder, Shaheynoor
    Gehrke, Andrew R.
    Jaeger, Savina A.
    Chan, Esther T.
    Metzler, Genita
    Vedenko, Anastasia
    Chen, Xiaoyu
    Kuznetsov, Hanna
    Wang, Chi-Fong
    Coburn, David
    Newburger, Daniel E.
    Morris, Quaid
    Hughes, Timothy R.
    Bulyk, Martha L.
    [J]. SCIENCE, 2009, 324 (5935) : 1720 - 1723
  • [3] Combining evidence using p-values: application to sequence homology searches
    Bailey, TL
    Gribskov, M
    [J]. BIOINFORMATICS, 1998, 14 (01) : 48 - 54
  • [4] Bailey TL., 1994, Proc Int Conf Intel Syst Mol Biol, V2, P28
  • [5] SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS - STATISTICAL-MECHANICAL THEORY AND APPLICATION TO OPERATORS AND PROMOTERS
    BERG, OG
    VONHIPPEL, PH
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) : 723 - 743
  • [6] Evolution of the mammalian transcription factor binding repertoire via transposable elements
    Bourque, Guillaume
    Leong, Bernard
    Vega, Vinsensius B.
    Chen, Xi
    Lee, Yen Ling
    Srinivasan, Kandhadayar G.
    Chew, Joon-Lin
    Ruan, Yijun
    Wei, Chia-Lin
    Ng, Huck Hui
    Liu, Edison T.
    [J]. GENOME RESEARCH, 2008, 18 (11) : 1752 - 1762
  • [7] JAK-STAT PATHWAYS AND TRANSCRIPTIONAL ACTIVATION IN RESPONSE TO IFNS AND OTHER EXTRACELLULAR SIGNALING PROTEINS
    DARNELL, JE
    KERR, IM
    STARK, GR
    [J]. SCIENCE, 1994, 264 (5164) : 1415 - 1421
  • [8] DNA binding specificity of different STAT proteins -: Comparison of in vitro specificity with natural target sites
    Ehret, GB
    Reichenbach, P
    Schindler, U
    Horvath, CM
    Fritz, S
    Nabholz, M
    Bucher, P
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (09) : 6675 - 6688
  • [9] FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology
    Fejes, Anthony P.
    Robertson, Gordon
    Bilenky, Mikhail
    Varhol, Richard
    Bainbridge, Matthew
    Jones, Steven J. M.
    [J]. BIOINFORMATICS, 2008, 24 (15) : 1729 - 1730
  • [10] Limitations and potentials of current motif discovery algorithms
    Hu, JJ
    Li, B
    Kihara, D
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 (15) : 4899 - 4913