The Role of Genome Accessibility in Transcription Factor Binding in Bacteria

被引:6
作者
Gomes, Antonio L. C. [1 ]
Wang, Harris H. [1 ,2 ]
机构
[1] Columbia Univ, Dept Syst Biol, New York, NY USA
[2] Columbia Univ, Dept Pathol & Cell Biol, New York, NY USA
关键词
PROTEIN-DNA INTERACTIONS; CHIP-SEQ DATA; GENE-EXPRESSION; MYCOBACTERIUM-TUBERCULOSIS; CHROMATIN ACCESSIBILITY; MODEL; YEAST; SPECIFICITY; LANDSCAPES; PROMOTERS;
D O I
10.1371/journal.pcbi.1004891
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
ChIP-seq enables genome-scale identification of regulatory regions that govern gene expression. However, the biological insights generated from ChIP-seq analysis have been limited to predictions of binding sites and cooperative interactions. Furthermore, ChIP-seq data often poorly correlate with in vitro measurements or predicted motifs, highlighting that binding affinity alone is insufficient to explain transcription factor (TF)-binding in vivo. One possibility is that binding sites are not equally accessible across the genome. A more comprehensive biophysical representation of TF-binding is required to improve our ability to understand, predict, and alter gene expression. Here, we show that genome accessibility is a key parameter that impacts TF-binding in bacteria. We developed a thermodynamic model that parameterizes ChIP-seq coverage in terms of genome accessibility and binding affinity. The role of genome accessibility is validated using a large-scale ChIP-seq dataset of the M. tuberculosis regulatory network. We find that accounting for genome accessibility led to a model that explains 63% of the ChIP-seq profile variance, while a model based in motif score alone explains only 35% of the variance. Moreover, our framework enables de novo ChIP-seq peak prediction and is useful for inferring TF-binding peaks in new experimental conditions by reducing the need for additional experiments. We observe that the genome is more accessible in intergenic regions, and that increased accessibility is positively correlated with gene expression and anti-correlated with distance to the origin of replication. Our biophysically motivated model provides a more comprehensive description of TF-binding in vivo from first principles towards a better representation of gene regulation in silico, with promising applications in systems biology.
引用
收藏
页数:16
相关论文
共 52 条
[11]   Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE [J].
Foat, Barrett C. ;
Morozov, Alexandre V. ;
Bussemaker, Harmen J. .
BIOINFORMATICS, 2006, 22 (14) :E141-E149
[12]   ChIP-Seq and the Complexity of Bacterial Transcriptional Regulation [J].
Galagan, James ;
Lyubetskaya, Anna ;
Gomes, Antonio .
SYSTEMS BIOLOGY, 2013, 363 :43-68
[13]   The Mycobacterium tuberculosis regulatory network and hypoxia [J].
Galagan, James E. ;
Minch, Kyle ;
Peterson, Matthew ;
Lyubetskaya, Anna ;
Azizi, Elham ;
Sweet, Linsday ;
Gomes, Antonio ;
Rustad, Tige ;
Dolganov, Gregory ;
Glotova, Irina ;
Abeel, Thomas ;
Mahwinney, Chris ;
Kennedy, Adam D. ;
Allard, Rene ;
Brabant, William ;
Krueger, Andrew ;
Jaini, Suma ;
Honda, Brent ;
Yu, Wen-Han ;
Hickey, Mark J. ;
Zucker, Jeremy ;
Garay, Christopher ;
Weiner, Brian ;
Sisk, Peter ;
Stolte, Christian ;
Winkler, Jessica K. ;
Van de Peer, Yves ;
Iazzetti, Paul ;
Camacho, Diogo ;
Dreyfuss, Jonathan ;
Liu, Yang ;
Dorhoi, Anca ;
Mollenkopf, Hans-Joachim ;
Drogaris, Paul ;
Lamontagne, Julie ;
Zhou, Yiyong ;
Piquenot, Julie ;
Park, Sang Tae ;
Raman, Sahadevan ;
Kaufmann, Stefan H. E. ;
Mohney, Robert P. ;
Chelsky, Daniel ;
Moody, D. Branch ;
Sherman, David R. ;
Schoolnik, Gary K. .
NATURE, 2013, 499 (7457) :178-183
[14]   Analysis of combinatorial cis-regulation in synthetic and genomic promoters [J].
Gertz, Jason ;
Siggia, Eric D. ;
Cohen, Barak A. .
NATURE, 2009, 457 (7226) :215-U113
[15]   Decoding ChIP-seq with a double-binding signal refines binding peaks to single-nucleotides and predicts cooperative interaction [J].
Gomes, Antonio L. C. ;
Abeel, Thomas ;
Peterson, Matthew ;
Azizi, Elham ;
Lyubetskaya, Anna ;
Carvalho, Luis ;
Galagan, James .
GENOME RESEARCH, 2014, 24 (10) :1686-1697
[16]   Lsr2 is a nucleoid-associated protein that targets AT-rich sequences and virulence genes in Mycobacterium tuberculosis [J].
Gordon, Blair R. G. ;
Li, Yifei ;
Wang, Linru ;
Sintsova, Anna ;
van Bakel, Harm ;
Tian, Songhai ;
Navarre, William Wiley ;
Xia, Bin ;
Liu, Jun .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (11) :5154-5159
[17]   ChIP-nexus enables improved detection of in vivo transcription factor binding footprints [J].
He, Qiye ;
Johnston, Jeff ;
Zeitlinger, Julia .
NATURE BIOTECHNOLOGY, 2015, 33 (04) :395-U108
[18]   The seven E-coli ribosomal RNA operon upstream regulatory regions differ in structure and transcription factor binding efficiencies [J].
Hillebrand, A ;
Wurm, R ;
Menzel, A ;
Wagner, R .
BIOLOGICAL CHEMISTRY, 2005, 386 (06) :523-534
[19]   Genome-wide mapping of in vivo protein-DNA interactions [J].
Johnson, David S. ;
Mortazavi, Ali ;
Myers, Richard M. ;
Wold, Barbara .
SCIENCE, 2007, 316 (5830) :1497-1502
[20]   A Whole-Cell Computational Model Predicts Phenotype from Genotype [J].
Karr, Jonathan R. ;
Sanghvi, Jayodita C. ;
Macklin, Derek N. ;
Gutschow, Miriam V. ;
Jacobs, Jared M. ;
Bolival, Benjamin ;
Assad-Garcia, Nacyra, Jr. ;
Glass, John I. ;
Covert, Markus W. .
CELL, 2012, 150 (02) :389-401