Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding

被引:47
作者
Le, Daniel D. [1 ]
Shimko, Tyler C. [1 ]
Aditham, Arjun K. [2 ,3 ]
Keys, Allison M. [3 ,4 ]
Longwell, Scott A. [2 ]
Orenstein, Yaron [5 ]
Fordyce, Polly M. [1 ,2 ,3 ,6 ]
机构
[1] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
[3] Stanford Univ, Stanford ChEM H Chem Engn & Med Human Hlth, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Chem, Stanford, CA 94305 USA
[5] Ben Gurion Univ Negev, Dept Elect & Comp Engn, POB 653, Beer Sheva, Israel
[6] Chan Zuckerberg Biohub, San Francisco, CA 94158 USA
关键词
protein-DNA binding; transcription factor binding; transcription factor specificity; microfluidics; transcriptional regulation; DNA-BINDING; QUANTITATIVE-ANALYSIS; REGULATORY SITES; OPEN CHROMATIN; SPECIFICITY; PROTEIN; RECOGNITION; SEQUENCE; MODELS; SHAPE;
D O I
10.1073/pnas.1715888115
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Transcription factors (TFs) are primary regulators of gene expression in cells, where they bind specific genomic target sites to control transcription. Quantitative measurements of TF-DNA binding energies can improve the accuracy of predictions of TF occupancy and downstream gene expression in vivo and shed light on how transcriptional networks are rewired throughout evolution. Here, we present a sequencing-based TF binding assay and analysis pipeline (BET-seq, for Binding Energy Topography by sequencing) capable of providing quantitative estimates of binding energies for more than one million DNA sequences in parallel at high energetic resolution. Using this platform, we measured the binding energies associated with all possible combinations of 10 nucleotides flanking the known consensus DNA target interacting with two model yeast TFs, Pho4 and Cbf1. A large fraction of these flanking mutations change overall binding energies by an amount equal to or greater than consensus site mutations, suggesting that current definitions of TF binding sites may be too restrictive. By systematically comparing estimates of binding energies output by deep neural networks (NNs) and biophysical models trained on these data, we establish that dinucleotide (DN) specificities are sufficient to explain essentially all variance in observed binding behavior, with Cbf1 binding exhibiting significantly more nonadditivity than Pho4. NN-derived binding energies agree with orthogonal biochemical measurements and reveal that dynamically occupied sites in vivo are both energetically and mutationally distant from the highest affinity sites.
引用
收藏
页码:E3702 / E3711
页数:10
相关论文
共 96 条
  • [1] Deconvolving the Recognition of DNA Shape from Sequence
    Abe, Namiko
    Dror, Iris
    Yang, Lin
    Slattery, Matthew
    Zhou, Tianyin
    Bussemaker, Harmen J.
    Rohs, Remo
    Mann, Richard S.
    [J]. CELL, 2015, 161 (02) : 307 - 318
  • [2] Nonconsensus Protein Binding to Repetitive DNA Sequence Elements Significantly Affects Eukaryotic Genomes
    Afek, Ariel
    Cohen, Hila
    Barber-Zucker, Shiran
    Gordan, Raluca
    Lukatsky, David B.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (08)
  • [3] Protein-DNA binding in the absence of specific base-pair recognition
    Afek, Ariel
    Schipper, Joshua L.
    Horton, John
    Gordan, Raluca
    Lukatsky, David B.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (48) : 17140 - 17145
  • [4] A thousand empirical adaptive landscapes and their navigability
    Aguilar-Rodriguez, Jose
    Payne, Joshua L.
    Wagner, Andreas
    [J]. NATURE ECOLOGY & EVOLUTION, 2017, 1 (02):
  • [5] A Linear Model for Transcription Factor Binding Affinity Prediction in Protein Binding Microarrays
    Annala, Matti
    Laurila, Kirsti
    Lahdesmaki, Harri
    Nykter, Matti
    [J]. PLOS ONE, 2011, 6 (05):
  • [6] [Anonymous], 2015, ARXIV PREPRINT ARXIV
  • [7] Differential binding of the related transcription factors Pho4 and Cbf1 can tune the sensitivity of promoters to different levels of an induction signal
    Aow, Jonathan S. Z.
    Xue, Xiaowei
    Run, Jin-Quan
    Lim, Geoffrey F. S.
    Goh, Wee Siong
    Clarke, Neil D.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (09) : 4877 - 4887
  • [8] Diversity and Complexity in DNA Recognition by Transcription Factors
    Badis, Gwenael
    Berger, Michael F.
    Philippakis, Anthony A.
    Talukder, Shaheynoor
    Gehrke, Andrew R.
    Jaeger, Savina A.
    Chan, Esther T.
    Metzler, Genita
    Vedenko, Anastasia
    Chen, Xiaoyu
    Kuznetsov, Hanna
    Wang, Chi-Fong
    Coburn, David
    Newburger, Daniel E.
    Morris, Quaid
    Hughes, Timothy R.
    Bulyk, Martha L.
    [J]. SCIENCE, 2009, 324 (5935) : 1720 - 1723
  • [9] Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities
    Berger, Michael F.
    Philippakis, Anthony A.
    Qureshi, Aaron M.
    He, Fangxue S.
    Estep, Preston W., III
    Bulyk, Martha L.
    [J]. NATURE BIOTECHNOLOGY, 2006, 24 (11) : 1429 - 1435
  • [10] Transcriptional regulation by the numbers: models
    Bintu, L
    Buchler, NE
    Garcia, HG
    Gerland, U
    Hwa, T
    Kondev, J
    Phillips, R
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2005, 15 (02) : 116 - 124