Defining the Plasticity of Transcription Factor Binding Sites by Deconstructing DNA Consensus Sequences: The PhoP-Binding Sites among Gamma/Enterobacteria

Cited by: 32
Authors
Harari, Oscar [1 ,2 ]
Park, Sun-Yang [3 ]
Huang, Henry [3 ]
Groisman, Eduardo A. [3 ,4 ]
Zwir, Igor [1 ,3 ,4 ]
Affiliations
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, Granada, Spain
[2] Washington Univ, Sch Med, Dept Psychiat, St Louis, MO 63110 USA
[3] Washington Univ, Sch Med, Dept Mol Microbiol, St Louis, MO 63110 USA
[4] Washington Univ, Sch Med, Howard Hughes Med Inst, St Louis, MO 63110 USA
Keywords
FUZZY-LOGIC CONTROLLERS; ESCHERICHIA-COLI; MOLECULAR CHARACTERIZATION; SALMONELLA-TYPHIMURIUM; REGULATORY NETWORK; PROTEIN; DISCOVERY; GENOME; GENES; IDENTIFICATION;
DOI
10.1371/journal.pcbi.1000862
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Transcriptional regulators recognize specific DNA sequences. Because these sequences are embedded in the background of genomic DNA, it is hard to identify the key cis-regulatory elements that determine disparate patterns of gene expression. Detecting the intra- and inter-species differences among these sequences is crucial for understanding the molecular basis of both differential gene expression and evolution. Here, we address this problem by investigating the target promoters controlled by the DNA-binding PhoP protein, which governs virulence and Mg²⁺ homeostasis in several bacterial species. PhoP is particularly interesting because it is highly conserved across gamma/enterobacteria and regulates not only ancestral genes but also dozens of horizontally acquired genes that differ from species to species. Our approach consists of decomposing the DNA binding site sequences for a given regulator into families of motifs (termed submotifs) using a machine learning method inspired by the "Divide & Conquer" strategy. Partitioning a motif into sub-patterns produced computational advantages for classification, leading to the discovery of new members of a regulon and alleviating the problem of distinguishing functional sites in chromatin immunoprecipitation and DNA microarray genome-wide analyses. Moreover, we found that certain partitions were useful in revealing biological properties of binding site sequences, including modular gains and losses of PhoP binding sites through evolutionary turnover events, as well as conservation in distant species. The high conservation of PhoP submotifs within gamma/enterobacteria, as well as of the regulatory protein that recognizes them, suggests that the major cause of divergence between related species is not the binding sites themselves, as was previously suggested for other regulators. Instead, the divergence may be attributed to the fast evolution of orthologous target genes and/or the promoter architectures resulting from the interaction of those binding sites with the RNA polymerase.
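
The idea of splitting one consensus into several submotifs can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract only says that a machine learning method inspired by "Divide & Conquer" was used, so the sketch substitutes a simple greedy grouping by Hamming distance. The sequences, the `max_dist` threshold, and the function names are hypothetical and chosen only for illustration.

```python
from collections import Counter


def hamming(a, b):
    """Number of mismatching positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def partition_sites(sites, max_dist):
    """Greedy grouping: each site joins the first group whose seed
    (first member) is within max_dist mismatches, else starts a new group."""
    groups = []
    for site in sites:
        for group in groups:
            if hamming(site, group[0]) <= max_dist:
                group.append(site)
                break
        else:
            groups.append([site])
    return groups


def consensus(group):
    """Most frequent base in each column of an aligned group of sites."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*group))


# Toy, made-up aligned sites (NOT real PhoP binding sites).
sites = ["TGTTTA", "TGTTTA", "GGTTTA", "TGTATA", "CATTAA", "CATTAT", "CATTAA"]

groups = partition_sites(sites, max_dist=1)
submotifs = [consensus(g) for g in groups]

for motif, members in zip(submotifs, groups):
    print(motif, members)
```

With these toy inputs, a single consensus over all seven sites would blur two distinct patterns, whereas the partition keeps each group's pattern separate. A real analysis would need aligned sites from curated data and a more principled clustering criterion.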
Pages: 20