A new framework for identifying cis-regulatory motifs in prokaryotes

被引:29
|
作者
Li, Guojun [1 ,2 ,3 ]
Liu, Bingqiang [1 ,2 ,3 ]
Ma, Qin [1 ,2 ,3 ]
Xu, Ying [1 ,2 ,4 ]
机构
[1] Univ Georgia, Dept Biochem & Mol Biol, Computat Syst Biol Lab, Athens, GA 30602 USA
[2] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[3] Shandong Univ, Sch Math, Jinan 250100, Peoples R China
[4] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Jilin, Peoples R China
基金
美国国家科学基金会;
关键词
FACTOR-BINDING SITES; GAMMA-PROTEOBACTERIAL GENOMES; ESCHERICHIA-COLI; TRACTOR-DB; DNA; TRANSCRIPTION; DISCOVERY; SEQUENCES; DATABASE; PROTEIN;
D O I
10.1093/nar/gkq948
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a new algorithm, BOBRO, for prediction of cis-regulatory motifs in a given set of promoter sequences. The algorithm substantially improves the prediction accuracy and extends the scope of applicability of the existing programs based on two key new ideas: (i) we developed a highly effective method for reliably assessing the possibility for each position in a given promoter to be the (approximate) start of a conserved sequence motif; and (ii) we developed a highly reliable way for recognition of actual motifs from the accidental ones based on the concept of 'motif closure'. These two key ideas are embedded in a classical framework for motif finding through finding cliques in a graph but have made this framework substantially more sensitive as well as more selective in motif finding in a very noisy background. A comparative analysis shows that the performance coefficient was improved from 29% to 41% by our program compared to the best among other six state-of-the-art prediction tools on a large-scale data sets of promoters from one genome, and also consistently improved by substantial margins on another kind of large-scale data sets of orthologous promoters across multiple genomes. The power of BOBRO in dealing with noisy data was further demonstrated through identification of the motifs of the global transcriptional regulators by running it over 2390 promoter sequences of Escherichia coli K12.
引用
收藏
页码:E42 / U54
页数:9
相关论文
共 50 条
  • [31] Assessing Computational Methods of Cis-Regulatory Module Prediction
    Su, Jing
    Teichmann, Sarah A.
    Down, Thomas A.
    PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
  • [32] Study of Cis-regulatory Elements in the Ascidian Ciona intestinalis
    Irvine, Steven Q.
    CURRENT GENOMICS, 2013, 14 (01) : 56 - 67
  • [33] Parallel Evolution of Chordate Cis-Regulatory Code for Development
    Doglio, Laura
    Goode, Debbie K.
    Pelleri, Maria C.
    Pauls, Stefan
    Frabetti, Flavia
    Shimeld, Sebastian M.
    Vavouri, Tanya
    Elgar, Greg
    PLOS GENETICS, 2013, 9 (11):
  • [34] Cis-regulatory elements and human evolution
    Siepel, Adam
    Arbiza, Leonardo
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 2014, 29 : 81 - 89
  • [35] Unraveling Transcriptional Control in Arabidopsis Using cis-Regulatory Elements and Coexpression Networks
    Vandepoele, Klaas
    Quimbaya, Mauricio
    Casneuf, Tine
    De Veylder, Lieven
    Van de Peer, Yves
    PLANT PHYSIOLOGY, 2009, 150 (02) : 535 - 546
  • [36] Cis-regulatory landscapes in development and evolution
    Maeso, Ignacio
    Acemel, Rafael D.
    Luis Gomez-Skarmeta, Jose
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 2017, 43 : 17 - 22
  • [37] Bacterial cis-regulatory RNA structures
    Gelfand M.S.
    Molecular Biology, 2006, 40 (4) : 541 - 550
  • [38] Modeling the cis-regulatory modules of genes expressed in developmental stages of Drosophila melanogaster
    Lopez, Yosvany
    Vandenbon, Alexis
    Nose, Akinao
    Nakai, Kenta
    PEERJ, 2017, 5
  • [39] Bacterial cis-regulatory RNA structures
    Gelfand, M. S.
    MOLECULAR BIOLOGY, 2006, 40 (04) : 609 - 619
  • [40] Mutagenesis of GATA motifs controlling the endoderm regulator elt-2 reveals distinct dominant and secondary cis-regulatory elements
    Du, Lawrence
    Tracy, Sharon
    Rifkin, Scott A.
    DEVELOPMENTAL BIOLOGY, 2016, 412 (01) : 160 - 170