Fast, Sensitive Discovery of Conserved Genome-Wide Motifs

被引:5
|
作者
Ihuegbu, Nnamdi E. [1 ]
Stormo, Gary D. [1 ]
Buhler, Jeremy [2 ]
机构
[1] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[2] Washington Univ, Dept Comp Sci & Engn, St Louis, MO 63108 USA
关键词
ChIP analysis; cis-regulatory elements; eukaryotic motif-finding; fast motif-finding; genome-wide motif-finding; motif-expression association; motif redundancy; transcription factor binding site discovery; TRANSCRIPTION-FACTOR-BINDING; CAENORHABDITIS-ELEGANS; COREGULATED GENES; REGULATORY MOTIFS; SITES; DNA; IDENTIFICATION; INFORMATION; PROMOTERS; SEQUENCES;
D O I
10.1089/cmb.2011.0249
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Regulatory sites that control gene expression are essential to the proper functioning of cells, and identifying them is critical for modeling regulatory networks. We have developed Magma (Multiple Aligner of Genomic Multiple Alignments), a software tool for multiple species, multiple gene motif discovery. Magma identifies putative regulatory sites that are conserved across multiple species and occur near multiple genes throughout a reference genome. Magma takes as input multiple alignments that can include gaps. It uses efficient clustering methods that make it about 70 times faster than PhyloNet, a previous program for this task, with slightly greater sensitivity. We ran Magma on all non-coding DNA conserved between Caenorhabditis elegans and five additional species, about 70Mbp in total, in < 4h. We obtained 2,309 motifs with lengths of 6-20 bp, each occurring at least 10 times throughout the genome, which collectively covered about 566 kbp of the genomes, approximately 0.8% of the input. Predicted sites occurred in all types of non-coding sequence but were especially enriched in the promoter regions. Comparisons to several experimental datasets show that Magma motifs correspond to a variety of known regulatory motifs.
引用
收藏
页码:139 / 147
页数:9
相关论文
共 50 条
  • [1] Genome-wide Search for Coaxial Helical Stacking Motifs
    Byron, Kevin
    Wang, Jason T. L.
    Wen, Dongrong
    IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 260 - 265
  • [2] Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated
    Choy, Mun-Kit
    Movassagh, Mehregan
    Goh, Hock-Guan
    Bennett, Martin R.
    Down, Thomas A.
    Foo, Roger S. Y.
    BMC GENOMICS, 2010, 11
  • [3] Genome-wide identification of conserved regulatory function in diverged sequences
    Taher, Leila
    McGaughey, David M.
    Maragh, Samantha
    Aneas, Ivy
    Bessling, Seneca L.
    Miller, Webb
    Nobrega, Marcelo A.
    McCallion, Andrew S.
    Ovcharenko, Ivan
    GENOME RESEARCH, 2011, 21 (07) : 1139 - 1149
  • [4] Genome-Wide Identification of Effector Candidates With Conserved Motifs From the Wheat Leaf Rust FungusPuccinia triticina
    Zhao, Shuqing
    Shang, Xiaofeng
    Bi, Weishuai
    Yu, Xiumei
    Liu, Daqun
    Kang, Zhensheng
    Wang, Xiaojie
    Wang, Xiaodong
    FRONTIERS IN MICROBIOLOGY, 2020, 11
  • [5] Genome-wide analysis predicts DNA structural motifs as nucleosome exclusion signals
    Halder, Kangkan
    Halder, Rashi
    Chowdhury, Shantanu
    MOLECULAR BIOSYSTEMS, 2009, 5 (12) : 1703 - 1712
  • [6] Genome-wide discovery of human heart enhancers
    Narlikar, Leelavati
    Sakabe, Noboru J.
    Blanski, Alexander A.
    Arimura, Fabio E.
    Westlund, John M.
    Nobrega, Marcelo A.
    Ovcharenko, Ivan
    GENOME RESEARCH, 2010, 20 (03) : 381 - 392
  • [7] Genome-wide discovery of G-quadruplexes in barley
    Cagirici, H. Busra
    Budak, Hikmet
    Sen, Taner Z.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [8] Genome-Wide Discovery of Small RNAs in Mycobacterium tuberculosis
    Miotto, Paolo
    Forti, Francesca
    Ambrosi, Alessandro
    Pellin, Danilo
    Veiga, Diogo F.
    Balazsi, Gabor
    Gennaro, Maria L.
    Di Serio, Clelia
    Ghisotti, Daniela
    Cirillo, Daniela M.
    PLOS ONE, 2012, 7 (12):
  • [9] ReMo-SNPs: a new software tool for identification of polymorphisms in regions and motifs genome-wide
    Graae, Lisette
    Paddock, Silvia
    Belin, Andrea Carmine
    GENETICS RESEARCH, 2015, 97 : e8
  • [10] Mapping genome-wide transcription factor binding sites in frozen tissues
    Savic, Daniel
    Gertz, Jason
    Jain, Preti
    Cooper, Gregory M.
    Myers, Richard M.
    EPIGENETICS & CHROMATIN, 2013, 6