Reliable prediction of regulator targets using 12 Drosophila genomes

被引:124
作者
Kheradpour, Pouya
Stark, Alexander
Roy, Sushmita
Kellis, Manolis
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Harvard Univ, MIT, Broad Inst, Cambridge, MA 02141 USA
[3] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
关键词
D O I
10.1101/gr.7090407
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene expression is regulated pre- and post-transcriptionally via cis-regulatory DNA and RNA motifs. Identification of individual functional instances of such motifs in genome sequences is a major goal for inferring regulatory networks yet has been hampered due to the motifs' short lengths that lead to many chance matches and poor signal-to-noise ratios. In this paper, we develop a general methodology for the comparative identification of functional motif instances across many related species, using a phylogenetic framework that accounts for the evolutionary relationships between species, allows for motif movements, and is robust against missing data due to artifacts in sequencing, assembly, or alignment. We also provide a robust statistical framework for evaluating motif confidence, which enables us to translate evolutionary conservation into a confidence measure for each motif instance, correcting for varying motif length, composition, and background conservation of the target regions. We predict targets of fly transcription factors and miRNAs in alignments of 12 recently sequenced Drosophila species. When compared to extensive genome-wide experimental data, predicted targets are of high quality, matching and surpassing ChIP-chip microarrays and recovering miRNA targets with high sensitivity. The resulting regulatory network suggests significant redundancy between pre- and post-transcriptional regulation of gene expression.
引用
收藏
页码:1919 / 1931
页数:13
相关论文
共 73 条
[31]   Functional evolution of a cis-regulatory module [J].
Ludwig, MZ ;
Palsson, A ;
Alekseeva, E ;
Bergman, CM ;
Nathan, J ;
Kreitman, M .
PLOS BIOLOGY, 2005, 3 (04) :588-598
[32]   Evidence for stabilizing selection in a eukaryotic enhancer element [J].
Ludwig, MZ ;
Bergman, C ;
Patel, NH ;
Kreitman, M .
NATURE, 2000, 403 (6769) :564-567
[33]   Identification and characterization of multi-species conserved sequences [J].
Margulies, EH ;
Blanchette, M ;
Haussler, D ;
Green, ED .
GENOME RESEARCH, 2003, 13 (12) :2507-2518
[34]   Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome [J].
Margulies, Elliott H. ;
Cooper, Gregory M. ;
Asimenos, George ;
Thomas, Daryl J. ;
Dewey, Colin N. ;
Siepel, Adam ;
Birney, Ewan ;
Keefe, Damian ;
Schwartz, Ariel S. ;
Hou, Minmei ;
Taylor, James ;
Nikolaev, Sergey ;
Montoya-Burgos, Juan I. ;
Loytynoja, Ari ;
Whelan, Simon ;
Pardi, Fabio ;
Massingham, Tim ;
Brown, James B. ;
Bickel, Peter ;
Holmes, Ian ;
Mullikin, James C. ;
Ureta-Vidal, Abel ;
Paten, Benedict ;
Stone, Eric A. ;
Rosenbloom, Kate R. ;
Kent, W. James ;
Antonarakis, Stylianos E. ;
Batzoglou, Serafim ;
Goldman, Nick ;
Hardison, Ross ;
Haussler, David ;
Miller, Webb ;
Pachter, Lior ;
Green, Eric D. ;
Sidow, Arend .
GENOME RESEARCH, 2007, 17 (06) :760-774
[35]   A regulatory code for neurogenic gene expression in the Drosophila embryo [J].
Markstein, M ;
Zinzen, R ;
Markstein, P ;
Yee, KP ;
Erives, A ;
Stathopoulos, A ;
Levine, M .
DEVELOPMENT, 2004, 131 (10) :2387-2394
[36]   TRANSFAC®:: transcriptional regulation, from patterns to profiles [J].
Matys, V ;
Fricke, E ;
Geffers, R ;
Gössling, E ;
Haubrock, M ;
Hehl, R ;
Hornischer, K ;
Karas, D ;
Kel, AE ;
Kel-Margoulis, OV ;
Kloos, DU ;
Land, S ;
Lewicki-Potapov, B ;
Michael, H ;
Münch, R ;
Reuter, I ;
Rotert, S ;
Saxel, H ;
Scheer, M ;
Thiele, S ;
Wingender, E .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :374-378
[37]   Morphological evolution through multiple cis-regulatory mutations at a single [J].
McGregor, Alistair P. ;
Orgogozo, Virginie ;
Delon, Isabelle ;
Zanet, Jennifer ;
Srinivasan, Dayalan G. ;
Payre, Francois ;
Stern, David L. .
NATURE, 2007, 448 (7153) :587-U6
[38]   Comparative genomics [J].
Miller, W ;
Makova, KD ;
Nekrutenko, A ;
Hardison, RC .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2004, 5 :15-56
[39]   MONKEY: identifying conserved transcription-factor binding sites in multiple alignments using a binding site-specific evolutionary model [J].
Moses, AM ;
Chiang, DY ;
Pollard, DA ;
Iyer, VN ;
Eisen, MB .
GENOME BIOLOGY, 2004, 5 (12)
[40]   Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays [J].
Mukherjee, S ;
Berger, MF ;
Jona, G ;
Wang, XS ;
Muzzey, D ;
Snyder, M ;
Young, RA ;
Bulyk, ML .
NATURE GENETICS, 2004, 36 (12) :1331-1339