Genome-wide analysis of promoter architecture in Drosophila melanogaster

被引:176
作者
Hoskins, Roger A. [1 ]
Landolin, Jane M. [1 ]
Brown, James B. [2 ]
Sandler, Jeremy E. [1 ]
Takahashi, Hazuki [3 ]
Lassmann, Timo [3 ]
Yu, Charles [1 ]
Booth, Benjamin W. [1 ]
Zhang, Dayu [4 ,5 ]
Wan, Kenneth H. [1 ]
Yang, Li [6 ]
Boley, Nathan [2 ]
Andrews, Justen [4 ]
Kaufman, Thomas C. [4 ]
Graveley, Brenton R. [6 ]
Bickel, Peter J. [2 ]
Carninci, Piero [3 ]
Carlson, Joseph W. [1 ]
Celniker, Susan E. [1 ]
机构
[1] Lawrence Berkeley Lab, Div Life Sci, Dept Genome Dynam, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[3] RIKEN, Yokohama Inst, Om Sci Ctr, Kanagawa 2300045, Japan
[4] Indiana Univ, Dept Biol, Bloomington, IN 47405 USA
[5] Indiana Univ, Ctr Genom & Bioinformat, Bloomington, IN 47405 USA
[6] Univ Connecticut, Ctr Hlth, Dept Genet & Dev Biol, Farmington, CT 06030 USA
关键词
FULL-LENGTH CDNAS; RNA; SEQUENCE; CAP; IDENTIFICATION; GENES; EXPRESSION; DATABASE; COMPLEX;
D O I
10.1101/gr.112466.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLM-RACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.
引用
收藏
页码:182 / 192
页数:11
相关论文
共 44 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Methods for analyzing deep sequencing expression data: constructing the human and mouse promoterome with deepCAGE data [J].
Balwierz, Piotr J. ;
Carninci, Piero ;
Daub, Carsten O. ;
Kawai, Jun ;
Hayashizaki, Yoshihide ;
Van Belle, Werner ;
Beisel, Christian ;
van Nimwegen, Erik .
GENOME BIOLOGY, 2009, 10 (07)
[4]  
BICKEL PJ, 2011, ANN APPL ST IN PRESS
[5]   Transcriptional regulation in Drosophila: The post-genome challenge [J].
Biggin M.D. ;
Tjian R. .
Functional & Integrative Genomics, 2001, 1 (4) :223-234
[6]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[7]   F-Seq: a feature density estimator for high-throughput sequence tags [J].
Boyle, Alan P. ;
Guinney, Justin ;
Crawford, Gregory E. ;
Furey, Terrence S. .
BIOINFORMATICS, 2008, 24 (21) :2537-2538
[8]   Drosophila TFIID binds to a conserved downstream basal promoter element that is present in many TATA-box-deficient promoters [J].
Burke, TW ;
Kadonaga, JT .
GENES & DEVELOPMENT, 1996, 10 (06) :711-724
[9]   High-efficiency full-length cDNA cloning by biotinylated CAP trapper [J].
Carninci, P ;
Kvam, C ;
Kitamura, A ;
Ohsumi, T ;
Okazaki, Y ;
Itoh, M ;
Kamiya, M ;
Shibata, K ;
Sasaki, N ;
Izawa, M ;
Muramatsu, M ;
Hayashizaki, Y ;
Schneider, C .
GENOMICS, 1996, 37 (03) :327-336
[10]   Genome-wide analysis of mammalian promoter architecture and evolution [J].
Carninci, Piero ;
Sandelin, Albin ;
Lenhard, Boris ;
Katayama, Shintaro ;
Shimokawa, Kazuro ;
Ponjavic, Jasmina ;
Semple, Colin A. M. ;
Taylor, Martin S. ;
Engström, Par G. ;
Frith, Martin C. ;
Forrest, Alistair R. R. ;
Alkema, Wynand B. ;
Tan, Sin Lam ;
Plessy, Charles ;
Kodzius, Rimantas ;
Ravasi, Timothy ;
Kasukawa, Takeya ;
Fukuda, Shiro ;
Kanamori-Katayama, Mutsumi ;
Kitazume, Yayoi ;
Kawaji, Hideya ;
Kai, Chikatoshi ;
Nakamura, Mari ;
Konno, Hideaki ;
Nakano, Kenji ;
Mottagui-Tabar, Salim ;
Arner, Peter ;
Chesi, Alessandra ;
Gustincich, Stefano ;
Persichetti, Francesca ;
Suzuki, Harukazu ;
Grimmond, Sean M. ;
Wells, Christine A. ;
Orlando, Valerio ;
Wahlestedt, Claes ;
Liu, Edison T. ;
Harbers, Matthias ;
Kawai, Jun ;
Bajic, Vladimir B. ;
Hume, David A. ;
Hayashizaki, Yoshihide .
NATURE GENETICS, 2006, 38 (06) :626-635