Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing

被引:80
作者
Anvar, Seyed Yahya [1 ,2 ,3 ]
Allard, Guy [1 ]
Tseng, Elizabeth [4 ]
Sheynkman, Gloria M. [5 ,6 ,7 ]
de Klerk, Eleonora [1 ,8 ]
Vermaat, Martijn [1 ,2 ]
Yin, Raymund H. [9 ]
Johansson, Hans E. [9 ]
Ariyurek, Yavuz [1 ,2 ]
den Dunnen, Johan T. [1 ,2 ]
Turner, Stephen W. [4 ]
't Hoen, Peter A. C. [1 ,10 ]
机构
[1] Leiden Univ, Dept Human Genet, Med Ctr, NL-2300 RC Leiden, Netherlands
[2] Leiden Univ, Leiden Genome Technol Ctr, Med Ctr, NL-2300 RC Leiden, Netherlands
[3] Leiden Univ, Dept Clin Pharm & Toxicol, Med Ctr, NL-2300 RC Leiden, Netherlands
[4] Pacific Biosci, 1305 OBrien Dr, Menlo Pk, CA 94025 USA
[5] Dana Farber Canc Inst, CCSB, Boston, MA 02215 USA
[6] Dana Farber Canc Inst, Dept Canc Biol, Boston, MA 02215 USA
[7] Harvard Med Sch, Dept Genet, Boston, MA 02115 USA
[8] Univ Calif San Francisco, UCSF Diabet Ctr, Dept Microbiol & Immunol, San Francisco, CA 94143 USA
[9] LGC Biosearch Technol, Petaluma, CA 94954 USA
[10] Radboud Univ Nijmegen, Med Ctr, Radboud Inst Mol Life Sci, Ctr Mol & Biomol Informat, Nijmegen, Netherlands
来源
GENOME BIOLOGY | 2018年 / 19卷
关键词
IN-SITU HYBRIDIZATION; BINDING PROTEIN HUR; ADHERENT CELLS; SEQ DATA; POLYADENYLATION; NUCLEAR; IDENTIFICATION; STABILITY; ISOFORM; COMPLEXITY;
D O I
10.1186/s13059-018-1418-0
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing. Results: In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells. Conclusions: Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.
引用
收藏
页数:18
相关论文
共 72 条
  • [1] HTSeq-a Python']Python framework to work with high-throughput sequencing data
    Anders, Simon
    Pyl, Paul Theodor
    Huber, Wolfgang
    [J]. BIOINFORMATICS, 2015, 31 (02) : 166 - 169
  • [2] Anvar SYA, 2018, DATASETS SCRIPTS FUL
  • [3] Arindrarto W., 2016, GENTRAP GENERIC TRAN
  • [4] Characterization of the human ESC transcriptome by hybrid sequencing
    Au, Kin Fai
    Sebastiano, Vittorio
    Afshar, Pegah Tootoonchi
    Durruthy, Jens Durruthy
    Lee, Lawrence
    Williams, Brian A.
    van Bakel, Harm
    Schadt, Eric E.
    Reijo-Pera, Renee A.
    Underwood, Jason G.
    Wong, Wing Hung
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (50) : E4821 - E4830
  • [5] A subset of nuclear receptor coregulators act as coupling proteins during synthesis and maturation of RNA transcripts
    Auboeuf, D
    Dowhan, DH
    Dutertre, M
    Martin, N
    Berget, SM
    O'Malley, BW
    [J]. MOLECULAR AND CELLULAR BIOLOGY, 2005, 25 (13) : 5307 - 5316
  • [6] Mammalian smaug is a translational repressor that forms cytoplasmic foci similar to stress granules
    Baez, MV
    Boccaccio, GL
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (52) : 43131 - 43140
  • [7] DREME: motif discovery in transcription factor ChIP-seq data
    Bailey, Timothy L.
    [J]. BIOINFORMATICS, 2011, 27 (12) : 1653 - 1659
  • [8] Deciphering the splicing code
    Barash, Yoseph
    Calarco, John A.
    Gao, Weijun
    Pan, Qun
    Wang, Xinchen
    Shai, Ofer
    Blencowe, Benjamin J.
    Frey, Brendan J.
    [J]. NATURE, 2010, 465 (7294) : 53 - 59
  • [9] Changes in Cellular mRNA Stability, Splicing, and Polyadenylation through HuR Protein Sequestration by a Cytoplasmic RNA Virus
    Barnhart, Michael D.
    Moon, Stephanie L.
    Emch, Alexander W.
    Wilusz, Carol J.
    Wilusz, Jeffrey
    [J]. CELL REPORTS, 2013, 5 (04): : 909 - 917
  • [10] Loss of MBNL Leads to Disruption of Developmentally Regulated Alternative Polyadenylation in RNA-Mediated Disease
    Batra, Ranjan
    Charizanis, Konstantinos
    Manchanda, Mini
    Mohan, Apoorva
    Li, Moyi
    Finn, Dustin J.
    Goodwin, Marianne
    Zhang, Chaolin
    Sobczak, Krzysztof
    Thornton, Charles A.
    Swanson, Maurice S.
    [J]. MOLECULAR CELL, 2014, 56 (02) : 311 - 322