Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing

被引:82
作者
Anvar, Seyed Yahya [1 ,2 ,3 ]
Allard, Guy [1 ]
Tseng, Elizabeth [4 ]
Sheynkman, Gloria M. [5 ,6 ,7 ]
de Klerk, Eleonora [1 ,8 ]
Vermaat, Martijn [1 ,2 ]
Yin, Raymund H. [9 ]
Johansson, Hans E. [9 ]
Ariyurek, Yavuz [1 ,2 ]
den Dunnen, Johan T. [1 ,2 ]
Turner, Stephen W. [4 ]
't Hoen, Peter A. C. [1 ,10 ]
机构
[1] Leiden Univ, Dept Human Genet, Med Ctr, NL-2300 RC Leiden, Netherlands
[2] Leiden Univ, Leiden Genome Technol Ctr, Med Ctr, NL-2300 RC Leiden, Netherlands
[3] Leiden Univ, Dept Clin Pharm & Toxicol, Med Ctr, NL-2300 RC Leiden, Netherlands
[4] Pacific Biosci, 1305 OBrien Dr, Menlo Pk, CA 94025 USA
[5] Dana Farber Canc Inst, CCSB, Boston, MA 02215 USA
[6] Dana Farber Canc Inst, Dept Canc Biol, Boston, MA 02215 USA
[7] Harvard Med Sch, Dept Genet, Boston, MA 02115 USA
[8] Univ Calif San Francisco, UCSF Diabet Ctr, Dept Microbiol & Immunol, San Francisco, CA 94143 USA
[9] LGC Biosearch Technol, Petaluma, CA 94954 USA
[10] Radboud Univ Nijmegen, Med Ctr, Radboud Inst Mol Life Sci, Ctr Mol & Biomol Informat, Nijmegen, Netherlands
来源
GENOME BIOLOGY | 2018年 / 19卷
关键词
IN-SITU HYBRIDIZATION; BINDING PROTEIN HUR; ADHERENT CELLS; SEQ DATA; POLYADENYLATION; NUCLEAR; IDENTIFICATION; STABILITY; ISOFORM; COMPLEXITY;
D O I
10.1186/s13059-018-1418-0
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing. Results: In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells. Conclusions: Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.
引用
收藏
页数:18
相关论文
共 72 条
[1]   HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[2]  
Anvar SYA, 2018, DATASETS SCRIPTS FUL
[3]  
Arindrarto W., 2016, GENTRAP GENERIC TRAN
[4]   Characterization of the human ESC transcriptome by hybrid sequencing [J].
Au, Kin Fai ;
Sebastiano, Vittorio ;
Afshar, Pegah Tootoonchi ;
Durruthy, Jens Durruthy ;
Lee, Lawrence ;
Williams, Brian A. ;
van Bakel, Harm ;
Schadt, Eric E. ;
Reijo-Pera, Renee A. ;
Underwood, Jason G. ;
Wong, Wing Hung .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (50) :E4821-E4830
[5]   A subset of nuclear receptor coregulators act as coupling proteins during synthesis and maturation of RNA transcripts [J].
Auboeuf, D ;
Dowhan, DH ;
Dutertre, M ;
Martin, N ;
Berget, SM ;
O'Malley, BW .
MOLECULAR AND CELLULAR BIOLOGY, 2005, 25 (13) :5307-5316
[6]   Mammalian smaug is a translational repressor that forms cytoplasmic foci similar to stress granules [J].
Baez, MV ;
Boccaccio, GL .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (52) :43131-43140
[7]   DREME: motif discovery in transcription factor ChIP-seq data [J].
Bailey, Timothy L. .
BIOINFORMATICS, 2011, 27 (12) :1653-1659
[8]   Deciphering the splicing code [J].
Barash, Yoseph ;
Calarco, John A. ;
Gao, Weijun ;
Pan, Qun ;
Wang, Xinchen ;
Shai, Ofer ;
Blencowe, Benjamin J. ;
Frey, Brendan J. .
NATURE, 2010, 465 (7294) :53-59
[9]   Changes in Cellular mRNA Stability, Splicing, and Polyadenylation through HuR Protein Sequestration by a Cytoplasmic RNA Virus [J].
Barnhart, Michael D. ;
Moon, Stephanie L. ;
Emch, Alexander W. ;
Wilusz, Carol J. ;
Wilusz, Jeffrey .
CELL REPORTS, 2013, 5 (04) :909-917
[10]   Loss of MBNL Leads to Disruption of Developmentally Regulated Alternative Polyadenylation in RNA-Mediated Disease [J].
Batra, Ranjan ;
Charizanis, Konstantinos ;
Manchanda, Mini ;
Mohan, Apoorva ;
Li, Moyi ;
Finn, Dustin J. ;
Goodwin, Marianne ;
Zhang, Chaolin ;
Sobczak, Krzysztof ;
Thornton, Charles A. ;
Swanson, Maurice S. .
MOLECULAR CELL, 2014, 56 (02) :311-322