Long noncoding RNAs are rarely translated in two human cell lines

被引:334
作者
Banfai, Balazs [1 ]
Jia, Hui [2 ]
Khatun, Jainab [3 ]
Wood, Emily [2 ]
Risk, Brian [3 ]
Gundling, William E., Jr. [2 ]
Kundaje, Anshul [4 ]
Gunawardena, Harsha P. [5 ]
Yu, Yanbao [5 ]
Xie, Ling [5 ]
Krajewski, Krzysztof [5 ]
Strahl, Brian D. [5 ]
Chen, Xian [5 ]
Bickel, Peter [1 ]
Giddings, Morgan C. [6 ]
Brown, James B. [1 ]
Lipovich, Leonard [2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Wayne State Univ, Sch Med, Ctr Mol Med & Genet, Detroit, MI 48201 USA
[3] Boise State Univ, Biomol Res Ctr, Boise, ID 83725 USA
[4] Stanford Univ, Dept Comp Sci, Palo Alto, CA 94305 USA
[5] Univ N Carolina, Sch Med, Chapel Hill, NC USA
[6] Boise State Univ, Coll Arts & Sci, Boise, ID 83725 USA
关键词
RELATIVE PROTEIN ABUNDANCE; COMPARING PROTEIN; GENE-EXPRESSION; DISCOVERY; IDENTIFICATION; TRANSCRIPTION; ABSOLUTE; REVEALS; COMPLEX;
D O I
10.1101/gr.134767.111
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Data from the Encyclopedia of DNA Elements (ENCODE) project show over 9640 human genome loci classified as long noncoding RNAs (IncRNAs), yet only 100 have been deeply characterized to determine their role in the cell. To measure the protein-coding output from these RNAs, we jointly analyzed two recent data sets produced in the ENCODE project: tandem mass spectrometry (MS/MS) data mapping expressed peptides to their encoding genomic loci, and RNA-seq data generated by ENCODE in long polyA+ and polyA- fractions in the cell lines K562 and GMI2878. We used the machine-learning algorithm RuleFit3 to regress the peptide data against RNA expression data. The most important covariate for predicting translation was, surprisingly, the Cytosol polyA- fraction in both cell lines. LncRNAs are similar to 13-fold less likely to produce detectable peptides than similar mRNAs, indicating that similar to 92% of GENCODE v7 IncRNAs are not translated in these two ENCODE cell lines. Intersecting 9640 IncRNA loci with 79,333 peptides yielded 85 unique peptides matching 69 IncRNAs. Most cases were due to a coding transcript misannotated as IncRNA. Two exceptions were an unprocessed pseudogene and a bona fide IncRNA gene, both with open reading frames (ORFs) compromised by upstream stop codons. All potentially translatable IncRNA ORFs had only a single peptide match, indicating low protein abundance and/or false-positive peptide matches. We conclude that with very few exceptions, ribosomes are able to distinguish coding from noncoding transcripts and, hence, that ectopic translation and cryptic mRNAs are rare in the human IncRNAome.
引用
收藏
页码:1646 / 1657
页数:12
相关论文
共 47 条
[31]   A model for random sampling and estimation of relative protein abundance in shotgun proteomics [J].
Liu, HB ;
Sadygov, RG ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2004, 76 (14) :4193-4201
[32]   Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation [J].
Lu, Peng ;
Vogel, Christine ;
Wang, Rong ;
Yao, Xin ;
Marcotte, Edward M. .
NATURE BIOTECHNOLOGY, 2007, 25 (01) :117-124
[33]   Conserved long noncoding RNAs transcriptionally regulated by Oct4 and Nanog modulate pluripotency in mouse embryonic stem cells [J].
Mohamed, Jameelah Sheik ;
Gaughwin, Philip Michael ;
Lim, Bing ;
Robson, Paul ;
Lipovich, Leonard .
RNA, 2010, 16 (02) :324-337
[34]   Identification of a Cytoplasmic Complex That Adds a Cap onto 5′-Monophosphate RNA [J].
Otsuka, Yuichi ;
Kedersha, Nancy L. ;
Schoenberg, Daniel R. .
MOLECULAR AND CELLULAR BIOLOGY, 2009, 29 (08) :2155-2167
[35]   Dissecting the human BDNF locus:: Bidirectional transcription, complex splicing, and multiple promoters [J].
Pruunsild, Priit ;
Kazantseva, Anna ;
Aid, Tamara ;
Palm, Kaia ;
Timmusk, Tonis .
GENOMICS, 2007, 90 (03) :397-406
[36]  
Rohrbough James G, 2006, J Biomol Tech, V17, P327
[37]   Global quantification of mammalian gene expression control [J].
Schwanhaeusser, Bjoern ;
Busse, Dorothea ;
Li, Na ;
Dittmar, Gunnar ;
Schuchhardt, Johannes ;
Wolf, Jana ;
Chen, Wei ;
Selbach, Matthias .
NATURE, 2011, 473 (7347) :337-342
[38]   In-gel digestion for mass spectrometric characterization of proteins and proteomes [J].
Shevchenko, Andrej ;
Tomas, Henrik ;
Havlis, Jan ;
Olsen, Jesper V. ;
Mann, Matthias .
NATURE PROTOCOLS, 2006, 1 (06) :2856-2860
[39]   Improving gene annotation using peptide mass spectrometry [J].
Tanner, Stephen ;
Shen, Zhouxin ;
Ng, Julio ;
Florea, Liliana ;
Guigo, Roderic ;
Briggs, Steven P. ;
Bafna, Vineet .
GENOME RESEARCH, 2007, 17 (02) :231-239
[40]  
Tatusova TA, 1999, FEMS MICROBIOL LETT, V174, P247, DOI 10.1016/S0378-1097(99)00149-4