Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses

被引:2769
作者
Cabili, Moran N. [1 ,2 ,3 ,6 ]
Trapnell, Cole [1 ,3 ,6 ]
Goff, Loyal [1 ,4 ,6 ]
Koziol, Magdalena [1 ,3 ,6 ]
Tazon-Vega, Barbara [1 ,3 ,6 ]
Regev, Aviv [1 ,5 ,6 ]
Rinn, John L. [1 ,3 ,6 ]
机构
[1] MIT, Broad Inst, Cambridge, MA 02142 USA
[2] Harvard Univ, Sch Med, Dept Syst Biol, Boston, MA 02115 USA
[3] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[4] MIT, Comp Sci & Artificial Intelligence Lab, Dept Elect Engn & Comp Sci, Cambridge, MA 02140 USA
[5] MIT, Howard Hughes Med Inst, Dept Biol, Cambridge, MA 02140 USA
[6] Harvard Univ, Cambridge, MA 02142 USA
关键词
long noncoding RNAs; RNA sequencing; lincRNAs; HUMAN GENOME; CHROMATIN; GENE; TRANSCRIPTION; MOUSE; IDENTIFICATION; QUANTIFICATION; EXPRESSION; DELETION; DYNAMICS;
D O I
10.1101/gad.17446611
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Large intergenic noncoding RNAs (lincRNAs) are emerging as key regulators of diverse cellular processes. Determining the function of individual lincRNAs remains a challenge. Recent advances in RNA sequencing (RNA-seq) and computational methods allow for an unprecedented analysis of such transcripts. Here, we present an integrative approach to define a reference catalog of >8000 human lincRNAs. Our catalog unifies previously existing annotation sources with transcripts we assembled from RNA-seq data collected from similar to 4 billion RNA-seq reads across 24 tissues and cell types. We characterize each lincRNA by a panorama of >30 properties, including sequence, structural, transcriptional, and orthology features. We found that lincRNA expression is strikingly tissue-specific compared with coding genes, and that lincRNAs are typically coexpressed with their neighboring genes, albeit to an extent similar to that of pairs of neighboring protein-coding genes. We distinguish an additional subset of transcripts that have high evolutionary conservation but may include short ORFs and may serve as either lincRNAs or small peptides. Our integrated, comprehensive, yet conservative reference catalog of human lincRNAs reveals the global properties of lincRNAs and will facilitate experimental studies and further functional classification of these genes.
引用
收藏
页码:1915 / 1927
页数:13
相关论文
共 61 条
[1]   lncRNAdb: a reference database for long noncoding RNAs [J].
Amaral, Paulo P. ;
Clark, Michael B. ;
Gascoigne, Dennis K. ;
Dinger, Marcel E. ;
Mattick, John S. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D146-D151
[2]   Global identification of human transcribed sequences with genome tiling arrays [J].
Bertone, P ;
Stolc, V ;
Royce, TE ;
Rozowsky, JS ;
Urban, AE ;
Zhu, XW ;
Rinn, JL ;
Tongprasit, W ;
Samanta, M ;
Weissman, S ;
Gerstein, M ;
Snyder, M .
SCIENCE, 2004, 306 (5705) :2242-2246
[3]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[4]   Fast Statistical Alignment [J].
Bradley, Robert K. ;
Roberts, Adam ;
Smoot, Michael ;
Juvekar, Sudeep ;
Do, Jaeyoung ;
Dewey, Colin ;
Holmes, Ian ;
Pachter, Lior .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (05)
[5]   THE HUMAN XIST GENE - ANALYSIS OF A 17 KB INACTIVE X-SPECIFIC RNA THAT CONTAINS CONSERVED REPEATS AND IS HIGHLY LOCALIZED WITHIN THE NUCLEUS [J].
BROWN, CJ ;
HENDRICH, BD ;
RUPERT, JL ;
LAFRENIERE, RG ;
XING, Y ;
LAWRENCE, J ;
WILLARD, HF .
CELL, 1992, 71 (03) :527-542
[6]   The transcriptional landscape of the mammalian genome [J].
Carninci, P ;
Kasukawa, T ;
Katayama, S ;
Gough, J ;
Frith, MC ;
Maeda, N ;
Oyama, R ;
Ravasi, T ;
Lenhard, B ;
Wells, C ;
Kodzius, R ;
Shimokawa, K ;
Bajic, VB ;
Brenner, SE ;
Batalov, S ;
Forrest, ARR ;
Zavolan, M ;
Davis, MJ ;
Wilming, LG ;
Aidinis, V ;
Allen, JE ;
Ambesi-Impiombato, X ;
Apweiler, R ;
Aturaliya, RN ;
Bailey, TL ;
Bansal, M ;
Baxter, L ;
Beisel, KW ;
Bersano, T ;
Bono, H ;
Chalk, AM ;
Chiu, KP ;
Choudhary, V ;
Christoffels, A ;
Clutterbuck, DR ;
Crowe, ML ;
Dalla, E ;
Dalrymple, BP ;
de Bono, B ;
Della Gatta, G ;
di Bernardo, D ;
Down, T ;
Engstrom, P ;
Fagiolini, M ;
Faulkner, G ;
Fletcher, CF ;
Fukushima, T ;
Furuno, M ;
Futaki, S ;
Gariboldi, M .
SCIENCE, 2005, 309 (5740) :1559-1563
[7]   Long noncoding RNA genes: conservation of sequence and brain expression among diverse amniotes [J].
Chodroff, Rebecca A. ;
Goodstadt, Leo ;
Sirey, Tamara M. ;
Oliver, Peter L. ;
Davies, Kay E. ;
Green, Eric D. ;
Molnar, Zoltan ;
Ponting, Chris P. .
GENOME BIOLOGY, 2010, 11 (07)
[8]   A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression [J].
Cohen, BA ;
Mitra, RD ;
Hughes, JD ;
Church, GM .
NATURE GENETICS, 2000, 26 (02) :183-186
[9]   Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters [J].
Core, Leighton J. ;
Waterfall, Joshua J. ;
Lis, John T. .
SCIENCE, 2008, 322 (5909) :1845-1848
[10]   A Large Fraction of Extragenic RNA Pol II Transcription Sites Overlap Enhancers [J].
De Santa, Francesca ;
Barozzi, Iros ;
Mietton, Flore ;
Ghisletti, Serena ;
Polletti, Sara ;
Tusi, Betsabeh Khoramian ;
Muller, Heiko ;
Ragoussis, Jiannis ;
Wei, Chia-Lin ;
Natoli, Gioacchino .
PLOS BIOLOGY, 2010, 8 (05)