Examples of sequence conservation analyses capture a subset of mouse long non-coding RNAs sharing homology with fish conserved genomic elements

被引:17
作者
Basu, Swaraj [1 ]
Mueller, Ferenc [2 ]
Sanges, Remo [1 ]
机构
[1] Stn Zool A Dohrn, Lab Anim Physiol & Evolut, I-80121 Naples, Italy
[2] Univ Birmingham, Coll Med & Dent Sci, Ctr Rare Dis & Personalized Med, Sch Clin & Expt Med, Birmingham, W Midlands, England
关键词
EMBRYONIC STEM-CELLS; LARGE GENE LISTS; COMPUTATIONAL IDENTIFICATION; ULTRACONSERVED ELEMENTS; LMO3; INTERACTS; TRANSCRIPTION; ANNOTATION; ENHANCER; REVEALS; PLURIPOTENCY;
D O I
10.1186/1471-2105-14-S7-S14
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Long non-coding RNAs (lncRNA) are a major class of non-coding RNAs. They are involved in diverse intra-cellular mechanisms like molecular scaffolding, splicing and DNA methylation. Through these mechanisms they are reported to play a role in cellular differentiation and development. They show an enriched expression in the brain where they are implicated in maintaining cellular identity, homeostasis, stress responses and plasticity. Low sequence conservation and lack of functional annotations make it difficult to identify homologs of mammalian lncRNAs in other vertebrates. A computational evaluation of the lncRNAs through systematic conservation analyses of both sequences as well as their genomic architecture is required. Results: Our results show that a subset of mouse candidate lncRNAs could be distinguished from random sequences based on their alignment with zebrafish phastCons elements. Using ROC analyses we were able to define a measure to select significantly conserved lncRNAs. Indeed, starting from similar to 2,800 mouse lncRNAs we could predict that between 4 and 11% present conserved sequence fragments in fish genomes. Gene ontology (GO) enrichment analyses of protein coding genes, proximal to the region of conservation, in both organisms highlighted similar GO classes like regulation of transcription and central nervous system development. The proximal coding genes in both the species show enrichment of their expression in brain. In summary, we show that interesting genomic regions in zebrafish could be marked based on their sequence homology to a mouse lncRNA, overlap with ESTs and proximity to genes involved in nervous system development. Conclusions: Conservation at the sequence level can identify a subset of putative lncRNA orthologs. The similar protein-coding neighborhood and transcriptional information about the conserved candidates provide support to the hypothesis that they share functional homology. The pipeline herein presented represents a proof of principle showing that a portion between 4 and 11% of lncRNAs retains region of conservation between mammals and fishes. We believe this study will result useful as a reference to analyze the conservation of lncRNAs in newly sequenced genomes and transcriptomes.
引用
收藏
页数:17
相关论文
共 69 条
[1]   lncRNAdb: a reference database for long noncoding RNAs [J].
Amaral, Paulo P. ;
Clark, Michael B. ;
Gascoigne, Dennis K. ;
Dinger, Marcel E. ;
Mattick, John S. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D146-D151
[2]   Complex architecture and regulated expression of the Sox2ot locus during vertebrate development [J].
Amaral, Paulo P. ;
Neyt, Christine ;
Wilkins, Simon J. ;
Askarian-Amiri, Marjan E. ;
Sunkin, Susan M. ;
Perkins, Andrew C. ;
Mattick, John S. .
RNA, 2009, 15 (11) :2013-2027
[3]  
[Anonymous], GENOME RES
[4]  
[Anonymous], 2004, Mach. Learn.
[5]   LMO3 interacts with neuronal transcription factor, HEN2, and acts as an oncogene in neuroblastoma [J].
Aoyama, M ;
Ozaki, T ;
Inuzuka, H ;
Tomotsune, D ;
Hirato, J ;
Okamoto, Y ;
Tokita, H ;
Ohira, M ;
Nakagawara, A .
CANCER RESEARCH, 2005, 65 (11) :4587-4597
[6]   Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus Paracoccidioides brasiliensis Software [J].
Arrial, Roberto T. ;
Togawa, Roberto C. ;
Brigido, Marcelo de M. .
BMC BIOINFORMATICS, 2009, 10
[7]   Long noncoding RNAs are rarely translated in two human cell lines [J].
Banfai, Balazs ;
Jia, Hui ;
Khatun, Jainab ;
Wood, Emily ;
Risk, Brian ;
Gundling, William E., Jr. ;
Kundaje, Anshul ;
Gunawardena, Harsha P. ;
Yu, Yanbao ;
Xie, Ling ;
Krajewski, Krzysztof ;
Strahl, Brian D. ;
Chen, Xian ;
Bickel, Peter ;
Giddings, Morgan C. ;
Brown, James B. ;
Lipovich, Leonard .
GENOME RESEARCH, 2012, 22 (09) :1646-1657
[8]   Ultraconserved elements in the human genome [J].
Bejerano, G ;
Pheasant, M ;
Makunin, I ;
Stephen, S ;
Kent, WJ ;
Mattick, JS ;
Haussler, D .
SCIENCE, 2004, 304 (5675) :1321-1325
[9]   THE HUMAN XIST GENE - ANALYSIS OF A 17 KB INACTIVE X-SPECIFIC RNA THAT CONTAINS CONSERVED REPEATS AND IS HIGHLY LOCALIZED WITHIN THE NUCLEUS [J].
BROWN, CJ ;
HENDRICH, BD ;
RUPERT, JL ;
LAFRENIERE, RG ;
XING, Y ;
LAWRENCE, J ;
WILLARD, HF .
CELL, 1992, 71 (03) :527-542
[10]   Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses [J].
Cabili, Moran N. ;
Trapnell, Cole ;
Goff, Loyal ;
Koziol, Magdalena ;
Tazon-Vega, Barbara ;
Regev, Aviv ;
Rinn, John L. .
GENES & DEVELOPMENT, 2011, 25 (18) :1915-1927