Psiscan: a computational approach to identify H/ACA-like and AGA-like non-coding RNA in trypanosomatid genomes

被引:16
作者
Myslyuk, Inna [1 ]
Doniger, Tirza [1 ]
Horesh, Yair [2 ]
Hury, Avraham [1 ]
Hoffer, Ran [1 ]
Ziporen, Yaara [1 ]
Michaeli, Shulamit [1 ]
Unger, Ron [1 ]
机构
[1] Bar Ilan Univ, Fac Life Sci, IL-52900 Ramat Gan, Israel
[2] Weizmann Inst Sci, Dept Phys Complex Syst, IL-76100 Rehovot, Israel
基金
以色列科学基金会;
关键词
D O I
10.1186/1471-2105-9-471
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Detection of non coding RNA (ncRNA) molecules is a major bioinformatics challenge. This challenge is particularly difficult when attempting to detect H/ACA molecules which are involved in converting uridine to pseudouridine on rRNA in trypanosomes, because these organisms have unique H/ACA molecules (termed H/ACA-like) that lack several of the features that characterize H/ACA molecules in most other organisms. Results: We present here a computational tool called Psiscan, which was designed to detect H/ACA-like molecules in trypanosomes. We started by analyzing known H/ACA-like molecules and characterized their crucial elements both computationally and experimentally. Next, we set up constraints based on this analysis and additional phylogenic and functional data to rapidly scan three trypanosome genomes (T. brucei, T. cruzi and L. major) for sequences that observe these constraints and are conserved among the species. In the next step, we used minimal energy calculation to select the molecules that are predicted to fold into a lowest energy structure that is consistent with the constraints. In the final computational step, we used a Support Vector Machine that was trained on known H/ACA-like molecules as positive examples and on negative examples of molecules that were identified by the computational analyses but were shown experimentally not to be H/ACA-like molecules. The leading candidate molecules predicted by the SVM model were then subjected to experimental validation. Conclusion: The experimental validation showed 11 molecules to be expressed (4 out of 25 in the intermediate stage and 7 out of 19 in the final validation after the machine learning stage). Five of these 11 molecules were further shown to be bona fide H/ACA-like molecules. As snoRNA in trypanosomes are organized in clusters, the new H/ACA-like molecules could be used as starting points to manually search for additional molecules in their neighbourhood. All together this study increased our repertoire by fourteen H/ACA-like and six C/D snoRNAs molecules from T. brucei and L. Major. In addition the experimental analysis revealed that six ncRNA molecules that are expressed are not downregulated in CBF5 silenced cells, suggesting that they have structural features of H/ACA-like molecules but do not have their standard function. We termed this novel class of molecules AGA-like, and we are exploring their function. This study demonstrates the power of tight collaboration between computational and experimental approaches in a combined effort to reveal the repertoire of ncRNA molecles.
引用
收藏
页数:20
相关论文
共 73 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Identification of 66 box C/D snoRNAs in Arabidopsis thaliana:: Extensive gene duplications generated multiple isoforms predicting new ribosomal RNA 2′-O-methylation sites [J].
Barneche, F ;
Gaspin, C ;
Guyot, R ;
Echeverría, M .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 311 (01) :57-73
[3]   Elucidating the role of H/ACA-like RNAs in trans-splicing and rRNA processing via RNA interference silencing of the Trypanosoma brucei CBF5 pseudouridine synthase [J].
Barth, S ;
Hury, A ;
Liang, XH ;
Michaeli, S .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (41) :34558-34568
[4]   Elucidating the role of C/D snoRNA in rRNA processing and modification in Trypanosoma brucei [J].
Barth, Sarit ;
Shalem, Boaz ;
Hury, Avraham ;
Tkacz, Itai Dov ;
Liang, Xue-Hai ;
Uliel, Shai ;
Myslyuk, Inna ;
Doniger, Tirza ;
Salmon-Divon, Mali ;
Unger, Ron ;
Michaeli, Shulamit .
EUKARYOTIC CELL, 2008, 7 (01) :86-101
[5]   The genome of the African trypanosome Trypanosoma brucei [J].
Berriman, M ;
Ghedin, E ;
Hertz-Fowler, C ;
Blandin, G ;
Renauld, H ;
Bartholomeu, DC ;
Lennard, NJ ;
Caler, E ;
Hamlin, NE ;
Haas, B ;
Böhme, W ;
Hannick, L ;
Aslett, MA ;
Shallom, J ;
Marcello, L ;
Hou, LH ;
Wickstead, B ;
Alsmark, UCM ;
Arrowsmith, C ;
Atkin, RJ ;
Barron, AJ ;
Bringaud, F ;
Brooks, K ;
Carrington, M ;
Cherevach, I ;
Chillingworth, TJ ;
Churcher, C ;
Clark, LN ;
Corton, CH ;
Cronin, A ;
Davies, RM ;
Doggett, J ;
Djikeng, A ;
Feldblyum, T ;
Field, MC ;
Fraser, A ;
Goodhead, I ;
Hance, Z ;
Harper, D ;
Harris, BR ;
Hauser, H ;
Hostetter, J ;
Ivens, A ;
Jagels, K ;
Johnson, D ;
Johnson, J ;
Jones, K ;
Kerhornou, AX ;
Koo, H ;
Larke, N .
SCIENCE, 2005, 309 (5733) :416-422
[6]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[7]   Elements essential for accumulation and function of small nucleolar RNAs directing site-specific pseudouridylation of ribosomal RNAs [J].
Bortolin, ML ;
Ganot, P ;
Kiss, T .
EMBO JOURNAL, 1999, 18 (02) :457-469
[8]   A small nucleolar RNP protein is required for pseudouridylation of eukaryotic ribosomal RNAs [J].
BousquetAntonelli, C ;
Henry, Y ;
Gelugne, JP ;
CaizerguesFerrer, M ;
Kiss, T .
EMBO JOURNAL, 1997, 16 (15) :4770-4776
[9]   Plant snoRNAs: functional evolution and new modes of gene expression [J].
Brown, JWS ;
Echeverria, M ;
Qu, LH .
TRENDS IN PLANT SCIENCE, 2003, 8 (01) :42-49
[10]  
Brown JWS, 2001, RNA, V7, P1817