5.8S-28S rRNA interaction and HMM-based ITS2 annotation

Cited by: 373
Authors
Keller, Alexander [1]
Schleicher, Tina [1]
Schultz, Joerg [1]
Mueller, Tobias [1]
Dandekar, Thomas [1]
Wolf, Matthias [1]
Affiliations
[1] Univ Wurzburg, Dept Bioinformat, Bioctr, D-97074 Wurzburg, Germany
Keywords
Hidden Markov models; Internal transcribed spacer 2; Non-coding RNA; Phylogenetics; Ribosomal RNA; Secondary structure; INTERNAL TRANSCRIBED SPACER-2; SECONDARY STRUCTURE; ITS2-PROXIMAL STEM; SEQUENCE DATABASES; COMMON CORE; IDENTIFICATION; 16S; SCENEDESMUS; PHYLOGENY; DOMAIN;
DOI
10.1016/j.gene.2008.10.012
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
The internal transcribed spacer 2 (ITS2) of the nuclear ribosomal repeat unit is one of the most commonly applied phylogenetic markers. It is a fast-evolving locus, which makes it appropriate for studies at low taxonomic levels, whereas its secondary structure is well conserved, and tree reconstructions are possible at higher taxonomic levels. However, annotation of start and end positions of the ITS2 differs markedly between studies. This is a severe shortcoming, as prediction of a correct secondary structure by standard ab initio folding programs requires accurate identification of the marker in question. Furthermore, the correct structure is essential for multiple sequence alignments based on individual structural features. The present study describes a new tool for the delimitation and identification of the ITS2. It is based on hidden Markov models (HMMs) and verifies annotations by comparison to a conserved structural motif in the 5.8S/28S rRNA regions. Our method was able to identify and delimit the ITS2 in more than 30 000 entries lacking start and end annotations in GenBank. Furthermore, 45 000 ITS2 sequences with a questionable annotation were re-annotated. Approximately 30 000 entries from the ITS2-DB, which uses a homology-based method for structure prediction, were re-annotated. We show that the method is able to correctly annotate an ITS2 as small as 58 nt from Giardia lamblia and an ITS2 as large as 1160 nt from humans. Thus, our method should be a valuable guide during the first and crucial step in any ITS2-based phylogenetic analysis: the delineation of the correct sequence. Sequences can be submitted to the following website for HMM-based ITS2 delineation: http://its2.bioapps.biozentrum.uni-wuerzburg.de. © 2008 Elsevier B.V. All rights reserved.
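The delimitation idea in the abstract (locate the flanking 5.8S and 28S regions, take what lies between them as the ITS2, and check that the flanks can pair) might be sketched roughly as follows. This is only an illustration under stated assumptions: the function names, the 0-based coordinate convention, and the simple pairing fraction are hypothetical and are not taken from the authors' tool, and the HMM search that would supply the flank coordinates is not shown.

```python
# Hypothetical sketch; not the authors' implementation.
# Watson-Crick pairs plus the G-U wobble pair, on an RNA alphabet.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def delimit_its2(seq, end_58s, start_28s):
    """Return the region between the 5.8S hit end and the 28S hit start.

    end_58s:   0-based index one past the last 5.8S base (assumed hit end).
    start_28s: 0-based index of the first 28S base (assumed hit start).
    """
    if not 0 <= end_58s < start_28s <= len(seq):
        raise ValueError("flanking hits must be ordered and inside the sequence")
    return seq[end_58s:start_28s]


def paired_fraction(flank_58s, flank_28s):
    """Fraction of positions that can pair when the flanks are aligned antiparallel."""
    a = flank_58s.upper().replace("T", "U")
    b = flank_28s.upper().replace("T", "U")[::-1]  # reverse for antiparallel pairing
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    return sum((x, y) in PAIRS for x, y in zip(a, b)) / n
```

A caller would pass coordinates from whatever flank search it uses, for example `delimit_its2(record_sequence, end_58s, start_28s)`, and could treat a low `paired_fraction` for the two flanks as a hint that the annotation deserves a second look. Any threshold for that decision would be a further assumption.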
Pages: 50-57
Page count: 8