A pipeline of programs for collecting and analyzing group II intron retroelement sequences from GenBank

Cited by: 9
Authors
Abebe, Michael [1]
Candales, Manuel A. [1]
Duong, Adrian [1]
Hood, Keyar S. [1]
Li, Tony [1]
Neufeld, Ryan A. E. [1]
Shakenov, Abat [1]
Sun, Runda [1]
Wu, Li [1]
Jarding, Ashley M. [1]
Semper, Cameron [1]
Zimmerly, Steven [1]
Affiliations
[1] Univ Calgary, Dept Biol Sci, Calgary, AB T2N 1N4, Canada
Funding
Canadian Institutes of Health Research
Keywords
Bacteria; Genomes; Retroelement; Reverse transcriptase; Ribozyme; SELF-SPLICING INTRONS; REVERSE TRANSCRIPTASES; DIVERSITY; BACTERIA; DOMAIN
DOI
10.1186/1759-8753-4-28
Chinese Library Classification number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Background: Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated.
Results: Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative.
Conclusions: These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate.
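The redundancy-reduction step described in the Results can be illustrated with a minimal Python sketch. This is not the authors' program: the abstract does not say how identity is computed or how the representative is chosen, so the sketch assumes pre-aligned sequences of equal length, a simple column-by-column identity measure, and a greedy rule in which the first sequence seen in each group serves as its representative. The function names `percent_identity` and `group_by_identity` are hypothetical.

```python
from typing import Dict, List


def percent_identity(a: str, b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: columns where both sequences carry a gap ('-') are ignored;
    a gap opposite a residue counts as a mismatch.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def group_by_identity(
    seqs: Dict[str, str], threshold: float = 95.0
) -> Dict[str, List[str]]:
    """Greedy grouping keyed by representative name.

    Each sequence joins the first existing representative it matches at
    >= threshold percent identity; otherwise it starts a new group and
    becomes that group's representative.
    """
    groups: Dict[str, List[str]] = {}
    for name, seq in seqs.items():
        for rep in groups:
            if percent_identity(seq, seqs[rep]) >= threshold:
                groups[rep].append(name)
                break
        else:
            groups[name] = [name]
    return groups


if __name__ == "__main__":
    # Toy 20-nt sequences (illustrative only, not real intron data).
    example = {
        "intron_a": "ACGTACGTACGTACGTACGT",
        "intron_b": "ACGTACGTACGTACGTACGA",  # 1 mismatch vs intron_a
        "intron_c": "TCGTACGTACGTACGTACGA",  # 2 mismatches vs intron_a
    }
    for rep, members in group_by_identity(example).items():
        print(rep, members)
```

Because grouping is greedy, the result depends on input order; a real pipeline might instead sort by length or completeness before choosing representatives, which is a design choice the abstract leaves unspecified.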
Pages: 9