Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems

被引:7
作者
Shang, Lei [1 ,2 ]
Gardner, David P. [1 ,2 ]
Xu, Weijia [3 ]
Cannone, Jamie J. [1 ,2 ]
Miranker, Daniel P. [4 ]
Ozer, Stuart [5 ]
Gutell, Robin R. [1 ,2 ]
机构
[1] Univ Texas Austin, Inst Cellular & Mol Biol, Austin, TX 78712 USA
[2] Univ Texas Austin, Ctr Computat Biol & Bioinformat, Austin, TX 78712 USA
[3] Univ Texas Austin, Texas Adv Comp Ctr, Austin, TX 78712 USA
[4] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[5] Microsoft Corp, Redmond, WA 98052 USA
来源
BMC SYSTEMS BIOLOGY | 2013年 / 7卷
基金
美国国家卫生研究院;
关键词
SECONDARY STRUCTURE PREDICTION; RIBOSOMAL-RNA; SEARCH; 16S; PARAMETERS; CLUSTAL;
D O I
10.1186/1752-0509-7-S4-S13
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The analysis of RNA sequences, once a small niche field for a small collection of scientists whose primary emphasis was the structure and function of a few RNA molecules, has grown most significantly with the realizations that 1) RNA is implicated in many more functions within the cell, and 2) the analysis of ribosomal RNA sequences is revealing more about the microbial ecology within all biological and environmental systems. The accurate and rapid alignment of these RNA sequences is essential to decipher the maximum amount of information from this data. Methods: Two computer systems that utilize the Gutell lab's RNA Comparative Analysis Database (rCAD) were developed to align sequences to an existing template alignment available at the Gutell lab's Comparative RNA Web (CRW) Site. Multiple dimensions of cross-indexed information are contained within the relational database rCAD, including sequence alignments, the NCBI phylogenetic tree, and comparative secondary structure information for each aligned sequence. The first program, CRWAlign-1 creates a phylogenetic-based sequence profile for each column in the alignment. The second program, CRWAlign-2 creates a profile based on phylogenetic, secondary structure, and sequence information. Both programs utilize their profiles to align new sequences into the template alignment. Results: The accuracies of the two CRWAlign programs were compared with the best template-based rRNA alignment programs and the best de-novo alignment programs. We have compared our programs with a total of eight alternative alignment methods on different sets of 16S rRNA alignments with sequence percent identities ranging from 50% to 100%. Both CRWAlign programs were superior to these other programs in accuracy and speed. Conclusions: Both CRWAlign programs can be used to align the very extensive amount of RNA sequencing that is generated due to the rapid next-generation sequencing technology. This latter technology is augmenting the new paradigm that RNA is intimately implicated in a significant number of functions within the cell. In addition, the use of bacterial 16S rRNA sequencing in the identification of the microbiome in many different environmental systems creates a need for rapid and highly accurate alignment of bacterial 16S rRNA sequences.
引用
收藏
页数:15
相关论文
共 28 条
  • [1] Protein alignment based on higher order conditional random fields for template-based modeling
    Morales-Cordovilia, Juan A.
    Sanchez, Victoria
    Ratajczak, Martin
    PLOS ONE, 2018, 13 (06):
  • [2] LoopIng: a template-based tool for predicting the structure of protein loops
    Messih, Mario Abdel
    Lepore, Rosalba
    Tramontano, Anna
    BIOINFORMATICS, 2015, 31 (23) : 3767 - 3772
  • [3] R-PASS: A Fast Structure-based RNA Sequence Alignment Algorithm
    Jiang, Yanan
    Xu, Weijia
    Thompson, Lee Parnell
    Gutell, Robin R.
    Miranker, Daniel P.
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 618 - 622
  • [4] Phylogenetic study of the genus Plasmodium based on the secondary structure-based alignment of the small subunit ribosomal RNA
    Escalante, AA
    Goldman, IF
    De Rijk, P
    De Wachter, R
    Collins, WE
    Qari, SH
    Lal, AA
    MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 1997, 90 (01) : 317 - 321
  • [5] SINA: Accurate high-throughput multiple sequence alignment of ribosomal RNA genes
    Pruesse, Elmar
    Peplies, Joerg
    Gloeckner, Frank Oliver
    BIOINFORMATICS, 2012, 28 (14) : 1823 - 1829
  • [6] rMSA: A Sequence Search and Alignment Algorithm to Improve RNA Structure Modeling
    Zhang, Chengxin
    Zhang, Yang
    Pyle, Anna Marie
    JOURNAL OF MOLECULAR BIOLOGY, 2023, 435 (14)
  • [7] Multi-objective two-level swarm intelligence approach for multiple RNA sequence-structure alignment
    Lalwani, Soniya
    Kumar, Rajesh
    Deep, Kusum
    SWARM AND EVOLUTIONARY COMPUTATION, 2017, 34 : 130 - 144
  • [8] RNA Sequence-structural Alignment Based on Quantum Evolutionary Algorithm
    Zhao Ying-Jie
    Wang Zheng-Zhi
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2009, 36 (09) : 1222 - 1228
  • [9] Two new mitogenomes of Picidae (Ayes, Piciformes): Sequence, structure and phylogenetic analyses
    Bi, De
    Ding, Hengwu
    Wang, Qingqing
    Jiang, Lan
    Lu, Wenkong
    Wu, Xuan
    Zhu, Ran
    Zeng, Juhua
    Zhou, Shoubiao
    Yang, Xiaojun
    Kan, Xianzhao
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2019, 133 : 683 - 692
  • [10] RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
    Bayegan, Amir H.
    Clote, Peter
    PLOS ONE, 2020, 15 (01):