Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA

被引:0
作者
Xu, Weijia [1 ]
Ozer, Stuart [2 ]
Gutell, Robin R. [3 ]
机构
[1] Univ Texas Austin, Texas Adv Comp Ctr, Austin, TX 78712 USA
[2] One Microsoft Way Redmond, Seattle, WA USA
[3] Univ Texas Austin, Ctr Computat Biol & Bioinformat, Austin, TX USA
来源
SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS | 2009年 / 5566卷
关键词
Biological database; Bioinformatics; Sequence Analysis; RNA; HIGHER-ORDER STRUCTURE; RIBOSOMAL-RNA; SECONDARY STRUCTURES; NONCODING RNAS; SEQUENCE; IDENTIFICATION; CONSTRAINTS; 16S-LIKE; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nueleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.
引用
收藏
页码:200 / +
页数:4
相关论文
共 37 条
[1]   Correlations among amino acid sites in bHLH protein domains: An information theoretic analysis [J].
Atchley, WR ;
Wollenberg, KR ;
Fitch, WM ;
Terhalle, W ;
Dress, AW .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) :164-178
[2]  
Benson DA, 2017, NUCLEIC ACIDS RES, V45, pD37, DOI [10.1093/nar/gkl986, 10.1093/nar/gkw1070, 10.1093/nar/gkg057, 10.1093/nar/gks1195, 10.1093/nar/gkp1024, 10.1093/nar/gkq1079, 10.1093/nar/gkr1202, 10.1093/nar/gkx1094, 10.1093/nar/gkn723]
[3]   The Comparative RNA Web (CRW) Site:: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs:: Correction (vol 3, pg 2, 2002) -: art. no. 15 [J].
Cannone, JJ ;
Subramanian, S ;
Schnare, MN ;
Collett, JR ;
D'Souza, LM ;
Du, YS ;
Feng, B ;
Lin, N ;
Madabusi, LV ;
Müller, KM ;
Pande, N ;
Shang, ZD ;
Yu, N ;
Gutell, RR .
BMC BIOINFORMATICS, 2002, 3 (1)
[4]   MSARI: Multiple sequence alignments for statistical detection of RNA secondary structure [J].
Coventry, A ;
Kleitman, DJ ;
Berger, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (33) :12102-12107
[5]   ddbRNA: detection of conserved secondary structures in multiple alignments [J].
di Bernardo, D ;
Down, T ;
Hubbard, T .
BIOINFORMATICS, 2003, 19 (13) :1606-1611
[6]   A model-based approach for detecting coevolving positions in a molecule [J].
Dutheil, J ;
Pupko, T ;
Jean-Marie, A ;
Galtier, N .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (09) :1919-1928
[7]  
ECKMAN B, 2004, IBM HLTH CARE LIFE S
[8]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[9]   IDENTIFICATION OF BASE-TRIPLES IN RNA USING COMPARATIVE SEQUENCE-ANALYSIS [J].
GAUTHERET, D ;
DAMBERGER, SH ;
GUTELL, RR .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 248 (01) :27-43
[10]   The accuracy of ribosomal RNA comparative structure models [J].
Gutell, RR ;
Lee, JC ;
Cannone, JJ .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (03) :301-310