Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints

被引:90
|
作者
D Dowell, Robin
Eddy, Sean R.
机构
[1] Washington Univ, Sch Med, Howard Hughes Med Inst, St Louis, MO 63108 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[3] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
D O I
10.1186/1471-2105-7-400
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing with the computational complexity of the algorithm. Results: We use probabilistic models ( pair stochastic context- free grammars, pairSCFGs) as a unifying framework for scoring pairwise alignment and folding. A constrained version of the pairSCFG structural alignment algorithm was developed which assumes knowledge of a few confidently aligned positions ( pins). These pins are selected based on the posterior probabilities of a probabilistic pairwise sequence alignment. Conclusion: Pairwise RNA structural alignment improves on structure prediction accuracy relative to single sequence folding. Constraining on alignment is a straightforward method of reducing the runtime and memory requirements of the algorithm. Five practical implementations of the pairwise Sankoff algorithm - this work ( Consan), David Mathews' Dynalign, Ian Holmes' Stemloc, Ivo Hofacker's PMcomp, and Jan Gorodkin's FOLDALIGN - have comparable overall performance with different strengths and weaknesses.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Secure Cloud Computing for Pairwise Sequence Alignment
    Salinas, Sergio
    Li, Pan
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 178 - 183
  • [32] Chemical reaction optimization for pairwise sequence alignment
    Zhu, Xiangyuan
    Li, Jian
    Journal of Computational and Theoretical Nanoscience, 2015, 12 (12) : 5351 - 5357
  • [33] A Parallel Pairwise Local Sequence Alignment Algorithm
    Bandyopadhyay, Sanghamitra
    Mitra, Ramkrishna
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2009, 8 (02) : 139 - 146
  • [34] Pairwise sequence alignment below the twilight zone
    Blake, JD
    Cohen, FE
    JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) : 721 - 735
  • [35] A Tiling Bound for Pairwise Global Sequence Alignment
    Horton, Paul
    Frith, Martin
    ADVANCES IN SOFTWARE ENGINEERING, 2009, 30 : 93 - 98
  • [36] Adaptive Learning of Rank-One Models for Efficient Pairwise Sequence Alignment
    Kamath, Govinda M.
    Baharav, Tavor Z.
    Shomorony, Ilan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [37] An enhanced RNA alignment benchmark for sequence alignment programs
    Wilm, Andreas
    Mainz, Indra
    Steger, Gerhard
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)
  • [38] Memory-efficient dynamic programming backtrace and pairwise local sequence alignment
    Newberg, Lee A.
    BIOINFORMATICS, 2008, 24 (16) : 1772 - 1778
  • [39] An enhanced RNA alignment benchmark for sequence alignment programs
    Andreas Wilm
    Indra Mainz
    Gerhard Steger
    Algorithms for Molecular Biology, 1
  • [40] SRP-RNA SEQUENCE ALIGNMENT AND SECONDARY STRUCTURE
    LARSEN, N
    ZWIEB, C
    NUCLEIC ACIDS RESEARCH, 1991, 19 (02) : 209 - 215