Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints

被引:90
|
作者
D Dowell, Robin
Eddy, Sean R.
机构
[1] Washington Univ, Sch Med, Howard Hughes Med Inst, St Louis, MO 63108 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[3] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
D O I
10.1186/1471-2105-7-400
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing with the computational complexity of the algorithm. Results: We use probabilistic models ( pair stochastic context- free grammars, pairSCFGs) as a unifying framework for scoring pairwise alignment and folding. A constrained version of the pairSCFG structural alignment algorithm was developed which assumes knowledge of a few confidently aligned positions ( pins). These pins are selected based on the posterior probabilities of a probabilistic pairwise sequence alignment. Conclusion: Pairwise RNA structural alignment improves on structure prediction accuracy relative to single sequence folding. Constraining on alignment is a straightforward method of reducing the runtime and memory requirements of the algorithm. Five practical implementations of the pairwise Sankoff algorithm - this work ( Consan), David Mathews' Dynalign, Ian Holmes' Stemloc, Ivo Hofacker's PMcomp, and Jan Gorodkin's FOLDALIGN - have comparable overall performance with different strengths and weaknesses.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints
    Robin D Dowell
    Sean R Eddy
    BMC Bioinformatics, 7
  • [2] Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign
    Harmanci, Arif Ozgun
    Sharma, Gaurav
    Mathews, David H.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [3] Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign
    Arif Ozgun Harmanci
    Gaurav Sharma
    David H Mathews
    BMC Bioinformatics, 8
  • [4] Efficient Alignment of RNAs with Pseudoknots Using Sequence Alignment Constraints
    Yoon, Byung-Jun
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2009, (01)
  • [5] RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment
    Bayegan, Amir H.
    Clote, Peter
    PLOS ONE, 2020, 15 (01):
  • [6] Multi-objective pairwise RNA sequence alignment
    Taneda, Akito
    BIOINFORMATICS, 2010, 26 (19) : 2383 - 2390
  • [7] Time and Space Efficient Optimal Pairwise Sequence Alignment using GPU
    Sarkar, Ardhendu
    Ray, Kinjal
    Chowdhury, Debaroti
    Sahu, Kishan
    Kundu, Solanki
    Ghosh, Surajeet
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 423 - 428
  • [8] An Efficient Processing Element Architecture for Pairwise Sequence Alignment
    Isa, M. N.
    Murad, S. A. Z.
    Ismail, R. C.
    Ahmad, M. I.
    Jambek, A. B.
    Kamil, M. K. Md
    2014 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC DESIGN (ICED), 2014, : 461 - 464
  • [9] ALIGN_MTX-An optimal pairwise textual sequence alignment program, adapted for using in sequence-structure alignment
    Vishnepolsky, Boris
    Pirtskhalava, Malak
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2009, 33 (03) : 235 - 238
  • [10] Extended Pairwise Sequence Alignment
    Araujo, Eloi
    Martinez, Fabio V.
    Rozante, Luiz C.
    Almeida, Nalvo F.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2023, PT I, 2023, 13956 : 218 - 230