Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints

被引:90
|
作者
D Dowell, Robin
Eddy, Sean R.
机构
[1] Washington Univ, Sch Med, Howard Hughes Med Inst, St Louis, MO 63108 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[3] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
D O I
10.1186/1471-2105-7-400
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: We are interested in the problem of predicting secondary structure for small sets of homologous RNAs, by incorporating limited comparative sequence information into an RNA folding model. The Sankoff algorithm for simultaneous RNA folding and alignment is a basis for approaches to this problem. There are two open problems in applying a Sankoff algorithm: development of a good unified scoring system for alignment and folding and development of practical heuristics for dealing with the computational complexity of the algorithm. Results: We use probabilistic models ( pair stochastic context- free grammars, pairSCFGs) as a unifying framework for scoring pairwise alignment and folding. A constrained version of the pairSCFG structural alignment algorithm was developed which assumes knowledge of a few confidently aligned positions ( pins). These pins are selected based on the posterior probabilities of a probabilistic pairwise sequence alignment. Conclusion: Pairwise RNA structural alignment improves on structure prediction accuracy relative to single sequence folding. Constraining on alignment is a straightforward method of reducing the runtime and memory requirements of the algorithm. Five practical implementations of the pairwise Sankoff algorithm - this work ( Consan), David Mathews' Dynalign, Ian Holmes' Stemloc, Ivo Hofacker's PMcomp, and Jan Gorodkin's FOLDALIGN - have comparable overall performance with different strengths and weaknesses.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Assessing secondary structure assignment of protein structures by using pairwise sequence-alignment benchmarks
    Zhang, Wei
    Dunker, A. Keith
    Zhou, Yaoqi
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) : 61 - 67
  • [42] A Hybrid Flow for Multiple Sequence Alignment with a BLASTn Based Pairwise Alignment Processor
    Lin, Mao-Jan
    Chang, Chih-Yu
    Li, Yu-Cheng
    Chen, Nae-Chyun
    Lu, Yi-Chang
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [43] Protein structure prediction using a combination of sequence-based alignment, constrained energy minimization, and structural alignment
    Standley, DM
    Eyrich, VA
    An, YL
    Pincus, DL
    Gunn, JR
    Friesner, RA
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, : 133 - 139
  • [44] Pairwise Sequence Alignment of Biological Database using Soft Computing Approach
    Kaur, Harleen
    Chand, Lal
    2016 5TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2016, : 72 - 77
  • [45] A cascaded pairwise biomolecular sequence alignment technique using evolutionary algorithm
    Garai, Gautam
    Chowdhury, Biswanath
    INFORMATION SCIENCES, 2015, 297 : 118 - 139
  • [46] The Detection and Assessment of Possible RNA Secondary Structure Using Multiple Sequence Alignment
    Fang, Xiaoyong
    Wang, Zhenghua
    Luo, Zhigang
    Yuan, Bo
    Ding, Fan
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 133 - +
  • [47] THE BACTERIAL PORIN SUPERFAMILY - SEQUENCE ALIGNMENT AND STRUCTURE PREDICTION
    JEANTEUR, D
    LAKEY, JH
    PATTUS, F
    MOLECULAR MICROBIOLOGY, 1991, 5 (09) : 2153 - 2164
  • [48] An Efficient Progressive Alignment Algorithm for Multiple Sequence Alignment
    Lakshmi, P. V.
    Rao, Allam Appa
    Sridhar, G. R.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (10): : 301 - 305
  • [49] Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%
    Havgaard, JH
    Lyngso, RB
    Stormo, GD
    Gorodkin, J
    BIOINFORMATICS, 2005, 21 (09) : 1815 - 1824
  • [50] Super Pairwise Alignment (SPA): An efficient approach to global alignment for homologous sequences
    Shen, SY
    Yang, J
    Yao, A
    Hwang, PI
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (03) : 477 - 486