ITERATIVE ESTIMATION OF STRUCTURES OF MULTIPLE RNA HOMOLOGS: TURBOFOLD

被引:181
作者
Sharma, Gaurav [1 ,2 ,3 ]
Harmanci, A. Ozgun [1 ]
Mathews, David H. [2 ,4 ]
机构
[1] Univ Rochester, Dept Elect & Comp Engn, Hopeman 204,RC Box 270126, Rochester, NY 14627 USA
[2] Univ Rochester, Med Ctr, Dept Biostat & Comput Biol, Rochester, NY 14642 USA
[3] Univ Rochester, Med Ctr, Dept Oncol, Rochester, NY 14642 USA
[4] Univ Rochester, Med Ctr, Dept Biochem & Biophys, Rochester, NY 14642 USA
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
RNA secondary structure; Turbo decoding; Iterative Estimation; STRUCTURE PREDICTION; SECONDARY STRUCTURE; PARTITION-FUNCTION; ALIGNMENT;
D O I
10.1109/smartgridcomm.2011.6102379
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
TurboFold, an iterative algorithm for estimating the common secondary structures of multiple RNA homologs, is presented. The algorithm is motivated by and has structure and attributes analogous to the turbo decoding algorithm in communications. Instead of solving the joint problem of aligning and folding multiple RNA sequences, TurboFold uses an iterative process to fold a collection of RNA homologs. Beneficial information from inter-sequence comparisons is incorporated by using feedback from iteration to iteration in the form of pseudo-prior probabilities for base pairing which are incorporated in the computation of base pairing probabilities. As a result TurboFold retains several of the advantages of join alignment and folding while maintaining a per iteration computational complexity comparable to single sequence RNA folding. Experimental evaluation of the algorithm, performed over six ncRNA families, demonstrates that TurboFold achieves high accuracy, offering better performance than available alternatives for estimating RNA base pairing probabilities.
引用
收藏
页码:529 / 532
页数:4
相关论文
共 18 条
[1]   RNAalifold: improved consensus structure prediction for RNA alignments [J].
Bernhart, Stephan H. ;
Hofacker, Ivo L. ;
Will, Sebastian ;
Gruber, Andreas R. ;
Stadler, Peter F. .
BMC BIOINFORMATICS, 2008, 9 (1)
[2]  
Berrou C., 1993, P IEEE INT C COMM MA, P1064, DOI DOI 10.1109/ICC.1993.397441
[3]  
Durbin R., 1999, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
[4]   Non-coding RNA genes and the modern RNA world [J].
Eddy, SR .
NATURE REVIEWS GENETICS, 2001, 2 (12) :919-929
[5]   COMPARATIVE-STUDIES OF RNA - INFERRING HIGHER-ORDER STRUCTURE FROM PATTERNS OF SEQUENCE VARIATION [J].
GUTELL, RR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1993, 3 (03) :313-322
[6]  
Harmanci A. O., 2010, TURBOFOLD ITER UNPUB
[7]  
Harmanci AO, 2007, INT CONF ACOUST SPEE, P365
[8]   Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign [J].
Harmanci, Arif Ozgun ;
Sharma, Gaurav ;
Mathews, David H. .
BMC BIOINFORMATICS, 2007, 8 (1)
[9]   Clustal W and clustal X version 2.0 [J].
Larkin, M. A. ;
Blackshields, G. ;
Brown, N. P. ;
Chenna, R. ;
McGettigan, P. A. ;
McWilliam, H. ;
Valentin, F. ;
Wallace, I. M. ;
Wilm, A. ;
Lopez, R. ;
Thompson, J. D. ;
Gibson, T. J. ;
Higgins, D. G. .
BIOINFORMATICS, 2007, 23 (21) :2947-2948
[10]   Using an RNA secondary structure partition function to determine confidence in base pairs predicted by free energy minimization [J].
Mathews, DH .
RNA, 2004, 10 (08) :1178-1190