A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures

Cited by: 30
Authors
Jabbari, Hosna [1]
Condon, Anne [1]
Affiliations
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V5Z 1M9, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
RNA; Secondary structure prediction; Pseudoknot; Hierarchical folding; Minimum free energy; DYNAMIC-PROGRAMMING ALGORITHM; PARTITION-FUNCTION; TRANSLATION; SERVER;
DOI
10.1186/1471-2105-15-147
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating known structural information can improve prediction accuracy; however, there are not many methods for prediction of pseudoknotted structures that can incorporate structural information as input. There is even less understanding of the relative robustness of these methods with respect to partial information. Results: We present a new method, Iterative HFold, for pseudoknotted RNA secondary structure prediction. Iterative HFold takes as input a pseudoknot-free structure, and produces a possibly pseudoknotted structure whose energy is at least as low as that of any (density-2) pseudoknotted structure containing the input structure. Iterative HFold leverages strengths of earlier methods, namely the fast running time of HFold, a method that is based on the hierarchical folding hypothesis, and the energy parameters of HotKnots V2.0. Our experimental evaluation on a large data set shows that Iterative HFold is robust with respect to partial information, with average accuracy on pseudoknotted structures steadily increasing from roughly 54% to 79% as the user provides up to 40% of the input structure. Iterative HFold is much faster than HotKnots V2.0, while having comparable accuracy. Iterative HFold also has significantly better accuracy than IPknot on our HK-PK and IP-pk168 data sets. Conclusions: Iterative HFold is a robust method for prediction of pseudoknotted RNA secondary structures, whose accuracy with more than 5% information about true pseudoknot-free structures is better than that of IPknot, and with about 35% information about true pseudoknot-free structures compares well with that of HotKnots V2.0 while being significantly faster. 
Iterative HFold and all data used in this work are freely available at http://www.cs.ubc.ca/~hjabbari/software.php.
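The abstract distinguishes pseudoknot-free input structures from possibly pseudoknotted output structures. As a minimal illustration of that distinction only (not the Iterative HFold algorithm, and not code from the authors), the sketch below assumes structures are written in dot-bracket notation with `()` and `[]`, and treats a structure as pseudoknotted when two base pairs (i, j) and (k, l) cross, i.e. i < k < j < l. The function names `parse_pairs` and `is_pseudoknotted` are hypothetical.

```python
from typing import List, Tuple

Pair = Tuple[int, int]


def parse_pairs(dot_bracket: str) -> List[Pair]:
    """Return base pairs (i, j) with i < j from a dot-bracket string.

    Assumption: only '.', '()', and '[]' appear; each bracket type is
    matched on its own stack, so '[' pairs may cross '(' pairs.
    """
    closer_to_opener = {")": "(", "]": "["}
    stacks = {"(": [], "[": []}
    pairs: List[Pair] = []
    for pos, ch in enumerate(dot_bracket):
        if ch in stacks:
            stacks[ch].append(pos)
        elif ch in closer_to_opener:
            stack = stacks[closer_to_opener[ch]]
            if not stack:
                raise ValueError(f"unmatched '{ch}' at position {pos}")
            pairs.append((stack.pop(), pos))
        elif ch != ".":
            raise ValueError(f"unexpected character '{ch}' at position {pos}")
    if any(stacks.values()):
        raise ValueError("unmatched opening bracket")
    return sorted(pairs)


def is_pseudoknotted(pairs: List[Pair]) -> bool:
    """True if any two base pairs cross: i < k < j < l."""
    ordered = sorted(pairs)
    for idx, (i, j) in enumerate(ordered):
        for k, l in ordered[idx + 1:]:
            if i < k < j < l:
                return True
    return False


if __name__ == "__main__":
    nested = parse_pairs("((..))")            # pseudoknot-free
    crossing = parse_pairs("((..[[..))..]]")  # '[' pairs cross '(' pairs
    print(is_pseudoknotted(nested), is_pseudoknotted(crossing))
```

The quadratic pair comparison is chosen for readability; it is adequate for short sequences but is not meant to reflect how a folding program represents structures internally.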
Pages: 17