An efficient algorithm to compute the landscape of locally optimal RNA secondary structures with respect to the Nussinov-Jacobson energy model

被引:26
作者
Clote, P [1 ]
机构
[1] Boston Coll, Dept Biol, Dept Comp Sci, Chestnut Hill, MA 02467 USA
关键词
RNA; secondary structure; energy landscape; local optimum;
D O I
10.1089/cmb.2005.12.83
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We make a novel contribution to the theory of biopolymer folding, by developing an efficient algorithm to compute the number of locally optimal secondary structures of an RNA molecule, with respect to the Nussinov-Jacobson energy model. Additionally, we apply our algorithm to analyze the folding landscape of selenocysteine insertion sequence (SECIS) elements from A. Bock (personal communication), hammerhead ribozymes from Rfam (Griffiths-Jones et al., 2003), and tRNAs from Sprinzl's database (Sprinzl et al., 1998). It had previously been reported that IRNA has lower minimum free energy than random RNA of the same compositional frequency (Clote et al., 2003; Rivas and Eddy, 2000), although the situation is less clear for mRNA (Seffens and Digby, 1999; Workman and Krogh, 1999; Cohen and Skienna, 2002),(1) which plays no structural role. Applications of our algorithm extend knowledge of the energy landscape differences between naturally occurring and random RNA. Given an RNA molecule a(1),...,a(n) and an integer k greater than or equal to 0, a k-locally optimal secondary structure S is a secondary structure on a(1),...,a(n) which,has k fewer base pairs than the maximum possible number, yet for which no basepairs can be added without violation of the definition of secondary structure (e.g., introducing a pseudoknot). Despite the fact that the number numStr(k) of k-locally optimal structures for a given RNA molecule in general is exponential in n, we present an algorithm running in time 0(n(4)) and space 0(n(3)), which computes numStr(k) for each k. Structurally important RNA, such as SECIS elements, hammerhead ribozymes, and tRNA, all have a markedly smaller number of k-locally optimal structures than that of random RNA of the same dinucleotide frequency, for small and moderate values of k. This suggests a potential future role of our algorithm as a tool to detect noncoding RNA genes.
引用
收藏
页码:83 / 101
页数:19
相关论文
共 34 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]   Protein folding in the hydrophobic-hydrophilic (HP) model is NP-complete [J].
Berger, B ;
Leighton, T .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (01) :27-40
[3]   SPIN-GLASSES AND THE STATISTICAL-MECHANICS OF PROTEIN FOLDING [J].
BRYNGELSON, JD ;
WOLYNES, PG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (21) :7524-7528
[4]   COMPACT POLYMERS [J].
CHAN, HS ;
DILL, KA .
MACROMOLECULES, 1989, 22 (12) :4559-4573
[5]  
CLOTE P, 2003, CURRENTS COMPUT MOL, P149
[6]  
Clote P., 2000, COMPUTATIONAL MOL BI
[7]  
CLOTE P, 1999, SPRINGER LECT NOTES, V1644, P240
[8]   Natural selection and algorithmic design of mRNA [J].
Cohen, B ;
Skiena, S .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (3-4) :419-432
[9]   On the complexity of protein folding [J].
Crescenzi, P ;
Goldman, D ;
Papadimitriou, C ;
Piccolboni, A ;
Yannakakis, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (03) :423-465
[10]  
CUPAL J, 1996, P GERM C BIOINF, P184