Prediction of RNA secondary structure with pseudoknots using integer programming

被引:16
作者
Poolsap, Unyanee [1 ]
Kato, Yuki [1 ]
Akutsu, Tatsuya [1 ]
机构
[1] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Kyoto 6110011, Japan
关键词
TREE ADJOINING GRAMMARS; SEQUENCES; ALGORITHM;
D O I
10.1186/1471-2105-10-S1-S38
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: RNA secondary structure prediction is one major task in bioinformatics, and various computational methods have been proposed so far. Pseudoknot is one of the typical substructures appearing in several RNAs, and plays an important role in some biological processes. Prediction of RNA secondary structure with pseudoknots is still challenging since the problem is NP-hard when arbitrary pseudoknots are taken into consideration. Results: We introduce a new method of predicting RNA secondary structure with pseudoknots based on integer programming. In our formulation, we aim at minimizing the value of the objective function that reflects free energy of a folding structure of an input RNA sequence. We focus on a practical class of pseudoknots by setting constraints appropriately. Experimental results for a set of real RNA sequences show that our proposed method outperforms several existing methods in sensitivity. Furthermore, for a set of sequences of small length, our approach achieved good performance in both sensitivity and specificity. Conclusion: Our integer programming-based approach for RNA structure prediction is flexible and extensible.
引用
收藏
页数:11
相关论文
共 21 条
[1]   Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots [J].
Akutsu, T .
DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) :45-62
[2]  
[Anonymous], ILOG CPLEX
[3]   Stochastic modeling of RNA pseudoknotted structures: a grammatical approach [J].
Cai, Liming ;
Malmberg, Russell L. ;
Wu, Yunzhou .
BIOINFORMATICS, 2003, 19 :i66-i73
[4]   Rfam: annotating non-coding RNAs in complete genomes [J].
Griffiths-Jones, S ;
Moxon, S ;
Marshall, M ;
Khanna, A ;
Eddy, SR ;
Bateman, A .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D121-D124
[5]  
Kato Y., 2006, IPSJ T BIOINFORMATIC, V47, P12
[6]   RNA pseudoknot prediction in energy-based models [J].
Lyngso, RB ;
Pedersen, CNS .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :409-427
[7]   Dynalign: An algorithm for finding the secondary structure common to two RNA sequences [J].
Mathews, DH ;
Turner, DH .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 317 (02) :191-203
[8]   Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures [J].
Matsui, H ;
Sato, K ;
Sakakibara, Y .
BIOINFORMATICS, 2005, 21 (11) :2611-2617
[9]   The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data [J].
Parisien, Marc ;
Major, Francois .
NATURE, 2008, 452 (7183) :51-55
[10]   Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics [J].
Reeder, J ;
Giegerich, R .
BMC BIOINFORMATICS, 2004, 5 (1)