Prediction of RNA secondary structure including pseudoknots for long sequences

被引:27
作者
Sato, Kengo [1 ]
Kato, Yuki [2 ,3 ]
机构
[1] Keio Univ, Dept Biosci & Informat, Tokyo, Japan
[2] Osaka Univ, Grad Sch Med, Dept RNA Biol & Neurosci, Suita, Osaka, Japan
[3] Osaka Univ, Integrated Frontier Res Med Sci Div, Inst Open & Transdisciplinary Res Initiat, Suita, Osaka, Japan
基金
日本学术振兴会;
关键词
RNA secondary structure prediction; pseudoknots; integer programming;
D O I
10.1093/bib/bbab395
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
RNA structural elements called pseudoknots are involved in various biological phenomena including ribosomal frameshifts. Because it is infeasible to construct an efficiently computable secondary structure model including pseudoknots, secondary structure prediction methods considering pseudoknots are not yet widely available. We developed IPknot, which uses heuristics to speed up computations, but it has remained difficult to apply it to long sequences, such as messenger RNA and viral RNA, because it requires cubic computational time with respect to sequence length and has threshold parameters that need to be manually adjusted. Here, we propose an improvement of IPknot that enables calculation in linear time by employing the LinearPartition model and automatically selects the optimal threshold parameters based on the pseudo-expected accuracy. In addition, IPknot showed favorable prediction accuracy across a wide range of conditions in our exhaustive benchmarking, not only for single sequences but also for multiple alignments.
引用
收藏
页数:9
相关论文
共 36 条
[1]   Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots [J].
Akutsu, T .
DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) :45-62
[2]   RNA STRAND: The RNA secondary structure and statistical analysis database [J].
Andronescu, Mirela ;
Bereg, Vera ;
Hoos, Holger H. ;
Condon, Anne .
BMC BIOINFORMATICS, 2008, 9 (1)
[3]   ProbKnot: Fast prediction of RNA secondary structure including pseudoknots [J].
Bellaousov, Stanislav ;
Mathews, David H. .
RNA, 2010, 16 (10) :1870-1880
[4]   Viral RNA pseudoknots: versatile motifs in gene expression and replication [J].
Brierley, Ian ;
Pennell, Simon ;
Gilbert, Robert J. C. .
NATURE REVIEWS MICROBIOLOGY, 2007, 5 (08) :598-610
[5]   The Comparative RNA Web (CRW) Site:: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs -: art. no. 2 [J].
Cannone, JJ ;
Subramanian, S ;
Schnare, MN ;
Collett, JR ;
D'Souza, LM ;
Du, YS ;
Feng, B ;
Lin, N ;
Madabusi, LV ;
Müller, KM ;
Pande, N ;
Shang, ZD ;
Yu, N ;
Gutell, RR .
BMC BIOINFORMATICS, 2002, 3 (1)
[6]   FlexStem: improving predictions of RNA secondary structures with pseudoknots by reducing the search space [J].
Chen, Xiang ;
He, Si-Min ;
Bu, Dongbo ;
Zhang, Fa ;
Wang, Zhiyong ;
Chen, Runsheng ;
Gao, Wen .
BIOINFORMATICS, 2008, 24 (18) :1994-2001
[7]   bpRNA: large-scale automated annotation and analysis of RNA secondary structure [J].
Danaee, Padideh ;
Rouches, Mason ;
Wiley, Michelle ;
Deng, Dezhong ;
Huang, Liang ;
Hendrix, David .
NUCLEIC ACIDS RESEARCH, 2018, 46 (11) :5381-5394
[8]   CONTRAfold: RNA secondary structure prediction without physics-based models [J].
Do, Chuong B. ;
Woods, Daniel A. ;
Batzoglou, Serafim .
BIOINFORMATICS, 2006, 22 (14) :E90-E98
[9]   CD-HIT: accelerated for clustering the next-generation sequencing data [J].
Fu, Limin ;
Niu, Beifang ;
Zhu, Zhengwei ;
Wu, Sitao ;
Li, Weizhong .
BIOINFORMATICS, 2012, 28 (23) :3150-3152
[10]   Improving the accuracy of predicting secondary structure for aligned RNA sequences [J].
Hamada, Michiaki ;
Sato, Kengo ;
Asai, Kiyoshi .
NUCLEIC ACIDS RESEARCH, 2011, 39 (02) :393-402