Improving the accuracy of predicting secondary structure for aligned RNA sequences

被引:44
作者
Hamada, Michiaki [1 ,2 ]
Sato, Kengo [3 ]
Asai, Kiyoshi [2 ,3 ]
机构
[1] Mizuho Informat & Res Inst Inc, Chiyoda Ku, Tokyo 1018443, Japan
[2] Natl Inst Adv Ind Sci & Technol, Computat Biol Res Ctr, Koto Ku, Tokyo 1350064, Japan
[3] Univ Tokyo, Grad Sch Frontier Sci, Chiba 2778562, Japan
关键词
NONCODING RNAS; WEB SERVER; ALIGNMENT; IMPROVEMENT; RFAM;
D O I
10.1093/nar/gkq792
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Considerable attention has been focused on predicting the secondary structure for aligned RNA sequences since it is useful not only for improving the limiting accuracy of conventional secondary structure prediction but also for finding non-coding RNAs in genomic sequences. Although there exist many algorithms of predicting secondary structure for aligned RNA sequences, further improvement of the accuracy is still awaited. In this article, toward improving the accuracy, a theoretical classification of state-of-the-art algorithms of predicting secondary structure for aligned RNA sequences is presented. The classification is based on the viewpoint of maximum expected accuracy (MEA), which has been successfully applied in various problems in bioinformatics. The classification reveals several disadvantages of the current algorithms but we propose an improvement of a previously introduced algorithm (CentroidAlifold). Finally, computational experiments strongly support the theoretical classification and indicate that the improved CentroidAlifold substantially outperforms other algorithms.
引用
收藏
页码:393 / 402
页数:10
相关论文
共 44 条
[1]   RNA STRAND: The RNA secondary structure and statistical analysis database [J].
Andronescu, Mirela ;
Bereg, Vera ;
Hoos, Holger H. ;
Condon, Anne .
BMC BIOINFORMATICS, 2008, 9 (1)
[2]  
Bernhart Stephan H., 2009, Briefings in Functional Genomics & Proteomics, V8, P461, DOI 10.1093/bfgp/elp043
[3]   RNAalifold: improved consensus structure prediction for RNA alignments [J].
Bernhart, Stephan H. ;
Hofacker, Ivo L. ;
Will, Sebastian ;
Gruber, Andreas R. ;
Stadler, Peter F. .
BMC BIOINFORMATICS, 2008, 9 (1)
[4]   Fast Statistical Alignment [J].
Bradley, Robert K. ;
Roberts, Adam ;
Smoot, Michael ;
Juvekar, Sudeep ;
Do, Jaeyoung ;
Dewey, Colin ;
Holmes, Ian ;
Pachter, Lior .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (05)
[5]   Specific alignment of structured RNA: stochastic grammars and sequence annealing [J].
Bradley, Robert K. ;
Pachter, Lior ;
Holmes, Ian .
BIOINFORMATICS, 2008, 24 (23) :2677-2683
[6]   Centroid estimation in discrete high-dimensional spaces with applications in biology [J].
Carvalho, Luis E. ;
Lawrence, Charles E. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (09) :3209-3214
[7]   RNA secondary structure in the coding region of dengue virus type 2 directs translation start codon selection and is required for viral replication [J].
Clyde, K ;
Harris, E .
JOURNAL OF VIROLOGY, 2006, 80 (05) :2170-2182
[8]   A statistical sampling algorithm for RNA secondary structure prediction [J].
Ding, Y ;
Lawrence, CE .
NUCLEIC ACIDS RESEARCH, 2003, 31 (24) :7280-7301
[9]   ProbCons: Probabilistic consistency-based multiple sequence alignment [J].
Do, CB ;
Mahabhashyam, MSP ;
Brudno, M ;
Batzoglou, S .
GENOME RESEARCH, 2005, 15 (02) :330-340
[10]   CONTRAfold: RNA secondary structure prediction without physics-based models [J].
Do, Chuong B. ;
Woods, Daniel A. ;
Batzoglou, Serafim .
BIOINFORMATICS, 2006, 22 (14) :E90-E98