共 50 条
The Trouble with Long-Range Base Pairs in RNA Folding
被引:0
|作者:
Amman, Fabian
[1
]
Bernhart, Stephan H.
[1
]
Doose, Gero
[1
,2
]
Hofacker, Ivo L.
[3
,5
,8
]
Qin, Jing
[1
,4
]
Stadler, Peter F.
[1
,2
,3
,4
,5
,6
,7
]
Will, Sebastian
[1
]
机构:
[1] Univ Leipzig, Dept Comp Sci, Hartelstr 16-18, Leipzig, Germany
[2] Univ Leipzig, Leipzing Res Ctr Civilizat Dis, LIFE, D-04107 Leipzig, Germany
[3] Univ Vienna, Dept Theoret Chem, Vienna, Austria
[4] MPI Math Sci, Leipzig, Germany
[5] Univ Copenhagen, RTH, Frederiksberg, Denmark
[6] FHI Cell Therapy & Immunol, Leipzig, Germany
[7] Santa Fe Inst, Santa Fe, NM USA
[8] Univ Vienna, Bioinformat & Computat Biol Res Grp, A-1090 Vienna, Austria
来源:
关键词:
RNA folding;
long-range base pair;
prediction accuracy;
polymer zeta property;
SECONDARY STRUCTURE PREDICTION;
PROBABILITIES;
SEQUENCES;
DISTANCE;
DATABASE;
ENDS;
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
RNA prediction has long been struggling with long-range base pairs since prediction accuracy decreases with base pair span. We analyze here the empirical distribution of base pair spans in large collection of experimentally known RNA structures. Surprisingly, we find that long-range base pairs are overrepresented in these data. In particular, there is no evidence that long-range base pairs are systematically over-predicted relative to short-range interactions in thermodynamic predictions. This casts doubt on a recent suggestion that kinetic effects are the cause of length-dependent decrease of predictability. Instead of a modification of the energy model we advocate a modification of the expected accuracy model for RNA secondary structures. We demonstrate that the inclusion of a span-dependent penalty leads to improved maximum expected accuracy structure predictions compared to both the standard MEA model and a modified folding algorithm with an energy penalty function. The prevalence of long-range base pairs provide further evidence that RNA structures in general do not have the so-called polymer zeta property. This has consequences for the asymptotic performance for a large class of sparsified RNA folding algorithms.
引用
收藏
页码:1 / 11
页数:11
相关论文