Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data

被引:67
作者
Wu, Yang [1 ,2 ]
Shi, Binbin [1 ,2 ]
Ding, Xinqiang [1 ,2 ]
Liu, Tong [1 ,2 ]
Hu, Xihao [3 ]
Yip, Kevin Y. [3 ]
Yang, Zheng Rong [4 ]
Mathews, David H. [5 ,6 ]
Lu, Zhi John [1 ,2 ]
机构
[1] Tsinghua Univ, Ctr Plant Biol, Ctr Synthet & Syst Biol, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Life Sci, Tsinghua Peking Joint Ctr Life Sci, Beijing 100084, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
[4] Univ Exeter, Sch Biosci, Exeter EX4 4QD, Devon, England
[5] Univ Rochester, Med Ctr, Dept Biochem & Biophys, Rochester, NY 14642 USA
[6] Univ Rochester, Med Ctr, Ctr RNA Biol, Rochester, NY 14642 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
DIMETHYL SULFATE; IN-VIVO; SHAPE; TRANSCRIPTOME; REVEALS; CONSTRAINTS; PARAMETERS; RESOLUTION; ALGORITHM; ACCURACY;
D O I
10.1093/nar/gkv706
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is optimized for SHAPE data, while SeqFold is optimized for PARS data. Here, we report a new RNA secondary structure prediction method, restrained MaxExpect (RME), which can incorporate multiple types of experimental probing data and is based on a free energy model and an MEA (maximizing expected accuracy) algorithm. We first demonstrated that RME substantially improved secondary structure prediction with perfect restraints (base pair information of known structures). Next, we collected structure-probing data from diverse experiments (e.g. SHAPE, PARS and DMS-seq) and transformed them into a unified set of pairing probabilities with a posterior probabilistic model. By using the probability scores as restraints in RME, we compared its secondary structure prediction performance with two other well-known tools, RNAstructure-Fold (based on a free energy minimization algorithm) and SeqFold (based on a sampling algorithm). For SHAPE data, RME and RNAstructure-Fold performed better than Se-qFold, because they markedly altered the energy model with the experimental restraints. For high-throughput data (e.g. PARS and DMS-seq) with lower probing efficiency, the secondary structure prediction performances of the tested tools were comparable, with performance improvements for only a portion of the tested RNAs. However, when the effects of tertiary structure and protein interactions were removed, RME showed the highest prediction accuracy in the DMS-accessible regions by incorporating in vivo DMS-seq data.
引用
收藏
页码:7247 / 7259
页数:13
相关论文
共 50 条
[21]   Genome-wide measurement of RNA secondary structure in yeast [J].
Kertesz, Michael ;
Wan, Yue ;
Mazor, Elad ;
Rinn, John L. ;
Nutter, Robert C. ;
Chang, Howard Y. ;
Segal, Eran .
NATURE, 2010, 467 (7311) :103-107
[22]   Pfold: RNA secondary structure prediction using stochastic context-free grammars [J].
Knudsen, B ;
Hein, J .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3423-3428
[23]   ViennaRNA Package 2.0 [J].
Lorenz, Ronny ;
Bernhart, Stephan H. ;
Siederdissen, Christian Hoener Zu ;
Tafer, Hakim ;
Flamm, Christoph ;
Stadler, Peter F. ;
Hofacker, Ivo L. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6
[24]   SHAPE-directed RNA secondary structure prediction [J].
Low, Justin T. ;
Weeks, Kevin M. .
METHODS, 2010, 52 (02) :150-158
[25]   Improved RNA secondary structure prediction by maximizing expected pair accuracy [J].
Lu, Zhi John ;
Gloor, Jason W. ;
Mathews, David H. .
RNA, 2009, 15 (10) :1805-1813
[26]   Multiplexed RNA structure characterization with selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) [J].
Lucks, Julius B. ;
Mortimer, Stefanie A. ;
Trapnell, Cole ;
Luo, Shujun ;
Aviran, Sharon ;
Schroth, Gary P. ;
Pachter, Lior ;
Doudna, Jennifer A. ;
Arkin, Adam P. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (27) :11063-11068
[27]   Revolutions in RNA secondary structure prediction [J].
Mathews, David H. .
JOURNAL OF MOLECULAR BIOLOGY, 2006, 359 (03) :526-532
[28]   Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure [J].
Mathews, DH ;
Disney, MD ;
Childs, JL ;
Schroeder, SJ ;
Zuker, M ;
Turner, DH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (19) :7287-7292
[29]   Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure [J].
Mathews, DH ;
Sabina, J ;
Zuker, M ;
Turner, DH .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 288 (05) :911-940
[30]   THE EQUILIBRIUM PARTITION-FUNCTION AND BASE PAIR BINDING PROBABILITIES FOR RNA SECONDARY STRUCTURE [J].
MCCASKILL, JS .
BIOPOLYMERS, 1990, 29 (6-7) :1105-1119