Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data

被引:67
作者
Wu, Yang [1 ,2 ]
Shi, Binbin [1 ,2 ]
Ding, Xinqiang [1 ,2 ]
Liu, Tong [1 ,2 ]
Hu, Xihao [3 ]
Yip, Kevin Y. [3 ]
Yang, Zheng Rong [4 ]
Mathews, David H. [5 ,6 ]
Lu, Zhi John [1 ,2 ]
机构
[1] Tsinghua Univ, Ctr Plant Biol, Ctr Synthet & Syst Biol, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Sch Life Sci, Tsinghua Peking Joint Ctr Life Sci, Beijing 100084, Peoples R China
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
[4] Univ Exeter, Sch Biosci, Exeter EX4 4QD, Devon, England
[5] Univ Rochester, Med Ctr, Dept Biochem & Biophys, Rochester, NY 14642 USA
[6] Univ Rochester, Med Ctr, Ctr RNA Biol, Rochester, NY 14642 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
DIMETHYL SULFATE; IN-VIVO; SHAPE; TRANSCRIPTOME; REVEALS; CONSTRAINTS; PARAMETERS; RESOLUTION; ALGORITHM; ACCURACY;
D O I
10.1093/nar/gkv706
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is optimized for SHAPE data, while SeqFold is optimized for PARS data. Here, we report a new RNA secondary structure prediction method, restrained MaxExpect (RME), which can incorporate multiple types of experimental probing data and is based on a free energy model and an MEA (maximizing expected accuracy) algorithm. We first demonstrated that RME substantially improved secondary structure prediction with perfect restraints (base pair information of known structures). Next, we collected structure-probing data from diverse experiments (e.g. SHAPE, PARS and DMS-seq) and transformed them into a unified set of pairing probabilities with a posterior probabilistic model. By using the probability scores as restraints in RME, we compared its secondary structure prediction performance with two other well-known tools, RNAstructure-Fold (based on a free energy minimization algorithm) and SeqFold (based on a sampling algorithm). For SHAPE data, RME and RNAstructure-Fold performed better than Se-qFold, because they markedly altered the energy model with the experimental restraints. For high-throughput data (e.g. PARS and DMS-seq) with lower probing efficiency, the secondary structure prediction performances of the tested tools were comparable, with performance improvements for only a portion of the tested RNAs. However, when the effects of tertiary structure and protein interactions were removed, RME showed the highest prediction accuracy in the DMS-accessible regions by incorporating in vivo DMS-seq data.
引用
收藏
页码:7247 / 7259
页数:13
相关论文
共 50 条
[1]   RNAstructure: web servers for RNA secondary structure prediction and analysis [J].
Bellaousov, Stanislav ;
Reuter, Jessica S. ;
Seetin, Matthew G. ;
Mathews, David H. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W471-W474
[2]   The Structure of the Eukaryotic Ribosome at 3.0 Å Resolution [J].
Ben-Shem, Adam ;
de Loubresse, Nicolas Garreau ;
Melnikov, Sergey ;
Jenner, Lasse ;
Yusupova, Gulnara ;
Yusupov, Marat .
SCIENCE, 2011, 334 (6062) :1524-1529
[3]   Correlating SHAPE signatures with three-dimensional RNA structures [J].
Bindewald, Eckart ;
Wendeler, Michaela ;
Legiewicz, Michal ;
Bona, Marion K. ;
Wang, Yi ;
Pritt, Mark J. ;
Le Grice, Stuart F. J. ;
Shapiro, Bruce A. .
RNA, 2011, 17 (09) :1688-1696
[4]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[5]   An RNA Mapping DataBase for curating RNA structure mapping experiments [J].
Cordero, Pablo ;
Lucks, Julius B. ;
Das, Rhiju .
BIOINFORMATICS, 2012, 28 (22) :3006-3008
[6]   Quantitative Dimethyl Sulfate Mapping for Automated RNA Secondary Structure Inference [J].
Cordero, Pablo ;
Kladwang, Wipapat ;
VanLang, Christopher C. ;
Das, Rhiju .
BIOCHEMISTRY, 2012, 51 (36) :7037-7039
[7]   Accurate SHAPE-directed RNA structure determination [J].
Deigan, Katherine E. ;
Li, Tian W. ;
Mathews, David H. ;
Weeks, Kevin M. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (01) :97-102
[8]   RNA secondary structure prediction by centroids in a Boltzmann weighted ensemble [J].
Ding, Y ;
Chan, CY ;
Lawrence, CE .
RNA, 2005, 11 (08) :1157-1166
[9]   A statistical sampling algorithm for RNA secondary structure prediction [J].
Ding, Y ;
Lawrence, CE .
NUCLEIC ACIDS RESEARCH, 2003, 31 (24) :7280-7301
[10]   In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features [J].
Ding, Yiliang ;
Tang, Yin ;
Kwok, Chun Kit ;
Zhang, Yu ;
Bevilacqua, Philip C. ;
Assmann, Sarah M. .
NATURE, 2014, 505 (7485) :696-+