RNAdualPF: software to compute the dual partition function with sample applications in molecular evolution theory

被引:9
作者
Antonio Garcia-Martin, Juan [1 ,3 ]
Bayegan, Amir H. [1 ]
Dotu, Ivan [2 ]
Clote, Peter [1 ]
机构
[1] Boston Coll, Dept Biol, 140 Commonwealth Ave, Chestnut Hill, MA 02467 USA
[2] Univ Pompeu Fabra, Hosp del Mar Med Res Inst IMIM, Dept Expt & Hlth Sci, Res Programme Biomed Informat GRIB, Dr Aiguader 88, Barcelona, Spain
[3] CSIC, Ctr Nacl Biotecnol, Syst Biol Program, C Darwin 3, E-28049 Madrid, Spain
基金
美国国家科学基金会;
关键词
RNA secondary structure; Partition function; Boltzmann ensemble; Robustness; SECONDARY STRUCTURE; RNA SEQUENCES; DESIGN; ALGORITHM; RFAM;
D O I
10.1186/s12859-016-1280-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: RNA inverse folding is the problem of finding one or more sequences that fold into a user-specified target structure s(0), i.e. whose minimum free energy secondary structure is identical to the target s(0). Here we consider the ensemble of all RNA sequences that have low free energy with respect to a given target s(0). Results: We introduce the program RNAdualPF, which computes the dual partition function Z*, defined as the sum of Boltzmann factors exp(-E(a, s(0))/RT) of all RNA nucleotide sequences a compatible with target structure s(0). Using RNAdualPF, we efficiently sample RNA sequences that approximately fold into s(0), where additionally the user can specify IUPAC sequence constraints at certain positions, and whether to include dangles (energy terms for stacked, single-stranded nucleotides). Moreover, since we also compute the dual partition function Z*(k) over all sequences having GC-content k, the user can require that all sampled sequences have a precise, specified GC-content. Using Z*, we compute the dual expected energy <E*>, and use it to show that natural RNAs from the Rfam 12.0 database have higher minimum free energy than expected, thus suggesting that functional RNAs are under evolutionary pressure to be only marginally thermodynamically stable. We show that C. elegans precursor microRNA (pre-miRNA) is significantly non-robust with respect to mutations, by comparing the robustness of each wild type pre-miRNA sequence with 2000 [resp. 500] sequences of the same GC-content generated by RNAdualPF, which approximately [resp. exactly] fold into the wild type target structure. We confirm and strengthen earlier findings that precursor microRNAs and bacterial small noncoding RNAs display plasticity, a measure of structural diversity. Conclusion: We describe RNAdualPF, which rapidly computes the dual partition function Z* and samples sequences having low energy with respect to a target structure, allowing sequence constraints and specified GC-content. Using different inverse folding software, another group had earlier shown that pre-miRNA is mutationally robust, even controlling for compositional bias. Our opposite conclusion suggests a cautionary note that computationally based insights into molecular evolution may heavily depend on the software used. C/C++-software for RNAdualPF is available at http://bioinformatics. bc. edu/clotelab/RNAdualPF.
引用
收藏
页数:24
相关论文
共 28 条
[1]  
[Anonymous], 2015, LOS ALAMOS HIV DATAB
[2]   RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules [J].
Antonio Garcia-Martin, Juan ;
Dotu, Ivan ;
Clote, Peter .
NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) :W513-W521
[3]   RNAiFOLD: A CONSTRAINT PROGRAMMING ALGORITHM FOR RNA INVERSE FOLDING AND MOLECULAR DESIGN [J].
Antonio Garcia-Martin, Juan ;
Clote, Peter ;
Dotu, Ivan .
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2013, 11 (02)
[4]   Direct evolution of genetic robustness in microRNA [J].
Borenstein, E ;
Ruppin, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (17) :6593-6598
[5]   INFO-RNA - a fast approach to inverse RNA folding [J].
Busch, Anke ;
Backofen, Rolf .
BIOINFORMATICS, 2006, 22 (15) :1823-1831
[6]   A statistical sampling algorithm for RNA secondary structure prediction [J].
Ding, Y ;
Lawrence, CE .
NUCLEIC ACIDS RESEARCH, 2003, 31 (24) :7280-7301
[7]   Sfold web server for statistical folding and rational design of nucleic acids [J].
Ding, Y ;
Chan, CY ;
Lawrence, CE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W135-W141
[8]   Paradigms for computational nucleic acid design [J].
Dirks, RM ;
Lin, M ;
Winfree, E ;
Pierce, NA .
NUCLEIC ACIDS RESEARCH, 2004, 32 (04) :1392-1403
[9]  
Garcia-Martin JA, 2016, THESIS
[10]   RNA Thermodynamic Structural Entropy [J].
Garcia-Martin, Juan Antonio ;
Clote, Peter .
PLOS ONE, 2015, 10 (11)