RIsearch: fast RNA-RNA interaction search using a simplified nearest-neighbor energy model

被引:69
作者
Wenzel, Anne [1 ,2 ]
Akbasli, Erdinc [3 ]
Gorodkin, Jan [1 ,2 ]
机构
[1] Univ Copenhagen, Ctr Noncoding RNA Technol & Hlth, DK-1870 Frederiksberg, Denmark
[2] Univ Copenhagen, Dept Vet Clin & Anim Sci, DK-1870 Frederiksberg, Denmark
[3] Univ Copenhagen, Software Dev Grp, DK-2300 Copenhagen S, Denmark
关键词
SECONDARY STRUCTURE PREDICTION; BASE-PAIRING PROBABILITIES; NONCODING RNAS; COMPARATIVE GENOMICS; TARGETS; IDENTIFICATION; ALGORITHM; ACCESSIBILITY; ALIGNMENTS; STABILITY;
D O I
10.1093/bioinformatics/bts519
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Regulatory, non-coding RNAs often function by forming a duplex with other RNAs. It is therefore of interest to predict putative RNA-RNA duplexes in silico on a genome-wide scale. Current computational methods for predicting these interactions range from fast complementary-based searches to those that take intramolecular binding into account. Together these methods constitute a trade-off between speed and accuracy, while leaving room for improvement within the context of genome-wide screens. A fast pre-filtering of putative duplexes would therefore be desirable. Results: We present RIsearch, an implementation of a simplified Turner energy model for fast computation of hybridization, which significantly reduces runtime while maintaining accuracy. Its time complexity for sequences of lengths m and n is O(m.n) with a much smaller pre-factor than other tools. We show that this energy model is an accurate approximation of the full energy model for near-complementary RNA-RNA duplexes. RIsearch uses a Smith-Waterman-like algorithm using a dinucleotide scoring matrix which approximates the Turner nearest-neighbor energies. We show in benchmarks that we achieve a speed improvement of at least 2.4x compared with RNAplex, the currently fastest method for searching near-complementary regions. RIsearch shows a prediction accuracy similar to RNAplex on two datasets of known bacterial short RNA (sRNA)-messenger RNA (mRNA) and eukaryotic microRNA (miRNA)-mRNA interactions. Using RIsearch as a pre-filter in genome-wide screens reduces the number of binding site candidates reported by miRNA target prediction programs, such as TargetScanS and miRanda, by up to 70%. Likewise, substantial filtering was performed on bacterial RNA-RNA interaction data.
引用
收藏
页码:2738 / 2746
页数:9
相关论文
共 62 条
[1]  
Akbasli E., 2008, THESIS IT U COPENHAG
[2]   RNA-RNA interaction prediction and antisense RNA target search [J].
Alkan, C ;
Karakoç, E ;
Nadeau, JH ;
Sahinalp, SC ;
Zhang, KH .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :267-282
[3]   The eukaryotic genome as an RNA machine [J].
Amaral, Paulo P. ;
Dinger, Marcel E. ;
Mercer, Tim R. ;
Mattick, John S. .
SCIENCE, 2008, 319 (5871) :1787-1789
[4]   Secondary structure prediction of interacting RNA molecules [J].
Andronescu, M ;
Zhang, ZC ;
Condon, A .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 345 (05) :987-1001
[5]   MicroRNAs: tiny targets for engineering CHO cell phenotypes? [J].
Barron, Niall ;
Sanchez, Noelia ;
Kelly, Paul ;
Clynes, Martin .
BIOTECHNOLOGY LETTERS, 2011, 33 (01) :11-21
[6]   Identification of hundreds of conserved and nonconserved human microRNAs [J].
Bentwich, I ;
Avniel, A ;
Karov, Y ;
Aharonov, R ;
Gilad, S ;
Barad, O ;
Barzilai, A ;
Einat, P ;
Einav, U ;
Meiri, E ;
Sharon, E ;
Spector, Y ;
Bentwich, Z .
NATURE GENETICS, 2005, 37 (07) :766-770
[7]   Local RNA base pairing probabilities in large sequences [J].
Bernhart, SH ;
Hofacker, IL ;
Stadler, PF .
BIOINFORMATICS, 2006, 22 (05) :614-615
[8]   Partition function and base pairing probabilities of RNA heterodimers [J].
Bernhart, Stephan H. ;
Tafer, Hakim ;
Mueckstein, Ulrike ;
Flamm, Christoph ;
Stadler, Peter F. ;
Hofacker, Ivo L. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)
[9]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[10]   Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments [J].
Breitling, R ;
Armengaud, P ;
Amtmann, A ;
Herzyk, P .
FEBS LETTERS, 2004, 573 (1-3) :83-92