Infernal 1.1: 100-fold faster RNA homology searches

被引:2353
作者
Nawrocki, Eric P. [1 ]
Eddy, Sean R. [1 ]
机构
[1] HHMI Janelia Farm Res Campus, Ashburn, VA 20147 USA
关键词
NONCODING RNA; DATABASE; ALIGNMENTS; ANNOTATION; SEQUENCE; FAMILIES; RFAM;
D O I
10.1093/bioinformatics/btt509
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Infernal builds probabilistic profiles of the sequence and secondary structure of an RNA family called covariance models (CMs) from structurally annotated multiple sequence alignments given as input. Infernal uses CMs to search for new family members in sequence databases and to create potentially large multiple sequence alignments. Version 1.1 of Infernal introduces a new filter pipeline for RNA homology search based on accelerated profile hidden Markov model (HMM) methods and HMM-banded CM alignment methods. This enables similar to 100-fold acceleration over the previous version and similar to 10 000-fold acceleration over exhaustive non-filtered CM searches.
引用
收藏
页码:2933 / 2935
页数:3
相关论文
共 16 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Brown M P, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P57
[3]   Rfam 11.0: 10 years of RNA families [J].
Burge, Sarah W. ;
Daub, Jennifer ;
Eberhardt, Ruth ;
Tate, John ;
Barquist, Lars ;
Nawrocki, Eric P. ;
Eddy, Sean R. ;
Gardner, Paul P. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D226-D232
[4]   The Ribosomal Database Project: improved alignments and new tools for rRNA analysis [J].
Cole, J. R. ;
Wang, Q. ;
Cardenas, E. ;
Fish, J. ;
Chai, B. ;
Farris, R. J. ;
Kulam-Syed-Mohideen, A. S. ;
McGarrell, D. M. ;
Marsh, T. ;
Garrity, G. M. ;
Tiedje, J. M. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D141-D145
[5]  
Durbin R., 1998, Biological sequence analysis: probabilistic models of proteins and nucleic acids
[6]  
Eddy S. R, 2003, HMMER2 USERS GUIDE
[7]   A probabilistic model of local sequence alignment that simplifies statistical significance estimation [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (05)
[8]   Accelerated Profile HMM Searches [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)
[9]   Exploring genomic dark matter: A critical assessment of the performance of homology search methods on noncoding RNA [J].
Freyhult, Eva K. ;
Bollback, Jonathan P. ;
Gardner, Paul P. .
GENOME RESEARCH, 2007, 17 (01) :117-125
[10]   Rfam: an RNA family database [J].
Griffiths-Jones, S ;
Bateman, A ;
Marshall, M ;
Khanna, A ;
Eddy, SR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :439-441