RNAmountAlign: Efficient software for local, global, semiglobal pairwise and multiple RNA sequence/structure alignment

被引:4
作者
Bayegan, Amir H. [1 ]
Clote, Peter [1 ]
机构
[1] Boston Coll, Biol Dept, Chestnut Hill, MA 02167 USA
基金
美国国家科学基金会;
关键词
STRUCTURAL ALIGNMENT; SECONDARY STRUCTURE; SEQUENCE ALIGNMENT; WEB SERVER; ALGORITHM; FAMILIES; FEATURES; SEARCH; GENOME; ENERGY;
D O I
10.1371/journal.pone.0227177
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Alignment of structural RNAs is an important problem with a wide range of applications. Since function is often determined by molecular structure, RNA alignment programs should take into account both sequence and base-pairing information for structural homology identification. This paper describes C++ software, RNAmountAlign, for RNA sequence/structure alignment that runs in O(n(3)) time and O(n(2)) space for two sequences of length n; moreover, our software returns a p-value (transformable to expect value E) based on Karlin-Altschul statistics for local alignment, as well as parameter fitting for local and global alignment. Using incremental mountain height, a representation of structural information computable in cubic time, RNAmountAlign implements quadratic time pairwise local, global and global/semiglobal (query search) alignment using a weighted combination of sequence and structural similarity. RNAmountAlign is capable of performing progressive multiple alignment as well. Benchmarking of RNAmountAlign against LocARNA, LARA, FOLDALIGN, DYNALIGN, STRAL, MXSCARNA, and MUSCLE shows that RNAmountAlign has reasonably good accuracy and faster run time supporting all alignment types. Additionally, our extension of RNAmountAlign, called RNAmountAlignScan, which scans a target genome sequence to find hits having high sequence and structural similarity to a given query sequence, outperforms RSEARCH and sequence-only query scans and runs faster than FOLDALIGN query scan.
引用
收藏
页数:34
相关论文
共 53 条
[1]  
[Anonymous], 1966, Soviet Physics Doklady
[2]  
[Anonymous], 1997, ALGORITHMS STRINGS T
[3]  
Barsacchi M., 2016, Proceedings of the 31st Annual ACM Symposium on Applied Computing, P44
[4]   DETERMINANTS OF A PROTEIN FOLD - UNIQUE FEATURES OF THE GLOBIN AMINO-ACID-SEQUENCES [J].
BASHFORD, D ;
CHOTHIA, C ;
LESK, AM .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (01) :199-216
[5]   Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization [J].
Bauer, Markus ;
Klau, Gunnar W. ;
Reinert, Knut .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]   RNAalifold: improved consensus structure prediction for RNA alignments [J].
Bernhart, Stephan H. ;
Hofacker, Ivo L. ;
Will, Sebastian ;
Gruber, Andreas R. ;
Stadler, Peter F. .
BMC BIOINFORMATICS, 2008, 9 (1)
[7]   Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency [J].
Clote, P ;
Ferré, F ;
Kranakis, E ;
Krizanc, D .
RNA, 2005, 11 (05) :578-591
[8]   STRAL: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time [J].
Dalli, Deniz ;
Wilm, Andreas ;
Mainz, Indra ;
Steger, Gerhard .
BIOINFORMATICS, 2006, 22 (13) :1593-1599
[9]   A statistical sampling algorithm for RNA secondary structure prediction [J].
Ding, Y ;
Lawrence, CE .
NUCLEIC ACIDS RESEARCH, 2003, 31 (24) :7280-7301
[10]   DIAL:: a web server for the pairwise alignment of two RNA three-dimensional structures using nucleotide, dihedral angle and base-pairing similarities [J].
Ferre, F. ;
Ponty, Y. ;
Lorenz, W. A. ;
Clote, Peter .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W659-W668