Clustal Omega for making accurate alignments of many protein sequences

被引:1349
作者
Sievers, Fabian
Higgins, Desmond G. [1 ,2 ]
机构
[1] Univ Coll Dublin, Sch Med, Dublin 4, Ireland
[2] Univ Coll Dublin, Conway Inst Biomol & Biomed Res, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
clustal omega; multiple sequence alignment; benchmarking; protein structure; GUIDE TREES;
D O I
10.1002/pro.3290
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Clustal Omega is a widely used package for carrying out multiple sequence alignment. Here, we describe some recent additions to the package and benchmark some alternative ways of making alignments. These benchmarks are based on protein structure comparisons or predictions and include a recently described method based on secondary structure prediction. In general, Clustal Omega is fast enough to make very large alignments and the accuracy of protein alignments is high when compared to alternative packages. The package is freely available as executables or source code from or can be run on-line from a variety of sites, especially the EBI .
引用
收藏
页码:135 / 145
页数:11
相关论文
共 23 条
[1]   Sequence embedding for fast construction of guide trees for multiple sequence alignment [J].
Blackshields, Gordon ;
Sievers, Fabian ;
Shi, Weifeng ;
Wilm, Andreas ;
Higgins, Desmond G. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2010, 5
[2]   Simple chained guide trees give high-quality protein multiple sequence alignments [J].
Boyce, Kieran ;
Sievers, Fabian ;
Higgins, Desmond G. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (29) :10556-10561
[3]   JPred4: a protein secondary structure prediction server [J].
Drozdetskiy, Alexey ;
Cole, Christian ;
Procter, James ;
Barton, Geoffrey J. .
NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) :W389-W394
[4]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[5]   Pfam: the protein families database [J].
Finn, Robert D. ;
Bateman, Alex ;
Clements, Jody ;
Coggill, Penelope ;
Eberhardt, Ruth Y. ;
Eddy, Sean R. ;
Heger, Andreas ;
Hetherington, Kirstie ;
Holm, Liisa ;
Mistry, Jaina ;
Sonnhammer, Erik L. L. ;
Tate, John ;
Punta, Marco .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D222-D230
[6]   HMMER web server: interactive sequence similarity searching [J].
Finn, Robert D. ;
Clements, Jody ;
Eddy, Sean R. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :W29-W37
[7]   Using de novo protein structure predictions to measure the quality of very large multiple sequence alignments [J].
Fox, Gearoid ;
Sievers, Fabian ;
Higgins, Desmond G. .
BIOINFORMATICS, 2016, 32 (06) :814-820
[8]   A benchmark of multiple sequence alignment programs upon structural RNAs [J].
Gardner, PP ;
Wilm, A ;
Washietl, S .
NUCLEIC ACIDS RESEARCH, 2005, 33 (08) :2433-2439
[9]  
HIGGINS DG, 1992, COMPUT APPL BIOSCI, V8, P189
[10]   MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform [J].
Katoh, K ;
Misawa, K ;
Kuma, K ;
Miyata, T .
NUCLEIC ACIDS RESEARCH, 2002, 30 (14) :3059-3066