MAFFT-DASH: integrated protein sequence and structural alignment

被引：660

作者：

Rozewicki, John ^{[1
,2
]}

Li, Songling ^{[1
,2
]}

Amada, Karlou Mar ^{[2
,3
]}

Standley, Daron M. ^{[1
,2
]}

Katoh, Kazutaka ^{[1
,2
]}

机构：

[1] Osaka Univ, Microbial Dis Res Inst, Dept Genome Informat, Genome Informat Res Ctr, 3-1 Yamadaoka, Suita, Osaka 5650871, Japan

[2] Osaka Univ, Immunol Frontier Res Ctr, Syst Immunol Lab, 3-1 Yamadaoka, Suita, Osaka 5650871, Japan

[3] TILI IO PTY LTD, L7 380 Docklands Dr, Docklands, Vic 3008, Australia

来源：

NUCLEIC ACIDS RESEARCH | 2019年 / 47卷 / W1期

关键词：

MULTIPLE; ACCURACY; DATABASE; PROGRAM;

D O I：

10.1093/nar/gkz342

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Here, we describe a web server that integrates structural alignments with the MAFFT multiple sequence alignment (MSA) tool. For this purpose, we have prepared a web-based Database of Aligned Structural Homologs (DASH), which provides structural alignments at the domain and chain levels for all proteins in the Protein Data Bank (PDB), and can be queried interactively or by a simple REST-like API. MAFFT-DASH integration can be invoked with a single flag on either the web (https://mafft.cbrc.jp/alignment/server/) or command-line versions of MAFFT. In our benchmarks using 878 cases from the BAliBase, HomFam, OXFam, Mattbench and SISYPHUS datasets, MAFFT-DASH showed 10-20% improvement over standard MAFFT for MSA problems with weak similarity, in terms of Sum-of-Pairs (SP), a measure of how well a program succeeds at aligning input sequences in comparison to a reference alignment. When MAFFT alignments were supplemented with homologous sequences, further improvement was observed. Potential applications of DASH beyond MSA enrichment include functional annotation through detection of remote homology and assembly of template libraries for homology modeling.

引用

页码：W5 / W10

页数：6

共 33 条

[1] PDP: protein domain parser [J].