Accuracy analysis of multiple structure alignments

被引:26
作者
Berbalk, Christoph [1 ]
Schwaiger, Christine S. [2 ]
Lackner, Peter [1 ]
机构
[1] Salzburg Univ, Dept Mol Biol, A-5020 Salzburg, Austria
[2] Stockholm Univ, Dept Biophys & Biochem, Ctr Biomembrane Res, S-10691 Stockholm, Sweden
关键词
multiple structure alignment; alignment accuracy; reference structural alignments; structural bioinformatics; PROTEIN-STRUCTURE ALIGNMENT; CLASSIFICATION; SIMILARITIES; SEQUENCES;
D O I
10.1002/pro.213
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein structure alignment methods are essential for many different challenges in protein science, such as the determination of relations between proteins in the fold space or the analysis and prediction of their biological function. A number of different pairwise and multiple structure alignment (MStA) programs have been developed and provided to the community. Prior knowledge of the expected alignment accuracy is desirable for the user of such tools. To retrieve an estimate of the performance of current structure alignment methods, we compiled a test suite taken from literature and the SISYPHUS database consisting of proteins that are difficult to align. Subsequently, different MStA programs were evaluated regarding alignment correctness and general limitations. The analysis shows that there are large differences in the success between the methods in terms of applicability and correctness. The latter ranges from 44 to 75% correct core positions. Taking only the best method result per test case this number increases to 84%. We conclude that the methods available are applicable to difficult cases, but also that there is still room for improvements in both, practicability and alignment correctness. An approach that combines the currently available methods supported by a proper score would be useful. Until then, a user should not rely on just a single program.
引用
收藏
页码:2027 / 2035
页数:9
相关论文
共 33 条
[1]   SISYPHUS - structural alignments for proteins with non-trivial relationships [J].
Andreeva, Antonina ;
Prlic, Andreas ;
Hubbard, Tim J. P. ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D253-D259
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   Improving model construction of profile HMMs for remote homology detection through structural alignment [J].
Bernardes, Juliana S. ;
Davila, Alberto Mr ;
Costa, Vitor S. ;
Zaverucha, Gerson .
BMC BIOINFORMATICS, 2007, 8 (1)
[4]   Protein fold similarity estimated by a probabilistic approach based on Cα-Cα distance comparison [J].
Carugo, O ;
Pongor, S .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 315 (04) :887-898
[5]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[6]   The CATH classification revisited-architectures reviewed and new ways to characterize structural divergence in superfamilies [J].
Cuff, Alison L. ;
Sillitoe, Ian ;
Lewis, Tony ;
Redfern, Oliver C. ;
Garratt, Richard ;
Thornton, Janet ;
Orengo, Christine A. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D310-D314
[7]   MASS: multiple structural alignment by secondary structures [J].
Dror, O. ;
Benyamini, H. ;
Nussinov, R. ;
Wolfson, H. .
BIOINFORMATICS, 2003, 19 :i95-i104
[8]   Development and validation of a consistency based multiple structure alignment algorithm [J].
Ebert, J ;
Brutlag, D .
BIOINFORMATICS, 2006, 22 (09) :1080-1087
[9]   Knowledge-based protein secondary structure assignment [J].
Frishman, D ;
Argos, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04) :566-579
[10]   Surprising similarities in structure comparison [J].
Gibrat, JF ;
Madej, T ;
Bryant, SH .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :377-385