SABERTOOTH: protein structural alignment based on a vectorial structure representation

被引:32
作者
Teichert, Florian [1 ]
Bastolla, Ugo [2 ]
Porto, Markus [1 ]
机构
[1] Tech Univ Darmstadt, Inst Festkorperphys, D-64289 Darmstadt, Germany
[2] Univ Autonoma Madrid, CSIC, Ctr Biol Mol Severo Ochoa, E-28049 Madrid, Spain
来源
BMC BIOINFORMATICS | 2007年 / 8卷
关键词
RELATE; 2; SETS; SEQUENCE; PREDICTION; ROTATION; DATABASE;
D O I
10.1186/1471-2105-8-425
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The task of computing highly accurate structural alignments of proteins in very short computation time is still challenging. This is partly due to the complexity of protein structures. Therefore, instead of manipulating coordinates directly, matrices of inter-atomic distances, sets of vectors between protein backbone atoms, and other reduced representations are used. These decrease the effort of comparing large sets of coordinates, but protein structural alignment still remains computationally expensive. Results: We represent the topology of a protein structure through a structural profile that expresses the global effective connectivity of each residue. We have shown recently that this representation allows explicitly expressing the relationship between protein structure and protein sequence. Based on this very condensed vectorial representation, we develop a structural alignment framework that recognizes structural similarities with accuracy comparable to established alignment tools. Furthermore, our algorithm has favourable scaling of computation time with chain length. Since the algorithm is independent of the details of the structural representation, our framework can be applied to sequence-to-sequence and sequence-to-structure comparison within the same setup, and it is therefore more general than other existing tools.
引用
收藏
页数:17
相关论文
共 30 条
[1]   Do aligned sequences share the same fold? [J].
Abagyan, RA ;
Batalov, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (01) :355-368
[2]  
[Anonymous], 1992, LAPACK Users' Guide
[3]   Principal eigenvector of contact matrices and hydrophobicity profiles in proteins [J].
Bastolla, U ;
Porto, M ;
Roman, HE ;
Vendruscolo, M .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (01) :22-30
[4]   A protein evolution model with independent sites that reproduces site-specific amino acid distributions from the Protein Data Bank [J].
Bastolla, Ugo ;
Porto, Markus ;
Roman, H. Eduardo ;
Vendruscolo, Michele .
BMC EVOLUTIONARY BIOLOGY, 2006, 6 (1)
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[7]  
Dijkstra E. W., 1959, NUMER MATH, V1, P269, DOI DOI 10.1007/BF01386390
[8]   PARSER FOR PROTEIN-FOLDING UNITS [J].
HOLM, L ;
SANDER, C .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (03) :256-268
[9]   DaliLite workbench for protein structure comparison [J].
Holm, L ;
Park, J .
BIOINFORMATICS, 2000, 16 (06) :566-567
[10]   Protein structure alignment using environmental profiles [J].
Jung, J ;
Lee, B .
PROTEIN ENGINEERING, 2000, 13 (08) :535-543