A hybrid clustering of protein binding sites

Cited by: 10
Authors
Ivan, Gabor [1, 2]
Szabadka, Zoltan [1, 2]
Grolmusz, Vince [1, 2]
Affiliations
[1] Eotvos Lorand Univ, Dept Comp Sci, Prot Informat Technol Grp, H-1117 Budapest, Hungary
[2] Uratim Ltd, Budapest, Hungary
Funding
Hungarian Scientific Research Fund
Keywords
binding sites; clustering; distance; OPTICS; PDB; sequence; FUNCTIONAL CLASSIFICATION; SEQUENCE; PREDICTION; DATABASE; MOTIFS; PFAM;
DOI
10.1111/j.1742-4658.2010.07578.x
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The Protein Data Bank contains descriptions of approximately 27 000 protein-ligand binding sites. Most of the ligands at these sites are biologically active small molecules that affect the biological function of the protein, so classifying their binding sites may yield results relevant to drug discovery and design. Here, clusters of similar binding sites were created by a hybrid approach, based on both sequence and spatial structure, using the OPTICS clustering algorithm. A dissimilarity measure was defined as a distance function on the amino acid sequences of the binding sites. All binding sites in the Protein Data Bank were clustered according to this distance function, and the clusters were found to characterize the Enzyme Commission numbers of the entries well. The results, covering the 20 967 clustered binding sites and color coded by the Enzyme Commission numbers of the proteins, are available as HTML files in three parts at http://pitgroup.org/seqclust/.
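The abstract names OPTICS and a sequence-based distance but does not spell out either, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a normalized edit distance as a stand-in for the paper's distance function, uses made-up toy strings in place of real binding-site sequences, and picks `min_pts` and `threshold` arbitrarily; all function names are hypothetical.

```python
import math

def edit_distance(a, b):
    """Levenshtein distance between two strings (dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]

def seq_distance(a, b):
    """Assumed stand-in dissimilarity: edit distance scaled to [0, 1]."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0

def optics(dist, min_pts, eps=math.inf):
    """Basic OPTICS on a precomputed distance matrix.

    Returns the cluster ordering plus reachability and core distances.
    """
    n = len(dist)
    reach = [math.inf] * n
    core = [math.inf] * n
    for p in range(n):
        near = sorted(d for d in dist[p] if d <= eps)  # includes p itself
        if len(near) >= min_pts:
            core[p] = near[min_pts - 1]
    processed = [False] * n
    order = []

    def expand(p, seeds):
        # Emit p, then lower the reachability of its unprocessed neighbours.
        processed[p] = True
        order.append(p)
        if math.isinf(core[p]):
            return
        for o in range(n):
            if not processed[o] and dist[p][o] <= eps:
                new_reach = max(core[p], dist[p][o])
                if new_reach < reach[o]:
                    reach[o] = new_reach
                    seeds[o] = new_reach

    for start in range(n):
        if processed[start]:
            continue
        seeds = {}
        expand(start, seeds)
        while seeds:
            # Next point: smallest reachability, ties broken by index.
            q = min(seeds, key=lambda k: (seeds[k], k))
            del seeds[q]
            expand(q, seeds)
    return order, reach, core

def extract_clusters(order, reach, core, threshold):
    """DBSCAN-style cut of the ordering; label -1 marks noise."""
    labels = [-1] * len(order)
    cluster_id = -1
    for p in order:
        if reach[p] > threshold:
            if core[p] <= threshold:
                cluster_id += 1          # p starts a new cluster
                labels[p] = cluster_id
        else:
            labels[p] = cluster_id       # p joins the current cluster
    return labels

# Toy strings only; these are not real binding-site sequences.
sites = ["GSGKST", "GSGKTT", "GAGKST", "HDHLDE", "HDHLEE", "HEHLDE", "WWWWWW"]
dist = [[seq_distance(a, b) for b in sites] for a in sites]
order, reach, core = optics(dist, min_pts=2)
labels = extract_clusters(order, reach, core, threshold=0.4)
print(labels)
```

On this toy input the first three strings and the next three form two separate clusters, and the last string is left as noise. The quadratic neighbour search is acceptable for a handful of items; a data set the size of the one in the abstract would need an indexed or library implementation.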
Pages: 1494-1502
Number of pages: 9