Annotating nucleic acid-binding function based on protein structure

被引:165
作者
Stawiski, EW
Gregoret, LM [1 ]
Mandel-Gutfreund, Y
机构
[1] Univ Calif Santa Cruz, Dept Chem & Biochem, Santa Cruz, CA 95064 USA
[2] Univ Calif Santa Cruz, Dept Mol Cell & Dev Biol, Santa Cruz, CA 95064 USA
关键词
structural genomics; nucleic acid binding; function prediction; electrostatics; surface patches;
D O I
10.1016/S0022-2836(03)00031-7
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Many of the targets of structural genomics will be proteins with little or no structural similarity to those currently in the database. Therefore, novel function prediction methods that do not rely on sequence or fold similarity to other known proteins are needed. We present an automated approach to predict nucleic-acid-binding (NA-binding) proteins, specifically DNA-binding proteins. The method is based on characterizing the structural and sequence properties of large, positively charged electrostatic patches on DNA-binding protein surfaces, which typically coincide with the DNA-binding-sites. Using an ensemble of features extracted from these electrostatic patches, we predict DNA-binding proteins with high accuracy. We show that our method does not rely on sequence or structure homology and is capable of predicting proteins of novel-binding motifs and protein structures solved in an unbound state. Our method can also distinguish NA-binding proteins from other proteins that have similar, large positive electrostatic patches on their surfaces, but that do not bind nucleic acids. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1065 / 1079
页数:15
相关论文
共 66 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [3] Membrane binding of peptides containing both basic and aromatic residues.: Experimental studies with peptides corresponding to the scaffolding region of caveolin and the effector region of MARCKS
    Arbuzova, A
    Wang, LB
    Wang, JY
    Hangyás-Mihályné, G
    Murray, D
    Honig, B
    McLaughlin, S
    [J]. BIOCHEMISTRY, 2000, 39 (33) : 10330 - 10339
  • [4] ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information
    Armon, A
    Graur, D
    Ben-Tal, N
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) : 447 - 463
  • [5] CRYSTAL-STRUCTURES AT 2.5 ANGSTROM RESOLUTION OF SERYL-TRANSFER-RNA SYNTHETASE COMPLEXED 2 ANALOGS OF SERYL ADENYLATE
    BELRHALI, H
    YAREMCHUK, A
    TUKALO, M
    LARSEN, K
    BERTHETCOLOMINAS, C
    LEBERMAN, R
    BEIJER, B
    SPROAT, B
    ALSNIELSEN, J
    GRUBEL, G
    LEGRAND, JF
    LEHMANN, M
    CUSACK, S
    [J]. SCIENCE, 1994, 263 (5152) : 1432 - 1436
  • [6] THE NUCLEIC-ACID DATABASE - A COMPREHENSIVE RELATIONAL DATABASE OF 3-DIMENSIONAL STRUCTURES OF NUCLEIC-ACIDS
    BERMAN, HM
    OLSON, WK
    BEVERIDGE, DL
    WESTBROOK, J
    GELBIN, A
    DEMENY, T
    HSIEH, SH
    SRINIVASAN, AR
    SCHNEIDER, B
    [J]. BIOPHYSICAL JOURNAL, 1992, 63 (03) : 751 - 759
  • [7] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [8] Sequence and structure-based prediction of eukaryotic protein phosphorylation sites
    Blom, N
    Gammeltoft, S
    Brunak, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1999, 294 (05) : 1351 - 1362
  • [9] Implication of tubby proteins as transcription factors by structure-based functional analysis
    Boggon, TJ
    Shan, WS
    Santagata, S
    Myers, SC
    Shapiro, L
    [J]. SCIENCE, 1999, 286 (5447) : 2119 - 2125
  • [10] The relaxin receptor-binding site geometry suggests a novel gripping mode of interaction
    Büllesbach, EE
    Schwabe, C
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2000, 275 (45) : 35276 - 35280