Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions

被引:1044
作者
Simons, KT
Kooperberg, C
Huang, E
Baker, D
机构
[1] UNIV WASHINGTON, DEPT BIOCHEM, SEATTLE, WA 98195 USA
[2] UNIV WASHINGTON, DEPT STAT, SEATTLE, WA 98195 USA
[3] STANFORD UNIV, SCH MED, DEPT BIOL STRUCT, BECKMAN LABS STRUCT BIOL, STANFORD, CA 94305 USA
关键词
protein folding; computer simulation; multiple sequence alignment; structure prediction; knowledge-based scoring functions;
D O I
10.1006/jmbi.1997.0959
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We explore the ability of a simple simulated annealing procedure to assemble native-like structures from fragments of unrelated protein structures with similar local sequences using Bayesian scoring functions. Environment and residue pair specific contributions to the scoring functions appear as the first two terms in a series expansion for the residue probability distributions in the protein database; the decoupling of the distance and environment dependencies of the distributions resolves the major problems with current database-derived scoring functions noted by Thomas and Dill. The simulated annealing procedure rapidly and frequently generates native-like structures for small helical proteins and better than random structures for small beta sheet containing proteins. Most of the simulated structures have native-like solvent accessibility and secondary structure patterns, and thus ensembles of these structures provide a particularly challenging set of decoys for evaluating scoring functions. We investigate the effects of multiple sequence information and different types of conformational constraints on the overall performance of the method, and the ability of a variety of recently developed scoring functions to recognize the native-like conformations in the ensembles of simulated structures. (C) 1997 Academic Press Limited.
引用
收藏
页码:209 / 225
页数:17
相关论文
共 54 条
[1]   DETERMINATION OF THE CONFORMATION OF FOLDING INITIATION SITES IN PROTEINS BY COMPUTER-SIMULATION [J].
AVBELJ, F ;
MOULT, J .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (02) :129-141
[2]   AN IMPROVED PAIR POTENTIAL TO RECOGNIZE NATIVE PROTEIN FOLDS [J].
BAUER, A ;
BEYER, A .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1994, 18 (03) :254-261
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[5]   IDENTIFICATION OF PROTEIN FOLDS - MATCHING HYDROPHOBICITY PATTERNS OF SEQUENCE SETS WITH SOLVENT ACCESSIBILITY PATTERNS OF KNOWN STRUCTURES [J].
BOWIE, JU ;
CLARKE, ND ;
PABO, CO ;
SAUER, RT .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1990, 7 (03) :257-264
[6]   AN EVOLUTIONARY APPROACH TO FOLDING SMALL ALPHA-HELICAL PROTEINS THAT USES SEQUENCE INFORMATION AND AN EMPIRICAL GUIDING FITNESS FUNCTION [J].
BOWIE, JU ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (10) :4436-4440
[7]   Local sequence-structure correlations in proteins [J].
Bystroff, C ;
Simons, KT ;
Han, KF ;
Baker, D .
CURRENT OPINION IN BIOTECHNOLOGY, 1996, 7 (04) :417-421
[8]   ON THE PREDICTION OF PROTEIN-STRUCTURE - THE SIGNIFICANCE OF THE ROOT-MEAN-SQUARE DEVIATION [J].
COHEN, FE ;
STERNBERG, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1980, 138 (02) :321-333
[9]   Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions [J].
Dandekar, T ;
Argos, P .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 256 (03) :645-660
[10]   Multiple sequence information for threading algorithms [J].
Defay, TR ;
Cohen, FE .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 262 (02) :314-323