Automatic protein design with all atom force-fields by exact and heuristic optimization

被引:100
作者
Wernisch, L
Hery, S
Wodak, SJ
机构
[1] EMBL, European Bioinformat Inst, Cambridge CB10 1SD, England
[2] Free Univ Brussels, Unite Conformat Macromol Biol, B-1050 Brussels, Belgium
关键词
de novo protein design; side-chain modeling; dead end elimination; Branch and Bound;
D O I
10.1006/jmbi.2000.3984
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A fully automatic procedure for predicting the amino acid sequences compatible with a given target structure is described. It is based on the CHARMM package, and uses an all atom force-field and rotamer libraries to describe and evaluate side-chain types and conformations. Sequences are ranked by a quantity akin to the free energy of folding which incorporates hydration effects. Exact (Branch and Bound) and heuristic optimisation procedures are used to identifying highly scoring sequences from an astronomical number of possibilities. These sequences include the minimum free energy sequence, as well as all amino acid sequences whose free energy lies within a specified window from the minimum. Several applications of our procedure are illustrated. Prediction of side-chain conformations for a set of ten proteins yields results comparable to those of established side-chain placement programs. Applications to sequence optimisation comprise the re-design of the protein cores of c-Crk SH3 domain, the B1 domain of protein G and Ubiquitin, and of surface residues of the SH3 domain. In all calculations, no restrictions are imposed on the amino acid composition and identical parameter settings are used for core and surface residues. The best scoring sequences for the protein cores are virtually identical to wild-type. They feature no more than one to three mutations in a total of 11-16 variable positions. Tests suggest that this is due to the balance between various contributions in the force-field rather than to overwhelming influence from packing constraints. The effectiveness of our force-field is further supported by the sequence predictions for surface residues of the SH3 domain. More mutations are predicted than in the core, seemingly in order to optimise the network of complementary interactions between polar and charged groups. This appears to be an important energetic requirement in absence of the partner molecules with which the SH3 domain interacts, which were not included in the calculations. Finally, a detailed comparison between the sequences generated by the heuristic and exact optimisation algorithms, commends a note of caution concerning the efficiency of heuristic procedures in exploring sequence space. (C) 2000 Academic Press.
引用
收藏
页码:713 / 736
页数:24
相关论文
共 61 条
[1]  
ALARD P, 1990, THESIS U LIBRE BRUXE
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: A new homology modeling tool [J].
Bower, MJ ;
Cohen, FE ;
Dunbrack, RL .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (05) :1268-1282
[5]   DECIPHERING THE MESSAGE IN PROTEIN SEQUENCES - TOLERANCE TO AMINO-ACID SUBSTITUTIONS [J].
BOWIE, JU ;
REIDHAAROLSON, JF ;
LIM, WA ;
SAUER, RT .
SCIENCE, 1990, 247 (4948) :1306-1310
[6]   CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].
BROOKS, BR ;
BRUCCOLERI, RE ;
OLAFSON, BD ;
STATES, DJ ;
SWAMINATHAN, S ;
KARPLUS, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) :187-217
[7]   THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS [J].
CHOTHIA, C ;
LESK, AM .
EMBO JOURNAL, 1986, 5 (04) :823-826
[8]   HYDROPHOBIC BONDING AND ACCESSIBLE SURFACE-AREA IN PROTEINS [J].
CHOTHIA, C .
NATURE, 1974, 248 (5446) :338-339
[9]   Protein design automation [J].
Dahiyat, BI ;
Mayo, SL .
PROTEIN SCIENCE, 1996, 5 (05) :895-903
[10]   Probing the role of packing specificity in protein design [J].
Dahiyat, BI ;
Mayo, SL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (19) :10172-10177