GOAP: A Generalized Orientation-Dependent, All-Atom Statistical Potential for Protein Structure Prediction

被引:219
作者
Zhou, Hongyi [1 ]
Skolnick, Jeffrey [1 ]
机构
[1] Georgia Inst Technol, Sch Biol, Ctr Study Syst Biol, Atlanta, GA 30332 USA
基金
美国国家卫生研究院;
关键词
MEAN FORCE; TERTIARY STRUCTURES; FOLD-RECOGNITION; ABSOLUTE QUALITY; SCORING FUNCTION; ENERGY; REFINEMENT; RESIDUES; SEGMENTS; BURIAL;
D O I
10.1016/j.bpj.2011.09.012
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
An accurate scoring function is a key component for successful protein structure prediction. To address this important unsolved problem, we develop a generalized orientation and distance-dependent all-atom statistical potential. The new statistical potential, generalized orientation-dependent all-atom potential (GOAP), depends on the relative orientation of the planes associated with each heavy atom in interacting pairs. GOAP is a generalization of previous orientation-dependent potentials that consider only representative atoms or blocks of side-chain or polar atoms. GOAP is decomposed into distance- and angle-dependent contributions. The DFIRE distance-scaled finite ideal gas reference state is employed for the distance-dependent component of GOAP. GOAP was tested on 11 commonly used decoy sets containing 278 targets, and recognized 226 native structures as best from the decoys, whereas DFIRE recognized 127 targets. The major improvement comes from decoy sets that have homology-modeled structures that are close to native (all within similar to 4.0 angstrom) or from the ROSETTA ab initio decoy set. For these two kinds of decoys, orientation-independent DFIRE or only side-chain orientation-dependent RWplus performed poorly. Although the OPUS-PSP block-based orientation-dependent, side-chain atom contact potential performs much better (recognizing 196 targets) than DFIRE, RWplus, and dDFIRE, it is still similar to 15% worse than GOAP. Thus, GOAP is a promising advance in knowledge-based, all-atom statistical potentials. GOAP is available for download at http://cssb.biology.gatech.edu/GOAP.
引用
收藏
页码:2043 / 2052
页数:10
相关论文
共 56 条
[1]   A new force field (ECEPP-05) for peptides, proteins, and organic molecules [J].
Arnautova, YA ;
Jagielska, A ;
Scheraga, HA .
JOURNAL OF PHYSICAL CHEMISTRY B, 2006, 110 (10) :5025-5044
[2]   QMEAN: A comprehensive scoring function for model quality assessment [J].
Benkert, Pascal ;
Tosatto, Silvio C. E. ;
Schomburg, Dietmar .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) :261-277
[3]   Toward the estimation of the absolute quality of individual protein structure models [J].
Benkert, Pascal ;
Biasini, Marco ;
Schwede, Torsten .
BIOINFORMATICS, 2011, 27 (03) :343-350
[4]   CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].
BROOKS, BR ;
BRUCCOLERI, RE ;
OLAFSON, BD ;
STATES, DJ ;
SWAMINATHAN, S ;
KARPLUS, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) :187-217
[5]   Development of novel statistical potentials for protein fold recognition [J].
Buchete, NV ;
Straub, JE ;
Thirumalai, D .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2004, 14 (02) :225-232
[6]   Evaluation of atomic level mean force potentials via inverse folding and inverse refinement of protein structures: Atomic burial position and pairwise non-bonded interactions [J].
DeBolt, SE ;
Skolnick, J .
PROTEIN ENGINEERING, 1996, 9 (08) :637-655
[7]   A composite score for predicting errors in protein structure models [J].
Eramian, David ;
Shen, Min-Yi ;
Devos, Damien ;
Melo, Francisco ;
Sali, Andrej ;
Marti-Renom, Marc A. .
PROTEIN SCIENCE, 2006, 15 (07) :1653-1666
[8]   Four-body contact potentials derived from two protein datasets to discriminate native structures from decoys [J].
Feng, Yaping ;
Kloczkowski, Andrzej ;
Jernigan, Robert L. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 68 (01) :57-66
[9]   Development of novel statistical potentials describing cation-π interactions in proteins and comparison with semiempirical and quantum chemistry approaches [J].
Gilis, D ;
Biot, C ;
Buisine, E ;
Dehouck, Y ;
Rooman, M .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :884-893
[10]   IDENTIFICATION OF NATIVE PROTEIN FOLDS AMONGST A LARGE NUMBER OF INCORRECT MODELS - THE CALCULATION OF LOW-ENERGY CONFORMATIONS FROM POTENTIALS OF MEAN FORCE [J].
HENDLICH, M ;
LACKNER, P ;
WEITCKUS, S ;
FLOECKNER, H ;
FROSCHAUER, R ;
GOTTSBACHER, K ;
CASARI, G ;
SIPPL, MJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (01) :167-180