Optimized atomic statistical potentials: assessment of protein interfaces and loops

被引:88
作者
Dong, Guang Qiang
Fan, Hao
Schneidman-Duhovny, Dina
Webb, Ben
Sali, Andrej [1 ]
机构
[1] Univ Calif San Francisco, Dept Bioengn & Therapeut Sci, Dept Pharmaceut Chem, San Francisco, CA 94158 USA
关键词
KNOWLEDGE-BASED POTENTIALS; REFERENCE STATE IMPROVES; MEAN-FORCE; STRUCTURE PREDICTION; SECONDARY STRUCTURE; SCORING FUNCTION; ENERGY FUNCTION; DISTANCE; DOCKING; DISCRIMINATION;
D O I
10.1093/bioinformatics/btt560
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Statistical potentials have been widely used for modeling whole proteins and their parts ( e. g. sidechains and loops) as well as interactions between proteins, nucleic acids and small molecules. Here, we formulate the statistical potentials entirely within a statistical framework, avoiding questionable statistical mechanical assumptions and approximations, including a definition of the reference state. Results: We derive a general Bayesian framework for inferring statistically optimized atomic potentials (SOAP) in which the reference state is replaced with data-driven 'recovery' functions. Moreover, we restrain the relative orientation between two covalent bonds instead of a simple distance between two atoms, in an effort to capture orientation-dependent interactions such as hydrogen bonds. To demonstrate this general approach, we computed statistical potentials for protein-protein docking (SOAP-PP) and loop modeling (SOAP-Loop). For docking, a near-native model is within the top 10 scoring models in 40% of the PatchDock benchmark cases, compared with 23 and 27% for the state-of-the-art ZDOCK and FireDock scoring functions, respectively. Similarly, for modeling 12-residue loops in the PLOP benchmark, the average main-chain root mean square deviation of the best scored conformations by SOAP-Loop is 1.5 A, close to the average root mean square deviation of the best sampled conformations (1.2 A) and significantly better than that selected by Rosetta (2.1 A), DFIRE (2.3 A), DOPE (2.5 A) and PLOP scoring functions (3.0 A). Our Bayesian framework may also result in more accurate statistical potentials for additional modeling applications, thus affording better leverage of the experimentally determined protein structures.
引用
收藏
页码:3158 / 3166
页数:9
相关论文
共 87 条
  • [1] FireDock: Fast interaction refinement in molecular docking
    Andrusier, Nelly
    Nussinov, Ruth
    Wolfson, Haim J.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 (01) : 139 - 159
  • [2] [Anonymous], 2012, MACHINE LEARNING PRO
  • [3] Audie J, 2012, FUTURE MED CHEM, V4, P1619, DOI [10.4155/FMC.12.99, 10.4155/fmc.12.99]
  • [4] Protein structure prediction and structural genomics
    Baker, D
    Sali, A
    [J]. SCIENCE, 2001, 294 (5540) : 93 - 96
  • [5] QMEAN: A comprehensive scoring function for model quality assessment
    Benkert, Pascal
    Tosatto, Silvio C. E.
    Schomburg, Dietmar
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) : 261 - 277
  • [6] Betancourt MR, 1999, PROTEIN SCI, V8, P361
  • [7] Local propensities and statistical potentials of backbone dihedral angles in proteins
    Betancourt, MR
    Skolnick, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 342 (02) : 635 - 649
  • [8] Application of asymmetric statistical potentials to antibody-protein docking
    Brenke, Ryan
    Hall, David R.
    Chuang, Gwo-Yu
    Comeau, Stephen R.
    Bohnuud, Tanggis
    Beglov, Dmitri
    Schueler-Furman, Ora
    Vajda, Sandor
    Kozakov, Dima
    [J]. BIOINFORMATICS, 2012, 28 (20) : 2608 - 2614
  • [9] Reliability of atomic displacement parameters in protein crystal structures
    Carugo, O
    Argos, P
    [J]. ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1999, 55 : 473 - 478
  • [10] Carugo O, 1997, PROTEIN SCI, V6, P2261