Determining protein structures by combining semireliable data with atomistic physical models by Bayesian inference

被引:117
作者
MacCallum, Justin L. [1 ]
Perez, Alberto [2 ]
Dill, Ken A. [2 ,3 ,4 ]
机构
[1] Univ Calgary, Dept Chem, Calgary, AB T2N 1N4, Canada
[2] SUNY Stony Brook, Laufer Ctr Phys & Quantitat Biol, Stony Brook, NY 11794 USA
[3] SUNY Stony Brook, Dept Chem, Stony Brook, NY 11794 USA
[4] SUNY Stony Brook, Dept Phys, Stony Brook, NY 11794 USA
关键词
protein structure; molecular modeling; integrative structural biology; Bayesian inference; NMR STRUCTURE CALCULATION; STRUCTURE PREDICTION; FOLDING REACTIONS; FORCE-FIELD; DYNAMICS; SEQUENCE; SIMULATIONS; PERFORMANCE; RESTRAINTS; PACKAGE;
D O I
10.1073/pnas.1506788112
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
More than 100,000 protein structures are now known at atomic detail. However, far more are not yet known, particularly among large or complex proteins. Often, experimental information is only semireliable because it is uncertain, limited, or confusing in important ways. Some experiments give sparse information, some give ambiguous or nonspecific information, and others give uncertain information-where some is right, some is wrong, but we don't know which. We describe a method called Modeling Employing Limited Data (MELD) that can harness such problematic information in a physics-based, Bayesian framework for improved structure determination. We apply MELD to eight proteins of known structure for which such problematic structural data are available, including a sparse NMR dataset, two ambiguous EPR datasets, and four uncertain datasets taken from sequence evolution data. MELD gives excellent structures, indicating its promise for experimental biomolecule structure determination where only semireliable data are available.
引用
收藏
页码:6985 / 6990
页数:6
相关论文
共 41 条
[1]   De novo high-resolution protein structure determination from sparse spin-labeling EPR data [J].
Alexander, Nathan ;
Bortolus, Marco ;
Al-Mestarihi, Ahmad ;
Mchaourab, Hassane ;
Meilerl, Jens .
STRUCTURE, 2008, 16 (02) :181-195
[2]   Optimization of the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone φ, ψ and Side-Chain χ1 and χ2 Dihedral Angles [J].
Best, Robert B. ;
Zhu, Xiao ;
Shim, Jihyun ;
Lopes, Pedro E. M. ;
Mittal, Jeetain ;
Feig, Michael ;
MacKerell, Alexander D., Jr. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2012, 8 (09) :3257-3273
[3]   Version 1.2 of the Crystallography and NMR system [J].
Brunger, Axel T. .
NATURE PROTOCOLS, 2007, 2 (11) :2728-2733
[4]  
Case D.A, 2015, AMBER
[5]   From Levinthal to pathways to funnels [J].
Dill, KA ;
Chan, HS .
NATURE STRUCTURAL BIOLOGY, 1997, 4 (01) :10-19
[6]   The Protein-Folding Problem, 50 Years On [J].
Dill, Ken A. ;
MacCallum, Justin L. .
SCIENCE, 2012, 338 (6110) :1042-1046
[7]   HADDOCK: A protein-protein docking approach based on biochemical or biophysical information [J].
Dominguez, C ;
Boelens, R ;
Bonvin, AMJJ .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2003, 125 (07) :1731-1737
[8]   OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation [J].
Eastman, Peter ;
Friedrichs, Mark S. ;
Chodera, John D. ;
Radmer, Randall J. ;
Bruns, Christopher M. ;
Ku, Joy P. ;
Beauchamp, Kyle A. ;
Lane, Thomas J. ;
Wang, Lee-Ping ;
Shukla, Diwakar ;
Tye, Tony ;
Houston, Mike ;
Stich, Timo ;
Klein, Christoph ;
Shirts, Michael R. ;
Pande, Vijay S. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2013, 9 (01) :461-469
[9]  
Eswar N., 2006, COMP PROTEIN STRUCTU, p5.6.1
[10]   Modeling Intrinsically Disordered Proteins with Bayesian Statistics [J].
Fisher, Charles K. ;
Huang, Austin ;
Stultz, Collin M. .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2010, 132 (42) :14919-14927