Prediction of Protein Mutational Free Energy: Benchmark and Sampling Improvements Increase Classification Accuracy

被引:61
作者
Frenz, Brandon [1 ]
Lewis, Steven M. [1 ]
King, Indigo [1 ]
DiMaio, Frank [2 ]
Park, Hahnbeom [2 ]
Song, Yifan [1 ]
机构
[1] Cyrus Biotechnol, Seattle, WA 98101 USA
[2] Univ Washington, Dept Biochem, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
mutation; protein; mutation free energy; protein design and engineering; thermodynamics; STABILITY CHANGES; DESIGN;
D O I
10.3389/fbioe.2020.558247
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Software to predict the change in protein stability upon point mutation is a valuable tool for a number of biotechnological and scientific problems. To facilitate the development of such software and provide easy access to the available experimental data, the ProTherm database was created. Biases in the methods and types of information collected has led to disparity in the types of mutations for which experimental data is available. For example, mutations to alanine are hugely overrepresented whereas those involving charged residues, especially from one charged residue to another, are underrepresented. ProTherm subsets created as benchmark sets that do not account for this often underrepresent tense certain mutational types. This issue introduces systematic biases into previously published protocols' ability to accurately predict the change in folding energy on these classes of mutations. To resolve this issue, we have generated a new benchmark set with these problems corrected. We have then used the benchmark set to test a number of improvements to the point mutation energetics tools in the Rosetta software suite.
引用
收藏
页数:8
相关论文
共 21 条
[1]   Predicting free energy changes using structural ensembles [J].
Benedix, Alexander ;
Becker, Caroline M. ;
de Groot, Bert L. ;
Caflisch, Amedeo ;
Boeckmann, Rainer A. .
NATURE METHODS, 2009, 6 (01) :3-4
[2]   I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure [J].
Capriotti, E ;
Fariselli, P ;
Casadio, R .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W306-W310
[3]  
Casadio R, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P81
[4]   A Web Resource for Standardized Benchmark Datasets, Metrics, and Rosetta Protocols for Macromolecular Modeling and Design [J].
Conchuir, Shane O. ;
Barlow, Kyle A. ;
Pache, Roland A. ;
Ollikainen, Noah ;
Kundert, Kale ;
O'Meara, Matthew J. ;
Smith, Colin A. ;
Kortemme, Tanja .
PLOS ONE, 2015, 10 (09)
[5]  
Frenz B., 2020, BIORXIV, DOI [10.1101/2020.03.18.989657, DOI 10.1101/2020.03.18.989657]
[6]   Stability changes upon mutation of solvent-accessible residues in proteins evaluated by database-derived potentials [J].
Gilis, D ;
Rooman, M .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 257 (05) :1112-1126
[7]   Predicting changes in the stability of proteins and protein complexes: A study of more than 1000 mutations [J].
Guerois, R ;
Nielsen, JE ;
Serrano, L .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 320 (02) :369-387
[8]   Structure Based Thermostability Prediction Models for Protein Single Point Mutations with Machine Learning Tools [J].
Jia, Lei ;
Yarlagadda, Ramya ;
Reed, Charles C. .
PLOS ONE, 2015, 10 (09)
[9]   Role of conformational sampling in computing mutation-induced changes in protein structure and stability [J].
Kellogg, Elizabeth H. ;
Leaver-Fay, Andrew ;
Baker, David .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (03) :830-838
[10]   Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm [J].
Kumar, Prateek ;
Henikoff, Steven ;
Ng, Pauline C. .
NATURE PROTOCOLS, 2009, 4 (07) :1073-1082