A probabilistic model for secondary structure prediction from protein chemical shifts

被引:15
作者
Mechelke, Martin [1 ]
Habeck, Michael [1 ,2 ]
机构
[1] Max Planck Inst Dev Biol, Dept Prot Evolut, D-72076 Tubingen, Germany
[2] Max Planck Inst Intelligent Syst, Dept Empir Inference, D-72076 Tubingen, Germany
关键词
probabilistic modeling; protein chemical shifts; secondary structure prediction; hidden Markov model; nuclear magnetic resonance; secondary structure; NUCLEAR-MAGNETIC-RESONANCE; CONFORMATIONAL-ANALYSIS; IDENTIFICATION; RECOGNITION; STATE;
D O I
10.1002/prot.24249
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein chemical shifts encode detailed structural information that is difficult and computationally costly to describe at a fundamental level. Statistical and machine learning approaches have been used to infer correlations between chemical shifts and secondary structure from experimental chemical shifts. These methods range from simple statistics such as the chemical shift index to complex methods using neural networks. Notwithstanding their higher accuracy, more complex approaches tend to obscure the relationship between secondary structure and chemical shift and often involve many parameters that need to be trained. We present hidden Markov models (HMMs) with Gaussian emission probabilities to model the dependence between protein chemical shifts and secondary structure. The continuous emission probabilities are modeled as conditional probabilities for a given amino acid and secondary structure type. Using these distributions as outputs of first- and second-order HMMs, we achieve a prediction accuracy of 82.3%, which is competitive with existing methods for predicting secondary structure from protein chemical shifts. Incorporation of sequence-based secondary structure prediction into our HMM improves the prediction accuracy to 84.0%. Our findings suggest that an HMM with correlated Gaussian distributions conditioned on the secondary structure provides an adequate generative model of chemical shifts. Proteins 2013; (c) 2012 Wiley Periodicals, Inc.
引用
收藏
页码:984 / 993
页数:10
相关论文
共 35 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]   Determination of Secondary Structure Populations in Disordered States of Proteins Using Nuclear Magnetic Resonance Chemical Shifts [J].
Camilloni, Carlo ;
De Simone, Alfonso ;
Vranken, Wim F. ;
Vendruscolo, Michele .
BIOCHEMISTRY, 2012, 51 (11) :2224-2231
[3]   The use of chemical shifts and their anisotropies in biomolecular structure determination [J].
Case, DA .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1998, 8 (05) :624-630
[4]   Protein structure determination from NMR chemical shifts [J].
Cavalli, Andrea ;
Salvatella, Xavier ;
Dobson, Christopher M. ;
Vendruscolo, Michele .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (23) :9615-9620
[5]   DANGLE: A Bayesian inferential method for predicting protein backbone dihedral angles and secondary structure [J].
Cheung, Ming-Sin ;
Maguire, Mahon L. ;
Stevens, Tim J. ;
Broadhurst, R. William .
JOURNAL OF MAGNETIC RESONANCE, 2010, 202 (02) :223-233
[6]   Protein backbone angle restraints from searching a database for chemical shift and sequence homology [J].
Cornilescu, G ;
Delaglio, F ;
Bax, A .
JOURNAL OF BIOMOLECULAR NMR, 1999, 13 (03) :289-302
[7]  
Donald BR, 2011, COMPUTATIONAL MOL BI
[8]   Protein energetic conformational analysis from NMR chemical shifts (PECAN) and its use in determining secondary structural elements [J].
Eghbalnia, HR ;
Wang, LY ;
Bahrami, A ;
Assadi, A ;
Markley, JL .
JOURNAL OF BIOMOLECULAR NMR, 2005, 32 (01) :71-81
[9]   PDBselect 1992-2009 and PDBfilter-select [J].
Griep, Sven ;
Hobohm, Uwe .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D318-D319
[10]   Accurate and automated classification of protein secondary structure with PsiCSI [J].
Hung, LH ;
Samudrala, R .
PROTEIN SCIENCE, 2003, 12 (02) :288-295