INPS: predicting the impact of non-synonymous variations on protein stability from sequence

Cited by: 112
Authors
Fariselli, Piero [1, 2]
Martelli, Pier Luigi [1]
Savojardo, Castrense [1]
Casadio, Rita [1]
Affiliations
[1] Univ Bologna, Dept Biol, Biocomput Grp, I-40126 Bologna, Italy
[2] Univ Bologna, Dept Comp Sci & Engn, I-40127 Bologna, Italy
Keywords
AMINO-ACID SUBSTITUTION; MUTATIONS; BIOINFORMATICS; POTENTIALS; NETWORKS; MUTANTS; SERVER;
DOI
10.1093/bioinformatics/btv291
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: A tool for reliably predicting the impact of variations on protein stability is extremely important both for protein engineering and for understanding the effects of Mendelian and somatic mutations in the genome. Next Generation Sequencing studies are constantly increasing the number of known protein sequences. Given the huge disproportion between the number of protein sequences and the number of protein structures, there is a need for tools that can annotate the effect of mutations starting from the protein sequence, without relying on the structure. Here, we describe INPS, a novel approach for annotating the effect of non-synonymous mutations on the stability of a protein from its sequence. INPS is based on SVM regression and is trained to predict the thermodynamic free energy change upon single-point variations in protein sequences.

Results: We show that INPS performs similarly to the state-of-the-art methods based on protein structure when tested in cross-validation on a non-redundant dataset. INPS also performs very well on a newly generated dataset consisting of a number of variations occurring in the tumor suppressor protein p53. Our results suggest that INPS is a tool suited for computing the effect of non-synonymous polymorphisms on protein stability when the protein structure is not available. We also show that INPS predictions are complementary to those of the state-of-the-art, structure-based method mCSM. When the two methods are combined, the overall prediction on the p53 set scores significantly higher than those of the single methods.
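The complementarity claim at the end of the abstract can be illustrated with a minimal sketch. The abstract does not state how the INPS and mCSM predictions are combined or which score is used, so two assumptions are made here: the combination is a plain per-variant average, and the score is the Pearson correlation with the measured free energy changes. The function names `pearson` and `combine` are hypothetical, and the numbers are toy values invented for illustration, not data from the paper.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def combine(pred_a, pred_b):
    """Average two predictors variant by variant (assumed combination rule)."""
    return [(a + b) / 2 for a, b in zip(pred_a, pred_b)]


# Toy values, invented for illustration only (not taken from the paper).
measured = [-1.0, 0.0, 1.0, 2.0]      # hypothetical measured free energy changes
seq_pred = [-1.0, 1.0, 0.0, 2.0]      # hypothetical sequence-based predictions
struct_pred = [-1.0, -1.0, 2.0, 2.0]  # hypothetical structure-based predictions

combined = combine(seq_pred, struct_pred)
print("sequence-based :", pearson(measured, seq_pred))
print("structure-based:", pearson(measured, struct_pred))
print("combined       :", pearson(measured, combined))
```

In this toy case the two predictors err in opposite directions on the same variants, so averaging cancels part of the error. That is the sense in which two methods can be complementary; whether and how strongly this holds for real predictors depends on the data.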
Pages: 2816-2821
Number of pages: 6