Machine Learning Algorithms for Predicting Protein Folding Rates and Stability of Mutant Proteins: Comparison with Statistical Methods

被引:0
作者
Gromiha, M. Michael [1 ]
Huang, Liang-Tsung [2 ]
机构
[1] Indian Inst Technol, Dept Biotehnol, Madras 600036, Tamil Nadu, India
[2] Mingdao Univ, Dept Biotechnol, Changhua 523, Taiwan
关键词
Protein folding rates; protein stability; structure based parameters; machine learning techniques; AMINO-ACID-SEQUENCE; TRANSITION-STATE PLACEMENT; SECONDARY STRUCTURE; CONTACT ORDER; CONFORMATIONAL STABILITY; THERMODYNAMIC DATABASE; STRUCTURAL CLASS; NEURAL-NETWORKS; 2-STATE; MUTATION;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Machine learning algorithms have wide range of applications in bioinformatics and computational biology such as prediction of protein secondary structures, solvent accessibility, binding site residues in protein complexes, protein folding rates, stability of mutant proteins, and discrimination of proteins based on their structure and function. In this work, we focus on two aspects of predictions: (i) protein folding rates and (ii) stability of proteins upon mutations. We briefly introduce the concepts of protein folding rates and stability along with available databases, features for prediction methods and measures for prediction performance. Subsequently, the development of structure based parameters and their relationship with protein folding rates will be outlined. The structure based parameters are helpful to understand the physical basis for protein folding and stability. Further, basic principles of major machine learning techniques will be mentioned and their applications for predicting protein folding rates and stability of mutant proteins will be illustrated. The machine learning techniques could achieve the highest accuracy of predicting protein folding rates and stability. In essence, statistical methods and machine learning algorithms are complimenting each other for understanding and predicting protein folding rates and the stability of protein mutants. The available online resources on protein folding rates and stability will be listed.
引用
收藏
页码:490 / 502
页数:13
相关论文
共 108 条
  • [1] A survey of proteins encoded by non-synonymous single nucleotide polymorphisms reveals a significant fraction with altered stability and activity
    Allali-Hassani, Abdellah
    Wasney, Gregory A.
    Chau, Irene
    Hong, Bum Soo
    Senisterra, Guillermo
    Loppnau, Peter
    Shi, Zhen
    Moult, John
    Edwards, Aled M.
    Arrowsmith, Cheryl H.
    Park, Hee Won
    Schapira, Matthieu
    Vedadi, Masoud
    [J]. BIOCHEMICAL JOURNAL, 2009, 424 : 15 - 26
  • [2] KineticDB: a database of protein folding kinetics
    Bogatyreva, Natalya S.
    Osypov, Alexander A.
    Ivankov, Dmitry N.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D342 - D346
  • [3] Large-scale prediction of protein geometry and stability changes for arbitrary single point mutations
    Bordner, AJ
    Abagyan, RA
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (02) : 400 - 413
  • [4] Breiman L., 1984, Classification and regression trees. The Wadsworth statistics/probability series, px
  • [5] Amino acid sequence autocorrelation vectors and ensembles of Bayesian-regularized genetic neural networks for prediction of conformational stability of human lysozyme mutants
    Caballero, Julio
    Fernandez, Leyden
    Abreu, Jose Ignacio
    Fernandez, Michael
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (03) : 1255 - 1268
  • [6] K-Fold: a tool for the prediction of the protein folding kinetic order and rate
    Capriotti, E.
    Casadio, R.
    [J]. BIOINFORMATICS, 2007, 23 (03) : 385 - 386
  • [7] Predicting protein stability changes from sequences using support vector machines
    Capriotti, E
    Fariselli, P
    Calabrese, R
    Casadio, R
    [J]. BIOINFORMATICS, 2005, 21 : 54 - 58
  • [8] A neural-network-based method for predicting protein stability changes upon single point mutations
    Capriotti, Emidio
    Fariselli, Piero
    Casadio, Rita
    [J]. BIOINFORMATICS, 2004, 20 : 63 - 68
  • [9] Prediction of protein stability changes for single-site mutations using support vector machines
    Cheng, JL
    Randall, A
    Baldi, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (04) : 1125 - 1132
  • [10] Chou K.C., 2009, OPEN BIOINFORM J, V3, P31