Sequence-Based Prediction of Protein Folding Rates Using Contacts, Secondary Structures and Support Vector Machines

被引:19
|
作者
Lin, Guan Ning [1 ]
Wang, Zheng [2 ]
Xu, Dong [1 ,2 ]
Cheng, Jianlin [1 ,2 ]
机构
[1] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
[2] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2009年
关键词
folding rate; support vector machine; classifcation; folding type; AMINO-ACID-SEQUENCE; 2-STATE PROTEINS; ORDER; MECHANISMS;
D O I
10.1109/BIBM.2009.21
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting folding rate require the tertiary structure of a protein as an input And most methods do not distinguish the different kinetic natures (two-state folding and multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using features extracted from only protein sequence with support vector machines. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec(-1)) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html.
引用
收藏
页码:3 / +
页数:3
相关论文
共 50 条
  • [21] Secondary structure prediction with support vector machines
    Ward, JJ
    McGuffin, LJ
    Buxton, BF
    Jones, DT
    BIOINFORMATICS, 2003, 19 (13) : 1650 - 1655
  • [22] Prediction of protein-protein interactions using support vector machines
    Dohkan, S
    Koike, A
    Takagi, T
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 576 - 583
  • [23] Prediction of protein solvent accessibility using support vector machines
    Yuan, Z
    Burrage, K
    Mattick, JS
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2002, 48 (03): : 566 - 570
  • [24] Transmembrane protein topology prediction using support vector machines
    Timothy Nugent
    David T Jones
    BMC Bioinformatics, 10
  • [25] Transmembrane protein topology prediction using support vector machines
    Nugent, Timothy
    Jones, David T.
    BMC BIOINFORMATICS, 2009, 10
  • [26] Prediction of protein subcellular locations using support vector machines
    Li, NN
    Niu, XH
    Shi, F
    Li, XY
    ADVANCES IN NATURAL COMPUTATION, PT 1, PROCEEDINGS, 2005, 3610 : 1047 - 1051
  • [27] Prediction of protein structural classes using support vector machines
    X.-D. Sun
    R.-B. Huang
    Amino Acids, 2006, 30 : 469 - 475
  • [28] Prediction of protein structural classes using support vector machines
    Sun, X. -D.
    Huang, R. -B.
    AMINO ACIDS, 2006, 30 (04) : 469 - 475
  • [29] Prediction of Protein Structural Classes based on Secondary Structure Sequence using Improved Support Vector Machine (ISVM)
    Manikandan, P.
    Ramyachitra, D.
    2016 IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ELECTRONICS ENGINEERING (UPCON), 2016, : 525 - 530
  • [30] Sequence-based prediction of protein domains
    Liu, JF
    Rost, B
    NUCLEIC ACIDS RESEARCH, 2004, 32 (12) : 3522 - 3530