Sequence-Based Prediction of Protein Folding Rates Using Contacts, Secondary Structures and Support Vector Machines

被引:19
|
作者
Lin, Guan Ning [1 ]
Wang, Zheng [2 ]
Xu, Dong [1 ,2 ]
Cheng, Jianlin [1 ,2 ]
机构
[1] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
[2] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2009年
关键词
folding rate; support vector machine; classifcation; folding type; AMINO-ACID-SEQUENCE; 2-STATE PROTEINS; ORDER; MECHANISMS;
D O I
10.1109/BIBM.2009.21
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting folding rate require the tertiary structure of a protein as an input And most methods do not distinguish the different kinetic natures (two-state folding and multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using features extracted from only protein sequence with support vector machines. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec(-1)) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html.
引用
收藏
页码:3 / +
页数:3
相关论文
共 50 条
  • [1] Sequence-Based Prediction of Protein-Carbohydrate Binding Sites Using Support Vector Machines
    Taherzadeh, Ghazaleh
    Zhou, Yaoqi
    Liew, Alan Wee-Chung
    Yang, Yuedong
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (10) : 2115 - 2122
  • [2] SeqRate: sequence-based protein folding type classification and rates prediction
    Lin, Guan Ning
    Wang, Zheng
    Xu, Dong
    Cheng, Jianlin
    BMC BIOINFORMATICS, 2010, 11
  • [3] SeqRate: sequence-based protein folding type classification and rates prediction
    Guan Ning Lin
    Zheng Wang
    Dong Xu
    Jianlin Cheng
    BMC Bioinformatics, 11
  • [4] Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine
    Taherzadeh, Ghazaleh
    Yang, Yuedong
    Zhang, Tuo
    Liew, Alan Wee-Chung
    Zhou, Yaoqi
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (13) : 1223 - 1229
  • [5] Sequence-based protein-protein interaction prediction via support vector machine
    Yongcui Wang
    Jiguang Wang
    Zhixia Yang
    Naiyang Deng
    Journal of Systems Science and Complexity, 2010, 23 : 1012 - 1023
  • [6] Protein Secondary Structure Prediction Using Support Vector Machines (SVMs)
    Patel, Mayuri
    Shah, Hitesh
    2013 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND RESEARCH ADVANCEMENT (ICMIRA 2013), 2013, : 594 - 598
  • [7] Sequence-based protein-protein interaction prediction via support vector machine
    Wang, Yongcui
    Wang, Jiguang
    Yang, Zhixia
    Deng, Naiyang
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2010, 23 (05) : 1012 - 1023
  • [8] Protein solvent accessibility prediction using support vector machines and sequence conservations
    Ogul, Hasan
    Mumcuoglu, Erkan U.
    ARTIFICIAL INTELLIGENCE AND NEURAL NETWORKS, 2006, 3949 : 141 - 148
  • [9] Prediction of protein domains from sequence information using support vector machines
    Zou, Shuxue
    Huang, Yanxin
    Wang, Yan
    Zhou, Chunguang
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 674 - 681
  • [10] Prediction of protein secondary structure using Bayesian method and support vector machines
    Nguyen, MN
    Rajapakse, JC
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 616 - 620