Prediction of protein relative solvent accessibility with a two-stage SVM approach

Times cited: 52
Authors
Nguyen, MN [1]
Rajapakse, JC [1]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, BioInformat Res Ctr, Singapore 2263, Singapore
Keywords
protein structure prediction; solvent accessibility; support vector machines; PSI-BLAST
DOI
10.1002/prot.20404
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Information on relative solvent accessibility (RSA) of amino acid residues in proteins provides valuable clues to the prediction of protein structure and function. A two-stage approach with support vector machines (SVMs) is proposed, where an SVM predictor is introduced to the output of the single-stage SVM approach to take into account the contextual relationships among solvent accessibilities for the prediction. By using the position-specific scoring matrices (PSSMs) generated by PSI-BLAST, the two-stage SVM approach achieves accuracies up to 90.4% and 90.2% on the Manesh data set of 215 protein structures and the RS126 data set of 126 nonhomologous globular proteins, respectively, which are better than the highest published scores on both data sets to date. A Web server for protein RSA prediction using a two-stage SVM method has been developed and is available (http://birc.ntu.edu.sg/~pas0186457/rsa.html). (C) 2005 Wiley-Liss, Inc.
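The second stage described in the abstract takes the outputs of the first-stage SVM as its input so that neighbouring residues can inform each prediction. A minimal sketch of how such context features might be assembled is given here. The abstract does not state the window size, the padding scheme, or the encoding, so the function name `context_windows` and the parameters `half_width` and `pad` are hypothetical, and no SVM training is shown.

```python
from typing import List, Sequence


def context_windows(
    stage1_scores: Sequence[float],
    half_width: int = 2,   # assumed value; not given in the abstract
    pad: float = 0.0,      # assumed padding for positions past the chain ends
) -> List[List[float]]:
    """Build one feature vector per residue from neighbouring first-stage outputs.

    Each vector holds the first-stage scores of the residue itself and of
    `half_width` residues on either side, padded at the chain termini.
    """
    n = len(stage1_scores)
    windows: List[List[float]] = []
    for i in range(n):
        window = []
        for j in range(i - half_width, i + half_width + 1):
            # Use the real score inside the chain, the pad value outside it.
            window.append(stage1_scores[j] if 0 <= j < n else pad)
        windows.append(window)
    return windows


if __name__ == "__main__":
    # Hypothetical first-stage decision values for a three-residue chain.
    scores = [0.8, -0.3, 0.1]
    for row in context_windows(scores, half_width=1):
        print(row)
```

Under these assumptions, each row would serve as the input vector for one residue when training or applying a second-stage classifier.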
Pages: 30-37
Number of pages: 8