Two-stage support vector regression approach for predicting accessible surface areas of amino acids

Cited by: 46
Authors
Nguyen, MN
Rajapakse, JC [1]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Bioinformat Res Ctr, Singapore 639798, Singapore
[2] MIT, Biol Engn Div, Cambridge, MA USA
Keywords
protein structure prediction; accessible surface area; solvent accessibility; support vector regression; PSI-BLAST
DOI
10.1002/prot.20883
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
We address the problem of predicting the solvent accessible surface area (ASA) of amino acid residues in protein sequences, without classifying them into buried and exposed types. A two-stage support vector regression (SVR) approach is proposed to predict real values of ASA from the position-specific scoring matrices generated from PSI-BLAST profiles. By adding SVR as the second stage to capture the influences on the ASA value of a residue by those of its neighbors, the two-stage SVR approach achieves improvements of mean absolute errors up to 3.3%, and correlation coefficients of 0.66, 0.68, and 0.67 on the Manesh dataset of 215 proteins, the Barton dataset of 502 nonhomologous proteins, and the Carugo dataset of 338 proteins, respectively, which are better than the scores published earlier on these datasets. A Web server for protein ASA prediction by using a two-stage SVR method has been developed and is available (http://bire.ntu.edu.sg/~pas0186457/asa.html).
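The two-stage data flow described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names, the window half-widths `w1` and `w2`, the zero padding at sequence ends, and the choice to train the second stage on in-sample first-stage predictions are all assumptions made for the example. The regressor is left pluggable; any object with `fit` and `predict` methods will do.

```python
from typing import Any, Callable, List, Sequence

Vector = List[float]


def window_features(rows: Sequence[Sequence[float]], half_width: int) -> List[Vector]:
    """Concatenate each row with its neighbours within +/- half_width.

    Positions that fall outside the sequence are zero-padded (an assumption).
    """
    dim = len(rows[0])
    pad = [0.0] * dim
    features: List[Vector] = []
    for i in range(len(rows)):
        feat: Vector = []
        for j in range(i - half_width, i + half_width + 1):
            feat.extend(rows[j] if 0 <= j < len(rows) else pad)
        features.append(feat)
    return features


def two_stage_fit_predict(
    make_model: Callable[[], Any],
    train_pssm: Sequence[Sequence[float]],
    train_asa: Sequence[float],
    test_pssm: Sequence[Sequence[float]],
    w1: int = 7,  # hypothetical half-width for stage 1
    w2: int = 3,  # hypothetical half-width for stage 2
) -> List[float]:
    """Fit two regressors in sequence and return refined ASA predictions.

    For simplicity this treats the input as a single protein chain; with
    several chains, windows should be built per chain so they do not
    cross chain boundaries.
    """
    # Stage 1: windows of PSSM rows -> ASA value of the central residue.
    x1_train = window_features(train_pssm, w1)
    stage1 = make_model()
    stage1.fit(x1_train, list(train_asa))
    train_s1 = list(stage1.predict(x1_train))
    test_s1 = list(stage1.predict(window_features(test_pssm, w1)))

    # Stage 2: windows of stage-1 predictions -> refined ASA value, so each
    # residue's estimate can draw on the estimates of its neighbours.
    x2_train = window_features([[v] for v in train_s1], w2)
    stage2 = make_model()
    stage2.fit(x2_train, list(train_asa))
    return list(stage2.predict(window_features([[v] for v in test_s1], w2)))
```

If scikit-learn is available, `sklearn.svm.SVR` exposes `fit` and `predict` and could be passed as `make_model`, for example `two_stage_fit_predict(lambda: SVR(kernel="rbf"), ...)`; the kernel and its parameters here are placeholders, since the abstract does not state them.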
Pages: 542-550
Page count: 9