Accurate prediction of protein structural classes by incorporating PSSS and PSSM into Chou's general PseAAC

被引:32
作者
Zhang, Shengli [1 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein structural class prediction; Secondary structure sequences; Support vector machine; Transition probability matrix; Pseudo-position specific scoring matrix; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; SECONDARY STRUCTURE PREDICTION; LOW-SIMILARITY SEQUENCES; PSI-BLAST; INFORMATION; FORM; CLASSIFIER; HOMOLOGY; ATTRIBUTES;
D O I
10.1016/j.chemolab.2015.01.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determination of protein structural class using a fast and suitable computational method has become a hot issue in protein science. Prediction of protein structural class for low-similarity sequences remains a challenge problem. In this study, a 111-dimensional feature vector is constructed to predict protein structural classes. Among the 111 features, 100 features based on pseudo-position specific scoring matrix (PsePSSM) are selected to reflect the evolutionary information and the sequence-order information, and the other 11 rational features based on predicted protein secondary structure sequences (PSSS) are designed in the previous works. To evaluate the performance of the proposed method (named by PSSS-PsePSSM), jackknife cross-validation tests are performed on three widely used benchmark datasets: 1189, 25PDB and 640. Our method achieves competitive performance on prediction accuracies, especially for the overall prediction accuracies for datasets 1189, 25PDB and 640, which reach 86.6%, 89.5% and 81.0%, respectively. The PSSS-PsePSSM algorithm also outperforms other existing methods, indicating that our proposed method is a cost-effective computational tool for protein structural class prediction. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:28 / 35
页数:8
相关论文
共 75 条
  • [21] iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites
    Chou, Kuo-Chen
    Wu, Zhi-Cheng
    Xiao, Xuan
    [J]. MOLECULAR BIOSYSTEMS, 2012, 8 (02) : 629 - 641
  • [22] Some remarks on predicting multi-label attributes in molecular biosystems
    Chou, Kuo-Chen
    [J]. MOLECULAR BIOSYSTEMS, 2013, 9 (06) : 1092 - 1100
  • [23] Some remarks on protein attribute prediction and pseudo amino acid composition
    Chou, Kuo-Chen
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2011, 273 (01) : 236 - 247
  • [24] Improving Protein Structural Class Prediction Using Novel Combined Sequence Information and Predicted Secondary Structural Features
    Dai, Qi
    Wu, Li
    Li, Lihua
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (16) : 3393 - 3398
  • [25] A protein structural classes prediction method based on PSI-BLAST profile
    Ding, Shuyan
    Yan, Shoujiang
    Qi, Shuhua
    Li, Yan
    Yao, Yuhua
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2014, 353 : 19 - 23
  • [26] A protein structural classes prediction method based on predicted secondary structure and PSI-BLAST profile
    Ding, Shuyan
    Li, Yan
    Shi, Zhuoxing
    Yan, Shoujiang
    [J]. BIOCHIMIE, 2014, 97 : 60 - 65
  • [27] PseAAC-General: Fast Building Various Modes of General Form of Chou's Pseudo-Amino Acid Composition for Large-Scale Protein Datasets
    Du, Pufeng
    Gu, Shuwang
    Jiao, Yasen
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (03): : 3495 - 3506
  • [28] PseAAC-Builder: A cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions
    Du, Pufeng
    Wang, Xin
    Xu, Chao
    Gao, Yang
    [J]. ANALYTICAL BIOCHEMISTRY, 2012, 425 (02) : 117 - 119
  • [29] Discriminating bioluminescent proteins by incorporating average chemical shift and evolutionary information into the general form of Chou's pseudo amino acid composition
    Fan, Guo-Liang
    Li, Qian-Zhong
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2013, 334 : 45 - 51
  • [30] Georgiou D.N., 2013, Open Bioinform. J, V7, P41, DOI [10.2174/1875036201307010041, DOI 10.2174/1875036201307010041]