Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM

被引:15
作者
Liang, Yunyun [1 ]
Liu, Sanyang [1 ]
Zhang, Shengli [1 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID-COMPOSITION; PRINCIPAL COMPONENT ANALYSIS; SUPPORT VECTOR MACHINE; PSI-BLAST; ACCURATE PREDICTION; FOLD RECOGNITION; REPRESENTATION; LOCALIZATION; HOMOLOGY;
D O I
10.1155/2015/370756
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM). Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS), segmented PsePSSM, and segmented autocovariance transformation (ACT) based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640) are adopted in this paper. Then a 700-dimensional (700D) feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA). To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves the favorable and competitive performance. This will offer an important complementary to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.
引用
收藏
页数:9
相关论文
共 52 条
  • [11] Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
  • [12] Predicting protein structural class by functional domain composition
    Chou, KC
    Cai, YD
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 321 (04) : 1007 - 1009
  • [13] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349
  • [14] A key driving force in determination of protein structural classes
    Chou, KC
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1999, 264 (01) : 216 - 224
  • [15] Recent progress in protein subcellular location prediction
    Chou, Kuo-Chen
    Shen, Hong-Bin
    [J]. ANALYTICAL BIOCHEMISTRY, 2007, 370 (01) : 1 - 16
  • [16] Comparison study on statistical features of predicted secondary structures for protein structural class prediction: From content to position
    Dai, Qi
    Li, Yan
    Liu, Xiaoqing
    Yao, Yuhua
    Cao, Yunjie
    He, Pingan
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [17] Dehzangi Abdollah, 2013, Pattern Recognition in Bioinformatics. 8th IAPR International Conference, PRIB 2013. Proceedings: LNCS 7986, P208, DOI 10.1007/978-3-642-39159-0_19
  • [18] A protein structural classes prediction method based on PSI-BLAST profile
    Ding, Shuyan
    Yan, Shoujiang
    Qi, Shuhua
    Li, Yan
    Yao, Yuhua
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2014, 353 : 19 - 23
  • [19] A protein structural classes prediction method based on predicted secondary structure and PSI-BLAST profile
    Ding, Shuyan
    Li, Yan
    Shi, Zhuoxing
    Yan, Shoujiang
    [J]. BIOCHIMIE, 2014, 97 : 60 - 65
  • [20] A new taxonomy-based protein fold recognition approach based on autocross-covariance transformation
    Dong, Qiwen
    Zhou, Shuigeng
    Guan, Jihong
    [J]. BIOINFORMATICS, 2009, 25 (20) : 2655 - 2662