Prediction of protein structural class by amino acid and polypeptide composition

Cited by: 131
Authors
Luo, RY
Feng, ZP [1]
Liu, JK
Affiliations
[1] Tianjin Univ, Dept Phys, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Dept Math, Tianjin 300072, Peoples R China
[3] Nankai Univ, LiuHui Ctr Appl Math, Tianjin 300071, Peoples R China
Source
EUROPEAN JOURNAL OF BIOCHEMISTRY | 2002 / Vol. 269 / Issue 17
Keywords
stepwise discriminant analysis; polypeptides; amino acid composition; domain structural class; seeded peptides;
DOI
10.1046/j.1432-1033.2002.03115.x
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
A new approach for predicting the structural classes of protein domain sequences is presented in this paper. Besides the amino acid composition, the compositions of several dipeptides, tripeptides, tetrapeptides, pentapeptides and hexapeptides are taken into account, based on stepwise discriminant analysis. The jackknife test shows that this new approach can lead to higher predictive sensitivity and specificity for datasets with reduced sequence similarity. For the dataset PDB40-B constructed by Brenner and colleagues, 75.2% of protein domain sequences are correctly assigned in the jackknife test to the four structural classes: all-alpha, all-beta, alpha/beta and alpha + beta. This is an improvement of 19.4% in the jackknife test and 25.5% in the resubstitution test over the component-coupled algorithm using amino acid composition alone (AAC approach) on the same dataset. In the cross-validation test with the dataset PDB40-J constructed by Park and colleagues, more than 80% predictive accuracy is obtained. Furthermore, for the dataset constructed by Chou and Maggiora, accuracies of 100% and 99.7% can be easily achieved in the resubstitution test and the jackknife test, respectively, merely by taking the composition of dipeptides into account. Therefore, this new method provides an effective tool to extract valuable information from protein sequences, which can be used for the systematic analysis of small or medium size protein sequences. The computer programs used in this paper are available on request.
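The two ingredients named in the abstract, peptide composition features and the jackknife (leave-one-out) test, can be illustrated with a minimal Python sketch. This is not the authors' program: the stepwise discriminant analysis is replaced here by a plain nearest-centroid rule purely for illustration, only amino acid and dipeptide compositions are used, and the function names (`kmer_composition`, `features`, `nearest_centroid_predict`, `jackknife_accuracy`) and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def kmer_composition(sequence, k=1):
    """Frequency of every possible k-peptide, counted over overlapping windows."""
    seq = sequence.upper()
    windows = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    # Skip windows containing non-standard residue codes (e.g. X).
    valid = [w for w in windows if all(c in AMINO_ACIDS for c in w)]
    counts = Counter(valid)
    total = len(valid)
    keys = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    return {key: (counts[key] / total if total else 0.0) for key in keys}


def features(sequence):
    """Concatenate amino acid (20) and dipeptide (400) compositions."""
    aac = kmer_composition(sequence, 1)
    dpc = kmer_composition(sequence, 2)
    return [aac[a] for a in sorted(aac)] + [dpc[d] for d in sorted(dpc)]


def nearest_centroid_predict(train, query):
    """Assumed stand-in classifier: pick the class whose mean vector is closest."""
    sums, counts = {}, {}
    for vec, label in train:
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    best_label, best_dist = None, float("inf")
    for label, total in sums.items():
        centroid = [s / counts[label] for s in total]
        dist = sum((q - c) ** 2 for q, c in zip(query, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def jackknife_accuracy(dataset):
    """Leave each sample out in turn, train on the rest, and score the prediction."""
    correct = 0
    for i, (vec, label) in enumerate(dataset):
        train = dataset[:i] + dataset[i + 1:]
        if nearest_centroid_predict(train, vec) == label:
            correct += 1
    return correct / len(dataset)


# Toy, made-up sequences and class labels, only to show the call pattern.
toy = [("AAAAAA", "class_1"), ("AAAAAC", "class_1"),
       ("WWWWWW", "class_2"), ("WWWWWY", "class_2")]
toy_dataset = [(features(seq), label) for seq, label in toy]
toy_accuracy = jackknife_accuracy(toy_dataset)
```

In the resubstitution test, by contrast, each sample is predicted by a rule trained on the full dataset including that sample, which is why resubstitution accuracy is generally the more optimistic of the two figures.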
Pages: 4219-4225
Page count: 7