Prediction of protein structural class by amino acid and polypeptide composition

Cited by: 131
Authors
Luo, RY
Feng, ZP [1]
Liu, JK
Affiliations
[1] Tianjin Univ, Dept Phys, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Dept Math, Tianjin 300072, Peoples R China
[3] Nankai Univ, LiuHui Ctr Appl Math, Tianjin 300071, Peoples R China
Source
EUROPEAN JOURNAL OF BIOCHEMISTRY | 2002 / Vol. 269 / Issue 17
Keywords
stepwise discriminant analysis; polypeptides; amino acid composition; domain structural class; seeded peptides
DOI
10.1046/j.1432-1033.2002.03115.x
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
A new approach for predicting the structural classes of protein domain sequences is presented in this paper. Besides the amino acid composition, the compositions of several dipeptides, tripeptides, tetrapeptides, pentapeptides and hexapeptides are taken into account, selected by stepwise discriminant analysis. The jackknife test results show that this new approach can lead to higher predictive sensitivity and specificity for datasets with reduced sequence similarity. For the dataset PDB40-B constructed by Brenner and colleagues, 75.2% of protein domain sequences are correctly assigned in the jackknife test to the four structural classes: all-alpha, all-beta, alpha/beta and alpha + beta. This is an improvement of 19.4% in the jackknife test and 25.5% in the resubstitution test over the component-coupled algorithm using amino acid composition alone (AAC approach) on the same dataset. In the cross-validation test with the dataset PDB40-J constructed by Park and colleagues, a predictive accuracy of more than 80% is obtained. Furthermore, for the dataset constructed by Chou and Maggiora, accuracies of 100% and 99.7% can be easily achieved in the resubstitution test and the jackknife test, respectively, merely by taking the composition of dipeptides into account. Therefore, this new method provides an effective tool for extracting valuable information from protein sequences, which can be used for the systematic analysis of small or medium-sized protein sequences. The computer programs used in this paper are available on request.
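To make the two ingredients named in the abstract concrete, the following is a minimal sketch of (a) amino acid and dipeptide composition vectors and (b) a jackknife (leave-one-out) test. It is not the authors' program: the stepwise discriminant analysis is not reproduced, and a simple nearest-centroid rule is swapped in as a stand-in classifier. All function names (`kmer_composition`, `features`, `nearest_centroid`, `jackknife_accuracy`) and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def kmer_composition(seq, k=1):
    """Frequencies of all 20**k peptides of length k, in a fixed order."""
    seq = seq.upper()
    windows = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    # Windows containing non-standard residues are skipped.
    counts = Counter(w for w in windows if all(c in AMINO_ACIDS for c in w))
    total = sum(counts.values())
    keys = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    return [counts[key] / total if total else 0.0 for key in keys]


def features(seq):
    """Amino acid (20) plus dipeptide (400) composition, concatenated."""
    return kmer_composition(seq, 1) + kmer_composition(seq, 2)


def nearest_centroid(train, query):
    """Stand-in classifier (an assumption, not the paper's method):
    return the label whose mean feature vector is closest to the query."""
    sums, sizes = {}, Counter()
    for vec, label in train:
        sizes[label] += 1
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value

    def squared_distance(label):
        return sum((s / sizes[label] - q) ** 2
                   for s, q in zip(sums[label], query))

    return min(sums, key=squared_distance)


def jackknife_accuracy(samples):
    """Leave each sample out in turn, train on the rest, predict the one left out."""
    hits = 0
    for i, (vec, label) in enumerate(samples):
        rest = samples[:i] + samples[i + 1:]
        hits += nearest_centroid(rest, vec) == label
    return hits / len(samples)


# Toy data for illustration only; these are not real protein domains.
toy = [(features(s), c) for s, c in [
    ("AAAA", "class1"), ("AAAC", "class1"),
    ("WWWW", "class2"), ("WWWY", "class2"),
]]
accuracy = jackknife_accuracy(toy)
```

In a resubstitution test, by contrast, each sample would be predicted by a model trained on the full dataset including that sample, which is why resubstitution accuracies are typically the more optimistic of the two.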
Pages: 4219-4225
Number of pages: 7