Predicting protein structural class by functional domain composition

被引:171
作者
Chou, KC [1 ]
Cai, YD
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] TIBDD, Tianjin, Peoples R China
[3] UMIST, Biomol Sci Dept, Manchester M60 1QD, Lancs, England
关键词
sequence-order-related feature; function-related feature; less than 20% sequence identity; ISort predictor;
D O I
10.1016/j.bbrc.2004.07.059
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The functional domain composition is introduced to predict the structural class of a protein or domain according to the following classification: all-alpha, all-beta, alpha/beta, alpha + beta, mu (multi-domain), sigma (small protein), and rho (peptide). The advantage by doing so is that both the sequence-order-related features and the function-related features are naturally incorporated in the predictor. As a demonstration, the jackknife cross-validation test was performed on a dataset that consists of proteins and domains with only less than 20% sequence identity to each other in order to get rid of any homologous bias. The overall success rate thus obtained was 98%. In contrast to this, the corresponding rates obtained by the simple geometry approaches based on the amino acid composition were only 36-39%. This indicates that using the functional domain composition to represent the sample of a protein for statistical prediction is very promising.. and that the functional type of a domain is closely correlated with its structural class. (C) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:1007 / 1009
页数:3
相关论文
共 30 条
  • [1] The InterPro database, an integrated documentation resource for protein families, domains and functional sites
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Bateman, A
    Birney, E
    Biswas, M
    Bucher, P
    Cerutti, T
    Corpet, F
    Croning, MDR
    Durbin, R
    Falquet, L
    Fleischmann, W
    Gouzy, J
    Hermjakob, H
    Hulo, N
    Jonassen, I
    Kahn, D
    Kanapin, A
    Karavidopoulou, Y
    Lopez, R
    Marx, B
    Mulder, NJ
    Oinn, TM
    Pagni, M
    Servant, F
    Sigrist, CJA
    Zdobnov, EM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 37 - 40
  • [2] Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.3.CO
  • [3] 2-D
  • [4] Relation between amino acid composition and cellular location of proteins
    Cedano, J
    Aloy, P
    PerezPons, JA
    Querol, E
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) : 594 - 600
  • [5] Chan PH, 2001, STROKE, V32, P189
  • [6] CHANDONIA JM, 1995, PROTEIN SCI, V4, P275
  • [7] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349
  • [8] A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE
    CHOU, KC
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04): : 319 - 344
  • [9] Domain structural class prediction
    Chou, KC
    Maggiora, GM
    [J]. PROTEIN ENGINEERING, 1998, 11 (07): : 523 - 538
  • [10] CHOU KC, 1994, J BIOL CHEM, V269, P22014