Prediction of protein subcellular locations by incorporating quasi-sequence-order effect

被引:295
作者
Chou, KC [1 ]
机构
[1] Pharmacia, Comp Aided Drug Discovery, Kalamazoo, MI 49007 USA
关键词
organelles; sequence-order-coupling numbers; amino-acid composition; augmented covariant-discriminant algorithm;
D O I
10.1006/bbrc.2000.3815
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
How to incorporate the sequence order effect is a key and logical step for improving the prediction quality of protein subcellular location, but meanwhile it is a very difficult problem as well. This is because the number of possible sequence order patterns in proteins is extremely large, which has posed a formidable barrier to construct an effective training data set for statistical treatment based on the current knowledge. That is why most of the existing prediction algorithms are operated based on the amino-acid composition alone. In this paper, based on the physicochemical distance between amino acids, a set of sequence-order-coupling numbers was introduced to reflect the sequence order effect, or in a rigorous term, the quasi-sequence-order effect. Furthermore, the covariant discriminant algorithm by Chou and Elrod (Protein Eng. 12, 107-118, 1999) developed recently was augmented to allow the prediction performed by using the input of both the sequence-order-coupling numbers and amino-acid composition. A remarkable improvement was observed in the prediction quality using the augmented covariant discriminant algorithm. The approach described here represents one promising step forward in the efforts of incorporating sequence order effect in protein subcellular location prediction. It is anticipated that the current approach may also have a series of impacts on the prediction of other protein features by statistical approaches. (C) 2000 Academic Press.
引用
收藏
页码:477 / 483
页数:7
相关论文
共 22 条
  • [1] Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.0.CO
  • [2] 2-F
  • [3] The SWISS-PROT protein sequence data bank and its supplement TrEMBL
    Bairoch, A
    Apweller, R
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (01) : 31 - 36
  • [4] Prediction of protein structural classes by neural network
    Cai, YD
    Zhou, GP
    [J]. BIOCHIMIE, 2000, 82 (08) : 783 - 785
  • [5] Relation between amino acid composition and cellular location of proteins
    Cedano, J
    Aloy, P
    PerezPons, JA
    Querol, E
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) : 594 - 600
  • [6] Using discriminant function for prediction of subcellular location of prokaryotic proteins
    Chou, KC
    Elrod, DW
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 1998, 252 (01) : 63 - 68
  • [7] Protein subcellular location prediction
    Chou, KC
    Elrod, DW
    [J]. PROTEIN ENGINEERING, 1999, 12 (02): : 107 - 118
  • [8] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349
  • [9] A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE
    CHOU, KC
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04): : 319 - 344
  • [10] Chou KC, 1998, PROTEINS, V31, P97, DOI 10.1002/(SICI)1097-0134(19980401)31:1<97::AID-PROT8>3.3.CO