Prediction of protein subcellular locations by incorporating quasi-sequence-order effect

被引:307
作者
Chou, KC [1 ]
机构
[1] Pharmacia, Comp Aided Drug Discovery, Kalamazoo, MI 49007 USA
关键词
organelles; sequence-order-coupling numbers; amino-acid composition; augmented covariant-discriminant algorithm;
D O I
10.1006/bbrc.2000.3815
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
How to incorporate the sequence order effect is a key and logical step for improving the prediction quality of protein subcellular location, but meanwhile it is a very difficult problem as well. This is because the number of possible sequence order patterns in proteins is extremely large, which has posed a formidable barrier to construct an effective training data set for statistical treatment based on the current knowledge. That is why most of the existing prediction algorithms are operated based on the amino-acid composition alone. In this paper, based on the physicochemical distance between amino acids, a set of sequence-order-coupling numbers was introduced to reflect the sequence order effect, or in a rigorous term, the quasi-sequence-order effect. Furthermore, the covariant discriminant algorithm by Chou and Elrod (Protein Eng. 12, 107-118, 1999) developed recently was augmented to allow the prediction performed by using the input of both the sequence-order-coupling numbers and amino-acid composition. A remarkable improvement was observed in the prediction quality using the augmented covariant discriminant algorithm. The approach described here represents one promising step forward in the efforts of incorporating sequence order effect in protein subcellular location prediction. It is anticipated that the current approach may also have a series of impacts on the prediction of other protein features by statistical approaches. (C) 2000 Academic Press.
引用
收藏
页码:477 / 483
页数:7
相关论文
共 22 条
[11]  
2-Y
[12]   Domain structural class prediction [J].
Chou, KC ;
Maggiora, GM .
PROTEIN ENGINEERING, 1998, 11 (07) :523-538
[13]  
CHOU KC, 2001, IN PRESS PROTEINS ST, V42
[14]   Prediction of Protein Structural Classes and Subcellular Locations [J].
Chou, Kuo-Chen .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2000, 1 (02) :171-208
[15]  
Mahalanobis PC., 1936, P NATL I SCI INDIA, V12, P49, DOI DOI 10.1007/S13171-019-00164-5
[16]  
MARDIA KV, 1979, MULTIVARIATE ANAL, P322
[17]   DISCRIMINATION OF INTRACELLULAR AND EXTRACELLULAR PROTEINS USING AMINO-ACID-COMPOSITION AND RESIDUE-PAIR FREQUENCIES [J].
NAKASHIMA, H ;
NISHIKAWA, K .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 238 (01) :54-61
[18]  
PILLAI KCS, 1985, ENCY STATISTICAL SCI, V5, P176
[19]   Using neural networks for prediction of the subcellular location of proteins [J].
Reinhardt, A ;
Hubbard, T .
NUCLEIC ACIDS RESEARCH, 1998, 26 (09) :2230-2236
[20]   THE RATIONAL DESIGN OF AMINO-ACID-SEQUENCES BY ARTIFICIAL NEURAL NETWORKS AND SIMULATED MOLECULAR EVOLUTION - DE-NOVO DESIGN OF AN IDEALIZED LEADER PEPTIDASE CLEAVAGE SITE [J].
SCHNEIDER, G ;
WREDE, P .
BIOPHYSICAL JOURNAL, 1994, 66 (02) :335-344