Domain structural class prediction

被引:153
作者
Chou, KC [1 ]
Maggiora, GM [1 ]
机构
[1] Pharmacia & Upjohn Inc, Comp Aided Drug Discovery, Kalamazoo, MI 49007 USA
来源
PROTEIN ENGINEERING | 1998年 / 11卷 / 07期
关键词
component-coupled effect; jack-knifing validation; Mahalanobis discriminant; SCOP database;
D O I
10.1093/protein/11.7.523
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The structural class of a protein domain can be approximately predicted according to its amino acid composition. However, can the prediction quality be improved by taking into account the coupling effect among different amino acid components? This question has evoked much controversy because completely different conclusions have been obtained by different investigators. To resolve such a perplexing problem, predictions by means of various algorithms were performed based on the SCOP database (Murzin et al., 1995), which is more natural and reliable for the study of structural classes because it is based on evolutionary relationships and on the principles that govern their three-dimensional structure. The results obtained using both resubstitution and jackknife tests indicated that the overall rates of correct prediction by an algorithm incorporating the coupling effect among different amino acid components were significantly higher than those by the algorithms that did not include such an effect. A completely consistent conclusion was also obtained when tests were performed on two large independent testing datasets classified into four and seven structural classes, respectively. It is revealed through an analysis that the reasons for reaching the opposite conclusion are mainly due to (1) misclassifying structural classes according to a conceptually incorrect rule, (2) misapplying the component-coupled algorithm by ignoring some important factors and (3) misrepresenting structural classes with statistically insignificant training subsets, Clarification of these problems would be instructive for effectively using the prediction algorithm and correctly interpreting the results.
引用
收藏
页码:523 / 538
页数:16
相关论文
共 28 条
[1]  
ABDA E, 1987, CRYSTALLOGRAPHIC DAT, P107
[2]  
Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.0.CO
[3]  
2-F
[4]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[5]   A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE [J].
CHOU, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04) :319-344
[6]  
CHOU KC, 1994, J BIOL CHEM, V269, P22014
[7]  
CHOU PY, 1989, PREDICTION PROTEIN S, P549, DOI DOI 10.1007/978-1-4613-1571-1_12
[8]   AN ALGORITHM FOR PROTEIN SECONDARY STRUCTURE PREDICTION BASED ON CLASS PREDICTION [J].
DELEAGE, G ;
ROUX, B .
PROTEIN ENGINEERING, 1987, 1 (04) :289-294
[9]  
DUDA RO, PATTERN CLASSIFICATI, P73
[10]  
Eisenhaber F, 1996, PROTEINS, V25, P169, DOI 10.1002/(SICI)1097-0134(199606)25:2<169::AID-PROT3>3.3.CO