EzyPred: A top-down approach for predicting enzyme functional classes and subclasses

被引:262
作者
Shen, Hong-Bin [1 ]
Chou, Kuo-Chen
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] Shanghai Jiao Tong Univ, Inst Image Processing & Pattern Recognit, Shanghai 200030, Peoples R China
关键词
enzyme functional class; evolution; functional domain; fusion approach; EzyPred web-server;
D O I
10.1016/j.bbrc.2007.09.098
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Given a protein sequence, how can we identify whether it is an enzyme or non-enzyme? If it is, which main functional class it belongs to? What about its sub-functional class? It is important to address these problems because they are closely correlated with the biological function of an uncharacterized protein and its acting object and process. Particularly, with the avalanche of protein sequences generated in the Post Genomic Age and relatively much slower progress in determining their functions by experiments, it is highly desired to develop an automated method by which one can get a fast and accurate answer to these questions. Here, a top-down predictor, called EzyPred, is developed by fusing the results derived from the functional domain and evolution information. EzyPred is a 3-layer predictor: the 1st layer prediction engine is for identifying a query protein as enzyme or non-enzyme; the 2nd layer for the main functional class; and the 3rd layer for the sub-functional class. The overall success rates for all the three layers are higher than 90% that were obtained through rigorous cross-validation tests on the very stringent benchmark datasets in which none of the proteins has >= 40% sequence identity to any other in a same class or subclass. EzyPred is freely accessible at http://chou.med.harvard.edu/bioinf/EzyPred/, by which one can get the desired 3-level results for a query protein sequence within less than 90 s. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:53 / 59
页数:7
相关论文
共 34 条
[1]   The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[2]  
CAO Y, 2006, BMC BIOINFORMATICS, V7
[3]   Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[4]   Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network [J].
Chen, Chao ;
Zhou, Xibin ;
Tian, Yuanxin ;
Zou, Xiaoyong ;
Cai, Peixiang .
ANALYTICAL BIOCHEMISTRY, 2006, 357 (01) :116-121
[5]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[6]   Predicting enzyme family class in a hybridization space [J].
Chou, KC ;
Cai, YD .
PROTEIN SCIENCE, 2004, 13 (11) :2857-2863
[7]   Prediction of enzyme family classes [J].
Chou, KC ;
Elrod, DW .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :183-190
[8]   Using functional domain composition and support vector machines for prediction of protein subcellular location [J].
Chou, KC ;
Cai, YD .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (48) :45765-45769
[9]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[10]   Prediction of protein cellular attributes using pseudo-amino acid composition (vol 43, pg 246, 2001) [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 44 (01) :60-60