Prediction protein structural classes with pseudo-amino acid composition: Approximate entropy and hydrophobicity pattern

被引:158
作者
Zhang, Tong-Liang [1 ]
Ding, Yong-Sheng [1 ]
Chou, Kuo-Chen [2 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
[2] Gordon Life Sci Inst, San Diego, CA 92130 USA
基金
高等学校博士学科点专项科研基金;
关键词
protein structure classes; pseudo-amino acid composition; approximate entropy; hydrophobicity pattern; fuzzy KNN classifier;
D O I
10.1016/j.jtbi.2007.09.014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Compared with the conventional amino acid (AA) composition, the pseudo-amino acid (PseAA) composition as originally introduced for protein subcellular location prediction can incorporate much more information of a protein sequence, so as to remarkably enhance the power of using a discrete model to predict various attributes of a protein. In this study, based on the concept of PseAA composition, the approximate entropy and hydrophobicity pattern of a protein sequence are used to characterize the PseAA components. Also, the immune genetic algorithm (IGA) is applied to search the optimal weight factors in generating the PseAA composition. Thus, for a given protein sequence sample, a 27-D (dimensional), PseAA composition is generated as its descriptor. The fuzzy K nearest neighbors (FKNN) classifier is adopted as the prediction engine. The results thus obtained in predicting protein structural classification are quite encouraging, indicating that the current approach may also be used to improve the prediction quality of other protein attributes, or at least can play a complimentary role to the existing methods in the relevant areas. Our algorithm is written in Matlab that is available by contacting the corresponding author. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:186 / 193
页数:8
相关论文
共 88 条
[1]  
ARGOS P, 1982, EUR J BIOCHEM, V128, P565
[2]   Prediction of protein structural class with Rough Sets [J].
Cao, YF ;
Liu, S ;
Zhang, LD ;
Qin, J ;
Wang, J ;
Tang, KX .
BMC BIOINFORMATICS, 2006, 7 (1)
[3]   A HEURISTIC APPROACH TO PREDICTING THE TERTIARY STRUCTURE OF BOVINE SOMATOTROPIN [J].
CARLACCI, L ;
CHOU, KC ;
MAGGIORA, GM .
BIOCHEMISTRY, 1991, 30 (18) :4389-4398
[4]  
CHANDONIA JM, 1995, PROTEIN SCI, V4, P275
[5]   Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[6]   Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network [J].
Chen, Chao ;
Zhou, Xibin ;
Tian, Yuanxin ;
Zou, Xiaoyong ;
Cai, Peixiang .
ANALYTICAL BIOCHEMISTRY, 2006, 357 (01) :116-121
[7]   Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition [J].
Chen, Ying-Li ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) :377-381
[8]   Prediction of the subcellular location of apoptosis proteins [J].
Chen, Ying-Li ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 245 (04) :775-783
[9]   ENERGY-OPTIMIZED STRUCTURE OF ANTIFREEZE PROTEIN AND ITS BINDING MECHANISM [J].
CHOU, KC .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 223 (02) :509-517
[10]   Progress in protein structural class prediction and its impact to bioinformatics and proteomics [J].
Chou, KC .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2005, 6 (05) :423-436