A novel feature representation method based on Chou's pseudo amino acid composition for protein structural class prediction

被引:199
作者
Sahu, Sitanshu Sekhar [1 ]
Panda, Ganapati [2 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun Engn, Rourkela, Orissa, India
[2] Natl Inst Technol, Sch Elect Sci, Rourkela, Orissa, India
关键词
AAC; AmPseAAC; DCT; RBFNN; Protein domain; Structural class; SUPPORT VECTOR MACHINES; SUBCELLULAR LOCATION; SECONDARY STRUCTURE; NEURAL-NETWORKS;
D O I
10.1016/j.compbiolchem.2010.09.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
During last few decades accurate determination of protein structural class using a fast and suitable computational method has been a challenging problem in protein science. In this context a meaningful representation of a protein sample plays a key role in achieving higher prediction accuracy. In this paper based on the concept of Chou's pseudo amino acid composition (Chou. K.C., 2001. Proteins 43, 246-255), a new feature representation method is introduced which is composed of the amino acid composition information, the amphiphilic correlation factors and the spectral characteristics of the protein. Thus the sample of a protein is represented by a set of discrete components which incorporate both the sequence order and the length effect. On the basis of such a statistical framework a simple radial basis function network based classifier is introduced to predict protein structural class. A set of exhaustive simulation studies demonstrates high success rate of classification using the self-consistency and jackknife test on the benchmark datasets. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:320 / 327
页数:8
相关论文
共 62 条
[1]   DISCRETE COSINE TRANSFORM [J].
AHMED, N ;
NATARAJAN, T ;
RAO, KR .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) :90-93
[2]  
[Anonymous], NAT SCI
[3]   Prediction of protein structural classes by neural network [J].
Cai, YD ;
Zhou, GP .
BIOCHIMIE, 2000, 82 (08) :783-785
[4]   Using neural networks for prediction of domain structural classes [J].
Cai, YD ;
Li, YX ;
Chou, KC .
BIOCHIMICA ET BIOPHYSICA ACTA-PROTEIN STRUCTURE AND MOLECULAR ENZYMOLOGY, 2000, 1476 (01) :1-2
[5]   Support Vector Machines for predicting protein structural class [J].
Cai, Yu-Dong ;
Liu, Xiao-Jun ;
Xu, Xue-biao ;
Zhou, Guo-Ping .
BMC BIOINFORMATICS, 2001, 2 (1)
[6]   Prediction of protein structural class with Rough Sets [J].
Cao, YF ;
Liu, S ;
Zhang, LD ;
Qin, J ;
Wang, J ;
Tang, KX .
BMC BIOINFORMATICS, 2006, 7 (1)
[7]  
CHANDONIA JM, 1995, PROTEIN SCI, V4, P275
[8]  
CHAO C, 2006, ANAL BIOCHEM, V357, P116
[9]   Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine [J].
Chen, Chao ;
Chen, Lixuan ;
Zou, Xiaoyong ;
Cai, Peixiang .
PROTEIN AND PEPTIDE LETTERS, 2009, 16 (01) :27-31
[10]   Prediction of protein structural class using novel evolutionary collocation-based sequence representation [J].
Chen, Ke ;
Kurgan, Lukasz A. ;
Ruan, Jishou .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (10) :1596-1604