A novel feature representation method based on Chou's pseudo amino acid composition for protein structural class prediction

被引:198
作者
Sahu, Sitanshu Sekhar [1 ]
Panda, Ganapati [2 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun Engn, Rourkela, Orissa, India
[2] Natl Inst Technol, Sch Elect Sci, Rourkela, Orissa, India
关键词
AAC; AmPseAAC; DCT; RBFNN; Protein domain; Structural class; SUPPORT VECTOR MACHINES; SUBCELLULAR LOCATION; SECONDARY STRUCTURE; NEURAL-NETWORKS;
D O I
10.1016/j.compbiolchem.2010.09.002
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
During last few decades accurate determination of protein structural class using a fast and suitable computational method has been a challenging problem in protein science. In this context a meaningful representation of a protein sample plays a key role in achieving higher prediction accuracy. In this paper based on the concept of Chou's pseudo amino acid composition (Chou. K.C., 2001. Proteins 43, 246-255), a new feature representation method is introduced which is composed of the amino acid composition information, the amphiphilic correlation factors and the spectral characteristics of the protein. Thus the sample of a protein is represented by a set of discrete components which incorporate both the sequence order and the length effect. On the basis of such a statistical framework a simple radial basis function network based classifier is introduced to predict protein structural class. A set of exhaustive simulation studies demonstrates high success rate of classification using the self-consistency and jackknife test on the benchmark datasets. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:320 / 327
页数:8
相关论文
共 62 条
  • [1] DISCRETE COSINE TRANSFORM
    AHMED, N
    NATARAJAN, T
    RAO, KR
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) : 90 - 93
  • [2] [Anonymous], NAT SCI
  • [3] Prediction of protein structural classes by neural network
    Cai, YD
    Zhou, GP
    [J]. BIOCHIMIE, 2000, 82 (08) : 783 - 785
  • [4] Using neural networks for prediction of domain structural classes
    Cai, YD
    Li, YX
    Chou, KC
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEIN STRUCTURE AND MOLECULAR ENZYMOLOGY, 2000, 1476 (01): : 1 - 2
  • [5] Support Vector Machines for predicting protein structural class
    Cai, Yu-Dong
    Liu, Xiao-Jun
    Xu, Xue-biao
    Zhou, Guo-Ping
    [J]. BMC BIOINFORMATICS, 2001, 2 (1)
  • [6] Prediction of protein structural class with Rough Sets
    Cao, YF
    Liu, S
    Zhang, LD
    Qin, J
    Wang, J
    Tang, KX
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [7] CHANDONIA JM, 1995, PROTEIN SCI, V4, P275
  • [8] CHAO C, 2006, ANAL BIOCHEM, V357, P116
  • [9] Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine
    Chen, Chao
    Chen, Lixuan
    Zou, Xiaoyong
    Cai, Peixiang
    [J]. PROTEIN AND PEPTIDE LETTERS, 2009, 16 (01) : 27 - 31
  • [10] Prediction of protein structural class using novel evolutionary collocation-based sequence representation
    Chen, Ke
    Kurgan, Lukasz A.
    Ruan, Jishou
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (10) : 1596 - 1604