Prediction of protein structural class using a complexity-based distance measure

被引:25
作者
Liu, Taigang [2 ]
Zheng, Xiaoqi [3 ]
Wang, Jun [1 ]
机构
[1] Shanghai Normal Univ, Dept Math, Sci Comp Key Lab Shanghai Univ, Shanghai 200234, Peoples R China
[2] Dalian Univ Technol, Dept Appl Math, Dalian 116024, Peoples R China
[3] Dalian Univ Technol, Coll Adv Sci & Technol, Dalian 116024, Peoples R China
关键词
Symbol sequence complexity; Nearest neighbor algorithm; Jackknife cross-validation test; Performance measure; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; SUBCELLULAR LOCATION; SECONDARY STRUCTURE; ALGORITHM; FEATURES; PROGRESS; IMPACT; MODEL;
D O I
10.1007/s00726-009-0276-1
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Knowledge of structural class plays an important role in understanding protein folding patterns. So it is necessary to develop effective and reliable computational methods for prediction of protein structural class. To this end, we present a new method called NN-CDM, a nearest neighbor classifier with a complexity-based distance measure. Instead of extracting features from protein sequences as done previously, distance between each pair of protein sequences is directly evaluated by a complexity measure of symbol sequences. Then the nearest neighbor classifier is adopted as the predictive engine. To verify the performance of this method, jackknife cross-validation tests are performed on several benchmark datasets. Results show that our approach achieves a high prediction accuracy over some classical methods.
引用
收藏
页码:721 / 728
页数:8
相关论文
共 53 条
  • [1] Predicting protein structural class by SVM with class-wise optimized features and decision probabilities
    Anand, Ashish
    Pugalenthi, Ganesan
    Suganthan, P. N.
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2008, 253 (02) : 375 - 380
  • [2] Prediction of protein (domain) structural classes based on amino-acid index
    Bu, WS
    Feng, ZP
    Zhang, ZD
    Zhang, CT
    [J]. EUROPEAN JOURNAL OF BIOCHEMISTRY, 1999, 266 (03): : 1043 - 1049
  • [3] Prediction of protein structural classes by neural network
    Cai, YD
    Zhou, GP
    [J]. BIOCHIMIE, 2000, 82 (08) : 783 - 785
  • [4] Using LogitBoost classifier to predict protein structural classes
    Cai, YD
    Feng, KY
    Lu, WC
    Chou, KC
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 238 (01) : 172 - 176
  • [5] Support vector machines for prediction of protein domain structural class
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2003, 221 (01) : 115 - 120
  • [6] Nearest neighbour algorithm for predicting protein subcellular location by combining functional domain composition and pseudo-amino acid composition
    Cai, YD
    Chou, KC
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 305 (02) : 407 - 411
  • [7] Prediction of protein structural classes by support vector machines
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. COMPUTERS & CHEMISTRY, 2002, 26 (03): : 293 - 296
  • [8] Support Vector Machines for predicting protein structural class
    Cai, Yu-Dong
    Liu, Xiao-Jun
    Xu, Xue-biao
    Zhou, Guo-Ping
    [J]. BMC BIOINFORMATICS, 2001, 2 (1)
  • [9] Prediction of protein structural class with Rough Sets
    Cao, YF
    Liu, S
    Zhang, LD
    Qin, J
    Wang, J
    Tang, KX
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [10] Predicting protein structural class based on multi-features fusion
    Chen, Chao
    Chen, Li-Xuan
    Zou, Xiao-Yong
    Cai, Pei-Xiang
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2008, 253 (02) : 388 - 392