Protein fold recognition with support vector machines fusion network

被引:0
作者
Shi, JY [1 ]
Pan, Q
Zhang, SW
Liang, Y
机构
[1] Northwestern Polytech Univ, Coll Automat, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Coll Life Sci, Xian 710072, Peoples R China
关键词
protein fold recognition; support vector machines (SVM); classifier fusion; dynamic selection;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
One of the important approaches to structure analysis is protein fold recognition, which is often applied when there is no significant sequence similarity between structurally similar proteins. A framework with a three-layer support vector machines fusion network (SFN) is presented. The framework is applied to 27-class protein fold recognition from primary structure of proteins. SFN uses support vector machines as member classifiers, and adopts All-Versus-All as multi-class categorization. Six groups of features are divided into major and minor ones by SFN, and. several diversity fusion schemes are correspondingly built. The final decision is made by dynamic selection of the results of all fusion schemes. When it is still difficult to know what kind of fusion of feature groups can achieve good prediction, SFN is a dependable solution by selecting the optimal fusion of feature groups automatically, which can ensure the best recognition. Overall recognition system achieves 61.04% fold prediction accuracy on the independent test dataset. The results and the comparison with other approaches demonstrate the effectiveness of SFN, and thus encourage its further exploration.
引用
收藏
页码:155 / 162
页数:8
相关论文
共 22 条
  • [1] Support vector machines for prediction of protein subcellular location by incorporating quasi-sequence-order effect
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2002, 84 (02) : 343 - 348
  • [2] Chinnasamy Arunkumar, 2005, Journal of Bioinformatics and Computational Biology, V3, P803, DOI 10.1142/S0219720005001302
  • [3] Crammer Koby., 2000, Proceedings of the Thirteenth Annual Conference on Computa- tional Learning Theory, COLT '00, P35
  • [4] Multi-class protein fold recognition using support vector machines and neural networks
    Ding, CHQ
    Dubchak, I
    [J]. BIOINFORMATICS, 2001, 17 (04) : 349 - 358
  • [5] Dubchak I, 1999, PROTEINS, V35, P401, DOI 10.1002/(SICI)1097-0134(19990601)35:4<401::AID-PROT3>3.3.CO
  • [6] 2-B
  • [7] A comparison of methods for multiclass support vector machines
    Hsu, CW
    Lin, CJ
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02): : 415 - 425
  • [8] Support vector machine approach for protein subcellular localization prediction
    Hua, SJ
    Sun, ZR
    [J]. BIOINFORMATICS, 2001, 17 (08) : 721 - 728
  • [9] Jaakkola T, 1999, Proc Int Conf Intell Syst Mol Biol, P149
  • [10] Joachims T, 1999, ADVANCES IN KERNEL METHODS, P169