A Novel Approach of Protein Secondary Structure Prediction by SVM Using PSSM Combined by Sequence Features

被引:0
|
作者
Chen, Yehong [1 ]
Cheng, Jinyong [2 ]
Liu, Yihui [2 ]
Park, Pil Seong [3 ]
机构
[1] Qilu Univ Technol, Sch Graph Commun & Packaging, Jinan, Shandong, Peoples R China
[2] Qilu Univ Technol, Sch Informat, Jinan, Shandong, Peoples R China
[3] Univ Suwon, Dept Comp Sci, Suwon, South Korea
基金
中国国家自然科学基金;
关键词
Protein secondary structure prediction; SVM; Position specific scoring matrices; Sequence feature; Amino acid scale; ProtScale;
D O I
10.1007/978-3-319-56994-9_74
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge of protein secondary structure is a useful step toward prediction of the 3D structure of a particular protein. In this paper, a support vector machine (SVM) based method used for the prediction of secondary structure is introduced in details. Protein sequence data is in a hybrid representation combining the Position-specific Scoring Matrix (PSSM), the Hydrophobicity Sequence Feature (HSF), and the Structural Sequence Feature (SSF). Protein sequences are obtained from CB513 dataset, corresponding PSSM profiles are obtained from PSI-BLAST Program and sequence features are computed based on amino acid scales offered by Expasy website (http://web.expasy.org/protscale/). Basically, PSSM profiles are used as input data to the SVM-PSSM classifier of the secondary structure prediction. Furthermore, to construct more accurate classifiers, more than 40 SFs (sequence features) are examined as accessional input vector to SVM-PSSM classifier for feature selection. The most accurate classifier in this study is constructed using a combination of PSSM and few relevant sequence features. The experimental results show that relevant sequence features extracted from Hydrophobicity index and Structural conformational parameters can improve the SVM-PSSM classifier for the prediction of protein secondary structure elements. Our proposed final SVM-PSSM-SF method achieved an overall accuracy of 78%.
引用
收藏
页码:1074 / 1084
页数:11
相关论文
共 50 条
  • [1] Prediction of Protein Secondary Structure using SVM-PSSM Classifier Combined by Sequence Features
    Chen, Yehong
    Liu, Yihui
    Cheng, Jinyong
    Wang, Yanchun
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 103 - 106
  • [2] Protein Secondary Structure Prediction Based on Physicochemical Features and PSSM by SVM
    Huang, Yin-Fu
    Chen, Shu-Ying
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2013, : 9 - 15
  • [3] Prediction of Protein Secondary Structure using Support Vector Machine with PSSM Profiles
    Wang, Yanchun
    Cheng, Jinyong
    Liu, Yihui
    Chen, Yehong
    2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 502 - 505
  • [4] Combined approach to protein secondary structure prediction
    Amirova, S. R.
    Machavariani, M. A.
    Filatov, L., V
    Milchevsky, Ju. V.
    Esipova, N. G.
    Tumanyan, V. G.
    Proceedings of the Fourth International Conference on Bioinformatics of Genome Regulation and Structure, Vol 1, 2004, : 231 - 234
  • [5] Prediction of RNA binding sites in a protein using SVM and PSSM profile
    Kumar, Manish
    Gromiha, A. Michael
    Raghava, G. P. S.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) : 189 - 194
  • [6] Improving Protein Structural Class Prediction Using Novel Combined Sequence Information and Predicted Secondary Structural Features
    Dai, Qi
    Wu, Li
    Li, Lihua
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (16) : 3393 - 3398
  • [7] EPTool: A New Enhancing PSSM Tool for Protein Secondary Structure Prediction
    Guo, Yuzhi
    Wu, Jiaxiang
    Ma, Hehuan
    Wang, Sheng
    Huang, Junzhou
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (04) : 362 - 364
  • [8] A novel hybrid GMM/SVM architecture for protein secondary structure prediction
    Samani, Emad Bahrami
    Homayounpour, M. Mehdi
    Gu, Hong
    APPLICATIONS OF FUZZY SETS THEORY, 2007, 4578 : 491 - +
  • [9] A novel method for protein secondary structure prediction using dual-layer SVM and profiles
    Guo, J
    Chen, H
    Sun, ZR
    Lin, YL
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (04) : 738 - 743
  • [10] Secondary structure prediction using SVM and clustering
    Doong, SH
    Yeh, CY
    HIS'04: Fourth International Conference on Hybrid Intelligent Systems, Proceedings, 2005, : 297 - 302