Using support vector machines to identify protein phosphorylation sites in viruses

被引:25
|
作者
Huang, Shu-Yun [1 ]
Shi, Shao-Ping [2 ,3 ]
Qiu, Jian-Ding [1 ]
Liu, Ming-Chu [1 ]
机构
[1] Pingxiang Coll, Dept Chem Engn, Pingxiang 337055, Peoples R China
[2] Nanchang Univ, Dept Chem, Nanchang 330031, Peoples R China
[3] Nanchang Univ, Dept Math, Nanchang 330031, Peoples R China
基金
中国国家自然科学基金;
关键词
Phosphorylation site; Virus proteins; Support vector machine; Encoding scheme based on attribute grouping; Position weight amino acid composition; GROUPED WEIGHT; PREDICTION; SEQUENCE; DATABASE; IDENTIFICATION; FRAMEWORK; MUSITE;
D O I
10.1016/j.jmgm.2014.12.005
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phosphorylation of viral proteins plays important roles in enhancing replication and inhibition of normal host-cell functions. Given its importance in biology, a unique opportunity has arisen to identify viral protein phosphorylation sites. However, experimental methods for identifying phosphorylation sites are resource intensive. Hence, there is significant interest in developing computational methods for reliable prediction of viral phosphorylation sites from amino acid sequences. In this study, a new method based on support vector machine is proposed to identify protein phosphorylation sites in viruses. We apply an encoding scheme based on attribute grouping and position weight amino acid composition to extract physicochemical properties and sequence information of viral proteins around phosphorylation sites. By 10-fold cross-validation, the prediction accuracies for phosphoserine, phosphothreonine and phosphotyrosine with window size of 23 are 88.8%, 95.2% and 97.1%, respectively. Furthermore, compared with the existing methods of Musite and MDD-clustered HMMs, the high sensitivity and accuracy of our presented method demonstrate the predictive effectiveness of the identified phosphorylation sites for viral proteins. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:84 / 90
页数:7
相关论文
共 50 条
  • [41] Prediction of phosphorylation sites based on granular support vector machine
    Cheng, Gong
    Chen, Qingfeng
    Zhang, Ruchang
    GRANULAR COMPUTING, 2021, 6 (01) : 107 - 117
  • [42] Protein solvent accessibility prediction using support vector machines and sequence conservations
    Ogul, Hasan
    Mumcuoglu, Erkan U.
    ARTIFICIAL INTELLIGENCE AND NEURAL NETWORKS, 2006, 3949 : 141 - 148
  • [43] Predicting protein stability changes from sequences using support vector machines
    Capriotti, E
    Fariselli, P
    Calabrese, R
    Casadio, R
    BIOINFORMATICS, 2005, 21 : 54 - 58
  • [44] Support Vector Machines for Protein Family Identification using Surface Invariant Coordinates
    Satpute, Babasaheb
    Yadav, Raghav
    2018 3RD INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [45] Conotoxin protein classification using free scores of words and support vector machines
    Nazar Zaki
    Stefan Wolfsheimer
    Gregory Nuel
    Sawsan Khuri
    BMC Bioinformatics, 12
  • [46] Prediction of protein domains from sequence information using support vector machines
    Zou, Shuxue
    Huang, Yanxin
    Wang, Yan
    Zhou, Chunguang
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 674 - 681
  • [47] Conotoxin protein classification using free scores of words and support vector machines
    Zaki, Nazar
    Wolfsheimer, Stefan
    Nuel, Gregory
    Khuri, Sawsan
    BMC BIOINFORMATICS, 2011, 12
  • [48] Prediction of protein secondary structure using Bayesian method and support vector machines
    Nguyen, MN
    Rajapakse, JC
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 616 - 620
  • [49] Identifying the interacting positions of a protein using Boolean learning and support vector machines
    Dubey, Anshul
    Realff, Matthew J.
    Lee, Jay H.
    Bommarius, Andreas S.
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2006, 30 (04) : 268 - 279
  • [50] Protein secondary structure prediction using genetic neural support vector machines
    Reyaz-Ahmed, Anjum
    Zhang, Yan-Qing
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 1355 - 1359