Using support vector machines to identify protein phosphorylation sites in viruses

被引:25
|
作者
Huang, Shu-Yun [1 ]
Shi, Shao-Ping [2 ,3 ]
Qiu, Jian-Ding [1 ]
Liu, Ming-Chu [1 ]
机构
[1] Pingxiang Coll, Dept Chem Engn, Pingxiang 337055, Peoples R China
[2] Nanchang Univ, Dept Chem, Nanchang 330031, Peoples R China
[3] Nanchang Univ, Dept Math, Nanchang 330031, Peoples R China
基金
中国国家自然科学基金;
关键词
Phosphorylation site; Virus proteins; Support vector machine; Encoding scheme based on attribute grouping; Position weight amino acid composition; GROUPED WEIGHT; PREDICTION; SEQUENCE; DATABASE; IDENTIFICATION; FRAMEWORK; MUSITE;
D O I
10.1016/j.jmgm.2014.12.005
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phosphorylation of viral proteins plays important roles in enhancing replication and inhibition of normal host-cell functions. Given its importance in biology, a unique opportunity has arisen to identify viral protein phosphorylation sites. However, experimental methods for identifying phosphorylation sites are resource intensive. Hence, there is significant interest in developing computational methods for reliable prediction of viral phosphorylation sites from amino acid sequences. In this study, a new method based on support vector machine is proposed to identify protein phosphorylation sites in viruses. We apply an encoding scheme based on attribute grouping and position weight amino acid composition to extract physicochemical properties and sequence information of viral proteins around phosphorylation sites. By 10-fold cross-validation, the prediction accuracies for phosphoserine, phosphothreonine and phosphotyrosine with window size of 23 are 88.8%, 95.2% and 97.1%, respectively. Furthermore, compared with the existing methods of Musite and MDD-clustered HMMs, the high sensitivity and accuracy of our presented method demonstrate the predictive effectiveness of the identified phosphorylation sites for viral proteins. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:84 / 90
页数:7
相关论文
共 50 条
  • [21] Protein structural class determination using support vector machines
    Isik, Z
    Yanikoglu, B
    Sezerman, U
    COMPUTER AND INFORMATION SCIENCES - ISCIS 2004, PROCEEDINGS, 2004, 3280 : 82 - 89
  • [22] Prediction of protein structural classes using support vector machines
    Sun, X. -D.
    Huang, R. -B.
    AMINO ACIDS, 2006, 30 (04) : 469 - 475
  • [23] Probabilistic Prediction of Protein Phosphorylation Sites Using Classification Relevance Units Machines
    Menor, Mark
    Baek, Kyungim
    Poisson, Guylaine
    APPLIED COMPUTING REVIEW, 2012, 12 (04): : 8 - 20
  • [24] Using support vector machines to identify literacy skills: Evidence from eye movements
    Ya Lou
    Yanping Liu
    Johanna K. Kaakinen
    Xingshan Li
    Behavior Research Methods, 2017, 49 : 887 - 895
  • [25] Using support vector machines to identify literacy skills: Evidence from eye movements
    Lou, Ya
    Liu, Yanping
    Kaakinen, Johanna K.
    Li, Xingshan
    BEHAVIOR RESEARCH METHODS, 2017, 49 (03) : 887 - 895
  • [26] A support vector machine approach to the identification of phosphorylation sites
    Plewczynski, D
    Tkacz, A
    Godzik, A
    Rychlewski, L
    CELLULAR & MOLECULAR BIOLOGY LETTERS, 2005, 10 (01) : 73 - 89
  • [27] Identify catalytic triads of serine hydrolases by support vector machines
    Cai, YD
    Zhou, GP
    Jen, CH
    Lin, SL
    Chou, KC
    JOURNAL OF THEORETICAL BIOLOGY, 2004, 228 (04) : 551 - 557
  • [28] Protein Secondary Structure Prediction Using Support Vector Machines (SVMs)
    Patel, Mayuri
    Shah, Hitesh
    2013 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND RESEARCH ADVANCEMENT (ICMIRA 2013), 2013, : 594 - 598
  • [29] Predicting Protein Subcellular Localization using PsePSSM and Support Vector Machines
    Juan, Eric Y. T.
    Jhang, J. H.
    Li, W. J.
    PROCEEDINGS OF THE 11TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2008,
  • [30] Protein fold recognition using neural networks and support vector machines
    Jiang, N
    Wu, WXY
    Mitchell, I
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2005, PROCEEDINGS, 2005, 3578 : 462 - 469