Using support vector machines to identify protein phosphorylation sites in viruses

被引:25
|
作者
Huang, Shu-Yun [1 ]
Shi, Shao-Ping [2 ,3 ]
Qiu, Jian-Ding [1 ]
Liu, Ming-Chu [1 ]
机构
[1] Pingxiang Coll, Dept Chem Engn, Pingxiang 337055, Peoples R China
[2] Nanchang Univ, Dept Chem, Nanchang 330031, Peoples R China
[3] Nanchang Univ, Dept Math, Nanchang 330031, Peoples R China
基金
中国国家自然科学基金;
关键词
Phosphorylation site; Virus proteins; Support vector machine; Encoding scheme based on attribute grouping; Position weight amino acid composition; GROUPED WEIGHT; PREDICTION; SEQUENCE; DATABASE; IDENTIFICATION; FRAMEWORK; MUSITE;
D O I
10.1016/j.jmgm.2014.12.005
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phosphorylation of viral proteins plays important roles in enhancing replication and inhibition of normal host-cell functions. Given its importance in biology, a unique opportunity has arisen to identify viral protein phosphorylation sites. However, experimental methods for identifying phosphorylation sites are resource intensive. Hence, there is significant interest in developing computational methods for reliable prediction of viral phosphorylation sites from amino acid sequences. In this study, a new method based on support vector machine is proposed to identify protein phosphorylation sites in viruses. We apply an encoding scheme based on attribute grouping and position weight amino acid composition to extract physicochemical properties and sequence information of viral proteins around phosphorylation sites. By 10-fold cross-validation, the prediction accuracies for phosphoserine, phosphothreonine and phosphotyrosine with window size of 23 are 88.8%, 95.2% and 97.1%, respectively. Furthermore, compared with the existing methods of Musite and MDD-clustered HMMs, the high sensitivity and accuracy of our presented method demonstrate the predictive effectiveness of the identified phosphorylation sites for viral proteins. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:84 / 90
页数:7
相关论文
共 50 条
  • [21] Classify a Protein Domain using Sigmoid Support Vector Machine
    Hassan, Umi Kalsum
    Nawi, Nazri Mohd.
    Kasim, Shahreen
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA), 2014,
  • [22] An improved protein fold recognition with support vector machines
    Chmielnicki, Wieslaw
    Roterman-Konieczna, Irena
    Stapor, Katarzyna
    EXPERT SYSTEMS, 2012, 29 (02) : 200 - 211
  • [23] Prediction of Function Changes Associated With Single-Point Protein Mutations Using Support Vector Machines (SVMs)
    Gao, Shan
    Zhang, Ning
    Duan, Guang You
    Yang, Zhuo
    Ruan, Ji Shou
    Zhang, Tao
    HUMAN MUTATION, 2009, 30 (08) : 1161 - 1166
  • [24] Multi-class protein subcellular localization classification using support vector machines
    Meng, PW
    Rajapakse, JC
    PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 526 - 533
  • [25] A support vector machine approach to the identification of phosphorylation sites
    Plewczynski, D
    Tkacz, A
    Godzik, A
    Rychlewski, L
    CELLULAR & MOLECULAR BIOLOGY LETTERS, 2005, 10 (01) : 73 - 89
  • [26] Predicting and analyzing protein phosphorylation sites in plants using Musite
    Yao, Qiuming
    Gao, Jianjiong
    Bollinger, Curtis
    Thelen, Jay J.
    Xu, Dong
    FRONTIERS IN PLANT SCIENCE, 2012, 3
  • [27] Identify catalytic triads of serine hydrolases by support vector machines
    Cai, YD
    Zhou, GP
    Jen, CH
    Lin, SL
    Chou, KC
    JOURNAL OF THEORETICAL BIOLOGY, 2004, 228 (04) : 551 - 557
  • [28] A New Scheme to Characterize and Identify Protein Ubiquitination Sites
    Van-Nui Nguyen
    Huang, Kai-Yao
    Huang, Chien-Hsun
    Lai, K. Robert
    Lee, Tzong-Yi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (02) : 393 - 403
  • [29] Modelling Concrete Strength Using Support Vector Machines
    Yang, Haiying
    Dong, Yifeng
    CIVIL ENGINEERING, ARCHITECTURE AND SUSTAINABLE INFRASTRUCTURE II, PTS 1 AND 2, 2013, 438-439 : 170 - +
  • [30] Classification of Nucleotide Sequences Using Support Vector Machines
    Seo, Tae-Kun
    JOURNAL OF MOLECULAR EVOLUTION, 2010, 71 (04) : 250 - 267