Using support vector machines to identify protein phosphorylation sites in viruses

被引:25
|
作者
Huang, Shu-Yun [1 ]
Shi, Shao-Ping [2 ,3 ]
Qiu, Jian-Ding [1 ]
Liu, Ming-Chu [1 ]
机构
[1] Pingxiang Coll, Dept Chem Engn, Pingxiang 337055, Peoples R China
[2] Nanchang Univ, Dept Chem, Nanchang 330031, Peoples R China
[3] Nanchang Univ, Dept Math, Nanchang 330031, Peoples R China
基金
中国国家自然科学基金;
关键词
Phosphorylation site; Virus proteins; Support vector machine; Encoding scheme based on attribute grouping; Position weight amino acid composition; GROUPED WEIGHT; PREDICTION; SEQUENCE; DATABASE; IDENTIFICATION; FRAMEWORK; MUSITE;
D O I
10.1016/j.jmgm.2014.12.005
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phosphorylation of viral proteins plays important roles in enhancing replication and inhibition of normal host-cell functions. Given its importance in biology, a unique opportunity has arisen to identify viral protein phosphorylation sites. However, experimental methods for identifying phosphorylation sites are resource intensive. Hence, there is significant interest in developing computational methods for reliable prediction of viral phosphorylation sites from amino acid sequences. In this study, a new method based on support vector machine is proposed to identify protein phosphorylation sites in viruses. We apply an encoding scheme based on attribute grouping and position weight amino acid composition to extract physicochemical properties and sequence information of viral proteins around phosphorylation sites. By 10-fold cross-validation, the prediction accuracies for phosphoserine, phosphothreonine and phosphotyrosine with window size of 23 are 88.8%, 95.2% and 97.1%, respectively. Furthermore, compared with the existing methods of Musite and MDD-clustered HMMs, the high sensitivity and accuracy of our presented method demonstrate the predictive effectiveness of the identified phosphorylation sites for viral proteins. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:84 / 90
页数:7
相关论文
共 50 条
  • [1] Prediction of Protein Phosphorylation Sites by Support Vector Machines
    Ishino, Tomoki
    Nishikawa, Ikuko
    Fukuchi, Satoshi
    Tohsato, Yukako
    Nishikawa, Ken
    PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 817 - 821
  • [2] Identify and analysis crotonylation sites in histone by using support vector machines
    Qiu, Wang-Ren
    Sun, Bi-Qian
    Tang, Hua
    Huang, Jian
    Lin, Hao
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 83 : 75 - 81
  • [3] Prediction of protein-protein interaction sites using support vector machines
    Minakuchi, Y
    Satou, K
    Konagaya, A
    METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2003, : 22 - 28
  • [4] Prediction of protein-protein interaction sites using support vector machines
    Koike, A
    Takagi, T
    PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (02): : 165 - 173
  • [5] Prediction of protein-glucose binding sites using support vector machines
    Nassif, Houssam
    Al-Ali, Hassan
    Khuri, Sawsan
    Keirouz, Walid
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 (01) : 121 - 132
  • [6] Support vector machines for learning to identify the critical positions of a protein
    Dubey, A
    Realff, MJ
    Lee, JH
    Bommarius, AS
    JOURNAL OF THEORETICAL BIOLOGY, 2005, 234 (03) : 351 - 361
  • [7] Improved prediction of protein-protein binding sites using a support vector machines approach
    Bradford, JR
    Westhead, DR
    BIOINFORMATICS, 2005, 21 (08) : 1487 - 1494
  • [8] Predict Collagen Hydroxyproline Sites Using Support Vector Machines
    Yang, Zheng Rong
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (05) : 691 - 702
  • [9] Support vector machines for predicting HIV protease cleavage sites in protein
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2002, 23 (02) : 267 - 274
  • [10] Support vector machines for prediction of protein signal sequences and their cleavage sites
    Cai, YD
    Lin, SL
    Chou, KC
    PEPTIDES, 2003, 24 (01) : 159 - 161