Using support vector machines to identify protein phosphorylation sites in viruses

Cited by: 25
Authors
Huang, Shu-Yun [1]
Shi, Shao-Ping [2, 3]
Qiu, Jian-Ding [1]
Liu, Ming-Chu [1]
Affiliations
[1] Pingxiang Coll, Dept Chem Engn, Pingxiang 337055, Peoples R China
[2] Nanchang Univ, Dept Chem, Nanchang 330031, Peoples R China
[3] Nanchang Univ, Dept Math, Nanchang 330031, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Phosphorylation site; Virus proteins; Support vector machine; Encoding scheme based on attribute grouping; Position weight amino acid composition; GROUPED WEIGHT; PREDICTION; SEQUENCE; DATABASE; IDENTIFICATION; FRAMEWORK; MUSITE;
DOI
10.1016/j.jmgm.2014.12.005
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Phosphorylation of viral proteins plays important roles in enhancing viral replication and inhibiting normal host-cell functions. Given its biological importance, a unique opportunity has arisen to identify viral protein phosphorylation sites. However, experimental methods for identifying phosphorylation sites are resource intensive. Hence, there is significant interest in developing computational methods for reliable prediction of viral phosphorylation sites from amino acid sequences. In this study, a new method based on a support vector machine is proposed to identify protein phosphorylation sites in viruses. We apply an encoding scheme based on attribute grouping and position weight amino acid composition to extract physicochemical properties and sequence information of viral proteins around phosphorylation sites. Under 10-fold cross-validation, the prediction accuracies for phosphoserine, phosphothreonine and phosphotyrosine with a window size of 23 are 88.8%, 95.2% and 97.1%, respectively. Furthermore, in comparison with the existing methods Musite and MDD-clustered HMMs, the high sensitivity and accuracy of the presented method demonstrate its effectiveness in predicting phosphorylation sites in viral proteins. (C) 2014 Elsevier Inc. All rights reserved.
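The window-based encoding described in the abstract can be illustrated with a minimal Python sketch. It extracts a 23-residue window centred on each serine, threonine or tyrosine and computes one position-weighted composition value per amino acid. The abstract does not state the weighting formula, so the weight used here (offset plus |offset| divided by the half-window length) is an assumption, as are the padding symbol `X` and the function names `extract_windows` and `position_weighted_composition`. The attribute-grouping part of the encoding is not shown.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAD = "X"  # hypothetical padding symbol for positions beyond the sequence ends


def extract_windows(sequence, targets="STY", window=23):
    """Return (position, residue, window_string) for each candidate site."""
    half = window // 2
    padded = PAD * half + sequence + PAD * half
    sites = []
    for i, residue in enumerate(sequence):
        if residue in targets:
            # Residue i sits at padded index i + half, so its window starts at i.
            sites.append((i, residue, padded[i:i + window]))
    return sites


def position_weighted_composition(window_seq):
    """Return 20 values, one per amino acid, weighted by offset from the centre.

    Assumed weighting (not taken from the source): each occurrence at offset j
    contributes j + |j| / half, and the sum is divided by half * (half + 1).
    Padding symbols contribute nothing.
    """
    half = len(window_seq) // 2
    norm = half * (half + 1)
    features = []
    for aa in AMINO_ACIDS:
        total = 0.0
        for offset in range(-half, half + 1):
            if window_seq[offset + half] == aa:
                total += offset + abs(offset) / half
        features.append(total / norm)
    return features


if __name__ == "__main__":
    # Toy sequence, not real viral protein data.
    for pos, res, win in extract_windows("MKSAT", window=5):
        print(pos, res, win, position_weighted_composition(win))
```

Feature vectors of this kind could then be passed to any support vector machine implementation and evaluated with 10-fold cross-validation; that step is omitted here because the abstract does not specify the kernel or its parameters.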
Pages: 84-90
Page count: 7