Prediction of Protein Phosphorylation Sites by Using the Composition of k-Spaced Amino Acid Pairs

被引:50
作者
Zhao, Xiaowei [1 ]
Zhang, Wenyi [1 ]
Xu, Xin [1 ]
Ma, Zhiqiang [1 ]
Yin, Minghao [1 ]
机构
[1] NE Normal Univ, Coll Comp Sci & Informat Technol, Changchun, Peoples R China
来源
PLOS ONE | 2012年 / 7卷 / 10期
基金
中国国家自然科学基金;
关键词
POSTTRANSLATIONAL MODIFICATIONS; INTRINSIC DISORDER; P38; MAPK; SEQUENCE; DATABASE; UBIQUITYLATION; GLYCOSYLATION; LOCALIZATION; SERVER; TOOL;
D O I
10.1371/journal.pone.0046302
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As one of the most widespread protein post-translational modifications, phosphorylation is involved in many biological processes such as cell cycle, apoptosis. Identification of phosphorylated substrates and their corresponding sites will facilitate the understanding of the molecular mechanism of phosphorylation. Comparing with the labor-intensive and time-consuming experiment approaches, computational prediction of phosphorylation sites is much desirable due to their convenience and fast speed. In this paper, a new bioinformatics tool named CKSAAP_PhSite was developed that ignored the kinase information and only used the primary sequence information to predict protein phosphorylation sites. The highlight of CKSAAP_PhSite was to utilize the composition of k-spaced amino acid pairs as the encoding scheme, and then the support vector machine was used as the predictor. The performance of CKSAAP_PhSite was measured with a sensitivity of 84.81%, a specificity of 86.07% and an accuracy of 85.43% for serine, a sensitivity of 78.59%, a specificity of 82.26% and an accuracy of 80.31% for threonine as well as a sensitivity of 74.44%, a specificity of 78.03% and an accuracy of 76.21% for tyrosine. Experimental results obtained from cross validation and independent benchmark suggested that our method was very promising to predict phosphorylation sites and can be served as a useful supplement tool to the community. For public access, CKSAAP_PhSite is available at http://59.73.198.144/cksaap_phsite/.
引用
收藏
页数:8
相关论文
共 48 条
  • [1] Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information
    Biswas, Ashis Kumer
    Noman, Nasimul
    Sikder, Abdur Rahman
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [2] Sequence and structure-based prediction of eukaryotic protein phosphorylation sites
    Blom, N
    Gammeltoft, S
    Brunak, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1999, 294 (05) : 1351 - 1362
  • [3] Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence
    Blom, N
    Sicheritz-Pontén, T
    Gupta, R
    Gammeltoft, S
    Brunak, S
    [J]. PROTEOMICS, 2004, 4 (06) : 1633 - 1649
  • [4] Phosphopeptide fragmentation and analysis by mass spectrometry
    Boersema, Paul J.
    Mohammed, Shabaz
    Heck, Albert J. R.
    [J]. JOURNAL OF MASS SPECTROMETRY, 2009, 44 (06): : 861 - 878
  • [5] Insulin receptor substrate 1 regulates the cellular differentiation and the matrix metallopeptidase expression of preosteoblastic cells
    Bu, Yan-Hong
    He, Yu-Ling
    Zhou, Hou-De
    Liu, Wei
    Peng, Dan
    Tang, Ai-Guo
    Tang, Ling-Li
    Xie, Hui
    Huang, Qiu-Xia
    Luo, Xiang-Hang
    Liao, Er-Yuan
    [J]. JOURNAL OF ENDOCRINOLOGY, 2010, 206 (03) : 271 - 277
  • [6] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [7] Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs
    Chen, Ke
    Kurgan, Lukasz A.
    Ruan, Jishou
    [J]. BMC STRUCTURAL BIOLOGY, 2007, 7
  • [8] Prediction of protein crystallization using collocation of amino acid pairs
    Chen, Ke
    Kurgan, Lukasz
    Rahbari, Mandana
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2007, 355 (03) : 764 - 769
  • [9] Prediction of Integral Membrane Protein Type by Collocated Hydrophobic Amino Acid Pair
    Chen, Ke
    Jiang, Yingfu
    Du, Li
    Kurgan, Lukasz
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (01) : 163 - 172
  • [10] Prediction of mucin-type O-glycosylation sites in mammalian proteins using the composition of k-spaced amino acid pairs
    Chen, Yong-Zi
    Tang, Yu-Rong
    Sheng, Zhi-Ya
    Zhang, Ziding
    [J]. BMC BIOINFORMATICS, 2008, 9 (1) : 101