Quokka: a comprehensive tool for rapid and accurate prediction of kinase family-specific phosphorylation sites in the human proteome

Cited by: 137
Authors
Li, Fuyi [1, 2]
Li, Chen [1, 2, 3]
Marquez-Lago, Tatiana T. [4]
Leier, Andre [4]
Akutsu, Tatsuya [5]
Purcell, Anthony W. [1, 2]
Smith, A. Ian [1, 2, 6]
Lithgow, Trevor [1, 7]
Daly, Roger J. [1, 2]
Song, Jiangning [1, 2, 8]
Chou, Kuo-Chen [9]
Affiliations
[1] Monash Univ, Biomed Discovery Inst, Clayton, Vic 3800, Australia
[2] Monash Univ, Dept Biochem & Mol Biol, Clayton, Vic 3800, Australia
[3] Swiss Fed Inst Technol, Inst Mol Syst Biol, Dept Biol, CH-8093 Zurich, Switzerland
[4] Univ Alabama Birmingham, Sch Med, Dept Genet, Birmingham, AL 35294 USA
[5] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Kyoto 6110011, Japan
[6] Monash Univ, ARC Ctr Excellence Adv Mol Imaging, Melbourne, Vic 3800, Australia
[7] Monash Univ, Dept Microbiol, Clayton, Vic 3800, Australia
[8] Monash Univ, Monash Ctr Data Sci, Clayton, Vic 3800, Australia
[9] Gordon Life Sci Inst, Boston, MA 02478 USA
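Funding
National Institutes of Health (USA); Medical Research Council (UK); National Health and Medical Research Council (Australia); Australian Research Council
Keywords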
DNA-DAMAGE RESPONSE; CLASS-I; CELL-DIVISION; SEQUENCE; GLYCOSYLATION; PROMOTERS; PROTEINS; FEATURES; KINOME;
DOI
10.1093/bioinformatics/bty522
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Kinase-regulated phosphorylation is a ubiquitous type of post-translational modification (PTM) in both eukaryotic and prokaryotic cells. Phosphorylation plays fundamental roles in many signalling pathways and biological processes, such as protein degradation and protein-protein interactions. Experimental studies have revealed that signalling defects caused by aberrant phosphorylation are highly associated with a variety of human diseases, especially cancers. In light of this, a number of computational methods that aim to accurately predict protein kinase family-specific or kinase-specific phosphorylation sites have been established, thereby facilitating phosphoproteomic data analysis.
Results: In this work, we present Quokka, a novel bioinformatics tool that allows users to rapidly and accurately identify human kinase family-regulated phosphorylation sites. Quokka was developed using a variety of sequence scoring functions combined with an optimized logistic regression algorithm. We evaluated Quokka on well-prepared, up-to-date benchmark and independent test datasets, curated from the Phospho.ELM and UniProt databases, respectively. The independent test demonstrates that Quokka improves prediction performance compared with state-of-the-art computational tools for phosphorylation prediction. In summary, our tool provides users with high-quality predicted human phosphorylation sites for hypothesis generation and biological validation.
Pages: 4223-4231
Number of pages: 9
Related papers
56 in total
  • [1] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [2] [Anonymous], 2017, SCI REP-UK, DOI 10.1038/s41598-017-07199-4
  • [3] [Anonymous], 2019, BRIEF BIOINFORM, DOI 10.1093/bib/bby028
  • [4] Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence
    Blom, N
    Sicheritz-Pontén, T
    Gupta, R
    Gammeltoft, S
    Brunak, S
    [J]. PROTEOMICS, 2004, 4 (06) : 1633 - 1649
  • [5] Phosphopeptide fragmentation and analysis by mass spectrometry
    Boersema, Paul J.
    Mohammed, Shabaz
    Heck, Albert J. R.
    [J]. JOURNAL OF MASS SPECTROMETRY, 2009, 44 (06) : 861 - 878
  • [6] PHOSPHORYLATION OF CLASS-I BUT NOT CLASS-II MHC MOLECULES BY MEMBRANE-LOCALIZED PROTEIN KINASE-C
    BURKE, T
    POLLOK, K
    CUSHLEY, W
    SNOW, EC
    [J]. MOLECULAR IMMUNOLOGY, 1989, 26 (12) : 1095 - 1104
  • [7] Prediction of linear B-cell epitopes using amino acid pair antigenicity scale
    Chen, J.
    Liu, H.
    Yang, J.
    Chou, K.-C.
    [J]. AMINO ACIDS, 2007, 33 (03) : 423 - 428
  • [8] iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
    Chen, Wei
    Feng, Peng-Mian
    Lin, Hao
    Chou, Kuo-Chen
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (06) : e68
  • [9] iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences
    Chen, Zhen
    Zhao, Pei
    Li, Fuyi
    Leier, Andre
    Marquez-Lago, Tatiana T.
    Wang, Yanan
    Webb, Geoffrey I.
    Smith, A. Ian
    Daly, Roger J.
    Chou, Kuo-Chen
    Song, Jiangning
    [J]. BIOINFORMATICS, 2018, 34 (14) : 2499 - 2502
  • [10] Prediction of signal peptides using scaled window
    Chou, KC
    [J]. PEPTIDES, 2001, 22 (12) : 1973 - 1979