Quokka: a comprehensive tool for rapid and accurate prediction of kinase family-specific phosphorylation sites in the human proteome

被引:141
作者
Li, Fuyi [1 ,2 ]
Li, Chen [1 ,2 ,3 ]
Marquez-Lago, Tatiana T. [4 ]
Leier, Andre [4 ]
Akutsu, Tatsuya [5 ]
Purcell, Anthony W. [1 ,2 ]
Smith, A. Ian [1 ,2 ,6 ]
Lithgow, Trevor [1 ,7 ]
Daly, Roger J. [1 ,2 ]
Song, Jiangning [1 ,2 ,8 ]
Chou, Kuo-Chen [9 ]
机构
[1] Monash Univ, Biomed Discovery Inst, Clayton, Vic 3800, Australia
[2] Monash Univ, Dept Biochem & Mol Biol, Clayton, Vic 3800, Australia
[3] Swiss Fed Inst Technol, Inst Mol Syst Biol, Dept Biol, CH-8093 Zurich, Switzerland
[4] Univ Alabama Birmingham, Sch Med, Dept Genet, Birmingham, AL 35294 USA
[5] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Kyoto 6110011, Japan
[6] Monash Univ, ARC Ctr Excellence Adv Mol Imaging, Melbourne, Vic 3800, Australia
[7] Monash Univ, Dept Microbiol, Clayton, Vic 3800, Australia
[8] Monash Univ, Monash Ctr Data Sci, Clayton, Vic 3800, Australia
[9] Gordon Life Sci Inst, Boston, MA 02478 USA
基金
英国医学研究理事会; 澳大利亚研究理事会; 美国国家卫生研究院; 澳大利亚国家健康与医学研究理事会;
关键词
DNA-DAMAGE RESPONSE; CLASS-I; CELL-DIVISION; SEQUENCE; GLYCOSYLATION; PROMOTERS; PROTEINS; FEATURES; KINOME;
D O I
10.1093/bioinformatics/bty522
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Kinase-regulated phosphorylation is a ubiquitous type of post-translational modification (PTM) in both eukaryotic and prokaryotic cells. Phosphorylation plays fundamental roles in many signalling pathways and biological processes, such as protein degradation and protein-protein interactions. Experimental studies have revealed that signalling defects caused by aberrant phosphorylation are highly associated with a variety of human diseases, especially cancers. In light of this, a number of computational methods aiming to accurately predict protein kinase family-specific or kinase-specific phosphorylation sites have been established, thereby facilitating phosphoproteomic data analysis. Results: In this work, we present Quokka, a novel bioinformatics tool that allows users to rapidly and accurately identify human kinase family-regulated phosphorylation sites. Quokka was developed by using a variety of sequence scoring functions combined with an optimized logistic regression algorithm. We evaluated Quokka based on well-prepared up-to-date benchmark and independent test datasets, curated from the Phospho. ELM and UniProt databases, respectively. The independent test demonstrates that Quokka improves the prediction performance compared with state-of-the-art computational tools for phosphorylation prediction. In summary, our tool provides users with high-quality predicted human phosphorylation sites for hypothesis generation and biological validation.
引用
收藏
页码:4223 / 4231
页数:9
相关论文
共 56 条
[1]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[2]  
[Anonymous], 2017, SCI REP-UK, DOI DOI 10.1038/s41598-017-07199-4
[3]  
[Anonymous], 2019, BRIEF BIOINFORM, DOI DOI 10.1093/bib/bby028
[4]   Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence [J].
Blom, N ;
Sicheritz-Pontén, T ;
Gupta, R ;
Gammeltoft, S ;
Brunak, S .
PROTEOMICS, 2004, 4 (06) :1633-1649
[5]   Phosphopeptide fragmentation and analysis by mass spectrometry [J].
Boersema, Paul J. ;
Mohammed, Shabaz ;
Heck, Albert J. R. .
JOURNAL OF MASS SPECTROMETRY, 2009, 44 (06) :861-878
[6]   PHOSPHORYLATION OF CLASS-I BUT NOT CLASS-II MHC MOLECULES BY MEMBRANE-LOCALIZED PROTEIN KINASE-C [J].
BURKE, T ;
POLLOK, K ;
CUSHLEY, W ;
SNOW, EC .
MOLECULAR IMMUNOLOGY, 1989, 26 (12) :1095-1104
[7]   Prediction of linear B-cell epitopes using amino acid pair antigenicity scale [J].
Chen, J. ;
Liu, H. ;
Yang, J. ;
Chou, K.-C. .
AMINO ACIDS, 2007, 33 (03) :423-428
[8]   iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e68
[9]   iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Leier, Andre ;
Marquez-Lago, Tatiana T. ;
Wang, Yanan ;
Webb, Geoffrey I. ;
Smith, A. Ian ;
Daly, Roger J. ;
Chou, Kuo-Chen ;
Song, Jiangning .
BIOINFORMATICS, 2018, 34 (14) :2499-2502
[10]   Prediction of signal peptides using scaled window [J].
Chou, KC .
PEPTIDES, 2001, 22 (12) :1973-1979