iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory

被引:95
作者
Qiu, Wang-Ren [1 ,2 ,3 ]
Sun, Bi-Qian [1 ]
Xiao, Xuan [1 ,4 ]
Xu, Dong [2 ,3 ]
Chou, Kuo-Chen [4 ,5 ]
机构
[1] Jingdezhen Ceram Inst, Comp Dept, Jingdezhen 333403, Peoples R China
[2] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
[3] Univ Missouri, Bond Life Sci Ctr, Columbia, MO 65211 USA
[4] Gordon Life Sci Inst, Boston, MA 02478 USA
[5] King Abdulaziz Univ, CEGMR, Jeddah 21589, Saudi Arabia
关键词
Disease-related phosphorylation; Evolutionary information; Grey system model; Chou's general PseAAC; Random forest classifiers; Fusion ensemble classifier; AMINO-ACID-COMPOSITION; LABEL LEARNING CLASSIFIER; SEQUENCE-BASED PREDICTOR; SUBCELLULAR-LOCALIZATION; ENSEMBLE CLASSIFIER; WEB-SERVER; K-TUPLE; PHYSICOCHEMICAL PROPERTIES; N-6-METHYLADENOSINE SITES; FUSION CLASSIFIER;
D O I
10.1002/minf.201600010
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Protein phosphorylation plays a critical role in human body by altering the structural conformation of a protein, causing it to become activated/deactivated, or functional modification. Given an uncharacterized protein sequence, can we predict whether it may be phosphorylated or may not? This is no doubt a very meaningful problem for both basic research and drug development. Unfortunately, to our best knowledge, so far no high throughput bioinformatics tool whatsoever has been developed to address such a very basic but important problem due to its extremely complexity and lacking sufficient training data. Here we proposed a predictor called iPhos-PseEvo by (1) incorporating the protein sequence evolutionary information into the general pseudo amino acid composition (PseAAC) via the grey system theory, (2) balancing out the skewed training datasets by the asymmetric bootstrap approach, and (3) constructing an ensemble predictor by fusing an array of individual random forest classifiers thru a voting system. Rigorous jackknife tests have indicated that very promising success rates have been achieved by iPhosPseEvo even for such a difficult problem. A user-friendly web-server for iPhos-PseEvo has been established at http://www.jci-bioinfo.cn/iPhos-PseEvo, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. It has not escaped our notice that the formulation and approach presented here can be used to analyze many other problems in protein science as well.
引用
收藏
页数:9
相关论文
共 120 条
[1]   Protein phosphorylation detection using dual-mode field-effect devices and nanoplasmonic sensors [J].
Bhalla, Nikhil ;
Di Lorenzo, Mirella ;
Pula, Giordano ;
Estrela, Pedro .
SCIENTIFIC REPORTS, 2015, 5
[2]   Protein phosphorylation analysis based on proton release detection: Potential tools for drug discovery [J].
Bhalla, Nikhil ;
Di Lorenzo, Mirella ;
Pula, Giordano ;
Estrela, Pedro .
BIOSENSORS & BIOELECTRONICS, 2014, 54 :109-114
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
BURNETT G, 1954, J BIOL CHEM, V211, P969
[5]   How phosphorylation activates the protein phosphatase-1 • inhibitor-2 complex [J].
Cannon, John F. .
BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2013, 1834 (01) :71-86
[6]   propy: a tool to generate various modes of Chou's PseAAC [J].
Cao, Dong-Sheng ;
Xu, Qing-Song ;
Liang, Yi-Zeng .
BIOINFORMATICS, 2013, 29 (07) :960-962
[7]   Prediction of linear B-cell epitopes using amino acid pair antigenicity scale [J].
Chen, J. ;
Liu, H. ;
Yang, J. ;
Chou, K.-C. .
AMINO ACIDS, 2007, 33 (03) :423-428
[8]   IACP: a sequence-based tool for identifying anticancer peptides [J].
Chen, Wei ;
Ding, Hui ;
Feng, Pengmian ;
Lin, Hao ;
Chou, Kuo-Chen .
ONCOTARGET, 2016, 7 (13) :16895-16909
[9]   Using deformation energy to analyze nucleosome positioning in genomes [J].
Chen, Wei ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
GENOMICS, 2016, 107 (2-3) :69-75
[10]   iRNA-Methyl: Identifying N6-methyladenosine sites using pseudo nucleotide composition [J].
Chen, Wei ;
Feng, Pengmian ;
Ding, Hui ;
Lin, Hao ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2015, 490 :26-33