PRED-PPI: A server for predicting protein-protein interactions based on sequence data with probability assignment

被引:45
作者
Guo Y. [1 ]
Li M. [1 ]
Pu X. [1 ]
Li G. [1 ]
Guang X. [1 ]
Xiong W. [1 ]
Li J. [1 ]
机构
[1] College of Chemistry, Sichuan University
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Support Vector Machine; Probability Threshold; Probability Assignment; Query Protein; Interaction Probability;
D O I
10.1186/1756-0500-3-145
中图分类号
学科分类号
摘要
Background: Protein-protein interactions (PPIs) are crucial for almost all cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. Given the importance of PPIs, several methods have been developed to detect them. Since the experimental methods are time-consuming and expensive, developing computational methods for effectively identifying PPIs is of great practical significance. Findings: Most previous methods were developed for predicting PPIs in only one species, and do not account for probability estimations. In this work, a relatively comprehensive prediction system was developed, based on a support vector machine (SVM), for predicting PPIs in five organisms, specifically humans, yeast, Drosophila, Escherichia coli, and Caenorhabditis elegans. This PPI predictor includes the probability of its prediction in the output, so it can be used to assess the confidence of each SVM prediction by the probability assignment. Using a probability of 0.5 as the threshold for assigning class labels, the method had an average accuracy for detecting protein interactions of 90.67% for humans, 88.99% for yeast, 90.09% for Drosophila, 92.73% for E. coli, and 97.51% for C. elegans. Moreover, among the correctly predicted pairs, more than 80% were predicted with a high probability of 0.8, indicating that this tool could predict novel PPIs with high confidence. Conclusions. Based on this work, a web-based system, Pred-PPI, was constructed for predicting PPIs from the five organisms. Users can predict novel PPIs and obtain a probability value about the prediction using this tool. Pred-PPI is freely available at http://cic.scu.edu.cn/bioinformatics/predict-ppi/default.html. © 2010 Li et al; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 13 条
[1]  
Han J.D., Dupuy D., Bertin N., Cusick M.E., Vidal M., Effect of sampling on topology predictions of protein-protein interaction networks, Nat Biotechnol, 23, pp. 839-844, (2005)
[2]  
Juan D., Pazos F., Valencia A., High-confidence prediction of global interactomes based on genome-wide coevolutionary networks, Proc Natl Acad Sci USA, 105, pp. 934-939, (2008)
[3]  
Singhal M., Resat H., A domain-based approach to predict protein-protein interactions, BMC Bioinformatics, 8, (2007)
[4]  
Burger L., Van Nimwegen E., Accurate prediction of protein-protein interactions from sequence alignments using a Bayesian method, Mol Syst Biol, 4, (2008)
[5]  
Chou K.C., Cai Y.D., Predicting protein-protein interactions from sequences in a hybridization space, J Proteome Res, 5, pp. 316-322, (2006)
[6]  
Shen J.W., Zhang J., Luo X.M., Zhu W.L., Yu K.Q., Chen K.X., Li Y.X., Jiang H.L., Predicting protein-protein interactions based only on sequences information, Proc Natl Acad Sci USA, 104, pp. 4337-4341, (2007)
[7]  
Guo Y.Z., Yu L.Z., Wen Z.N., Li M.L., Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences, Nucleic Acids Res, 36, pp. 3025-3030, (2008)
[8]  
Peri S., Navarro J.D., Kristiansen T.Z., Amanchy R., Surendranath V., Muthusamy B., Gandhi T.K., Chandrika K.N., Deshpande N., Suresh S., Et al., Human protein reference database as a discovery resource for proteomics, Nucleic Acids Res, 32, (2004)
[9]  
Xenarios I., Salwinski L., Duan X.J., Higney P., Kim S.M., Eisenberg D., DIP: The database of interacting proteins. A research tool for studying cellular networks of protein interactions, Nucleic Acids Res, 30, pp. 303-305, (2002)
[10]  
LIBSVM-A Library for Support Vector Machines