Protein function prediction using guilty by association from interaction networks

被引:0
作者
Damiano Piovesan
Manuel Giollo
Carlo Ferrari
Silvio C. E. Tosatto
机构
[1] University of Padua,Department of Biomedical Sciences
[2] University of Padua,Department of Information Engineering
[3] CNR Institute of Neuroscience,undefined
来源
Amino Acids | 2015年 / 47卷
关键词
Protein function; Protein interaction network; Gene ontology; CAFA; Protein sequence;
D O I
暂无
中图分类号
学科分类号
摘要
Protein function prediction from sequence using the Gene Ontology (GO) classification is useful in many biological problems. It has recently attracted increasing interest, thanks in part to the Critical Assessment of Function Annotation (CAFA) challenge. In this paper, we introduce Guilty by Association on STRING (GAS), a tool to predict protein function exploiting protein–protein interaction networks without sequence similarity. The assumption is that whenever a protein interacts with other proteins, it is part of the same biological process and located in the same cellular compartment. GAS retrieves interaction partners of a query protein from the STRING database and measures enrichment of the associated functional annotations to generate a sorted list of putative functions. A performance evaluation based on CAFA metrics and a fair comparison with optimized BLAST similarity searches is provided. The consensus of GAS and BLAST is shown to improve overall performance. The PPI approach is shown to outperform similarity searches for biological process and cellular compartment GO predictions. Moreover, an analysis of the best practices to exploit protein–protein interaction networks is also provided.
引用
收藏
页码:2583 / 2592
页数:9
相关论文
共 73 条
  • [1] Altschul S(1990)Basic local alignment search tool J Mol Biol 215 403-410
  • [2] Ashburner M(2000)Gene ontology: tool for the unification of biology. The Gene Ontology Consortium Nat Genet 25 25-29
  • [3] Ball CA(2011)Network medicine: a network-based approach to human disease Nat Rev Genet 12 56-68
  • [4] Blake JA(2003)Functional classification of proteins for the prediction of cellular function from a protein-protein interaction network Genome Biol 5 R6-1630
  • [5] Barabási A-L(2006)Exploiting indirect neighbours and topological weight to predict protein function from protein–protein interactions Bioinforma Oxf Engl 22 1623-2096
  • [6] Gulbahce N(2011)Analysis of protein function and its prediction from amino acid sequence Proteins Struct Funct Bioinforma 79 2086-960
  • [7] Loscalzo J(2013)Protein function prediction by massive integration of evolutionary analyses and multiple data sources BMC Bioinformatics 14 S1-D357
  • [8] Brun C(2003)Prediction of protein function using protein-protein interaction data J Comput Biol J Comput Mol Cell Biol 10 947-D570
  • [9] Chevenet F(2014)RepeatsDB: a database of tandem repeat protein structures Nucleic Acids Res 42 D352-1980
  • [10] Martin D(2011)The UniProt-GO annotation database in 2011 Nucleic Acids Res 40 D565-D815