GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function

Cited by: 699
Authors
Mostafavi, Sara [1]
Ray, Debajyoti [2]
Warde-Farley, David [1]
Grouios, Chris [3]
Morris, Quaid [1, 3, 4]
Affiliations
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
[2] Gatsby Computat Neurosci Unit, London WC1N 3AR, England
[3] Univ Toronto, Dept Mol & Med Genet, Toronto, ON M5S 1A8, Canada
[4] Univ Toronto, Banting & Best Dept Med Res, Toronto, ON M5G 1L6, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
DOI
10.1186/gb-2008-9-s1-s4
Chinese Library Classification
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Most successful computational approaches for protein function prediction integrate multiple genomics and proteomics data sources to make inferences about the function of unknown proteins. The most accurate of these algorithms have long running times, making them unsuitable for real-time protein function prediction in large genomes. As a result, the predictions of these algorithms are stored in static databases that can easily become outdated. We propose a new algorithm, GeneMANIA, that is as accurate as the leading methods, while capable of predicting protein function in real-time.

Results: We use a fast heuristic algorithm, derived from ridge regression, to integrate multiple functional association networks and predict gene function from a single process-specific network using label propagation. Our algorithm is efficient enough to be deployed on a modern webserver and is as accurate as, or more so than, the leading methods on the MouseFunc I benchmark and a new yeast function prediction benchmark; it is robust to redundant and irrelevant data and requires, on average, less than ten seconds of computation time on tasks from these benchmarks.

Conclusion: GeneMANIA is fast enough to predict gene function on-the-fly while achieving state-of-the-art accuracy. A prototype version of a GeneMANIA-based webserver is available at http://morrislab.med.utoronto.ca/prototype.
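The two steps named in the abstract (regression-based network weighting, then label propagation on the combined network) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names `combine_networks` and `propagate_labels`, the `ridge` parameter, the omission of an intercept term, the clipping of negative weights, and the choice to give unlabeled genes the mean of the known labels are all assumptions made here for illustration.

```python
import numpy as np


def combine_networks(networks, labels, ridge=1e-3):
    """Weight networks by ridge regression on labeled gene pairs (illustrative).

    networks: list of symmetric (n, n) affinity matrices with zero diagonal.
    labels:   length-n vector; +1 positive, -1 negative, 0 unlabeled.
    """
    labels = np.asarray(labels, dtype=float)
    labeled = np.flatnonzero(labels != 0)
    upper = np.triu_indices(len(labeled), k=1)
    # Target is +1 when two labeled genes agree and -1 when they disagree.
    target = np.outer(labels[labeled], labels[labeled])[upper]
    # One row per labeled gene pair, one column per network.
    X = np.column_stack(
        [np.asarray(W, dtype=float)[np.ix_(labeled, labeled)][upper] for W in networks]
    )
    # Closed-form ridge solution: (X^T X + ridge * I) w = X^T target.
    gram = X.T @ X + ridge * np.eye(X.shape[1])
    weights = np.linalg.solve(gram, X.T @ target)
    # Assumption: networks with negative weight are dropped by clipping to zero.
    weights = np.clip(weights, 0.0, None)
    if weights.sum() == 0:
        weights = np.ones(len(networks))
    weights = weights / weights.sum()
    combined = sum(w * np.asarray(W, dtype=float) for w, W in zip(weights, networks))
    return weights, combined


def propagate_labels(W, labels):
    """Score every gene by solving (I + L) f = y on the combined network."""
    W = np.asarray(W, dtype=float)
    y = np.asarray(labels, dtype=float).copy()
    unlabeled = y == 0
    # Assumption: unlabeled genes start from the mean of the known labels.
    y[unlabeled] = y[~unlabeled].mean()
    # Symmetric degree normalization, guarding against isolated genes.
    degree = W.sum(axis=1)
    inv_sqrt = np.zeros_like(degree)
    connected = degree > 0
    inv_sqrt[connected] = degree[connected] ** -0.5
    S = inv_sqrt[:, None] * W * inv_sqrt[None, :]
    # Graph Laplacian of the normalized network; I + L is positive definite.
    L = np.diag(S.sum(axis=1)) - S
    return np.linalg.solve(np.eye(len(y)) + L, y)


# Toy data: genes 0-2 form one module, genes 3-5 another.
within = np.zeros((6, 6))
for a, b in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5)]:
    within[a, b] = within[b, a] = 1.0
across = np.zeros((6, 6))
for a, b in [(0, 3), (1, 4), (2, 5)]:
    across[a, b] = across[b, a] = 1.0

labels = np.array([1, 1, 0, -1, -1, 0])
weights, combined = combine_networks([within, across], labels)
scores = propagate_labels(combined, labels)
print("network weights:", weights)
print("gene scores:", scores)
```

In this toy setup the network that links genes within the same module receives the larger weight, and the unlabeled gene in the positive module ends up with a higher score than the unlabeled gene in the negative module. Both steps reduce to solving one linear system, which is consistent with the short running times the abstract reports, although a real genome-scale deployment would need sparse matrices and an iterative solver rather than the dense `np.linalg.solve` used here.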
Pages: 15