Integration of genomic data for inferring protein complexes from global protein-protein interaction networks

被引:25
作者
Zheng, Huiru [1 ]
Wang, Haiying [1 ]
Glass, David H. [1 ]
机构
[1] Univ Ulster, Sch Comp & Math, Newtownabbey BT37 0QB, Antrim, North Ireland
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2008年 / 38卷 / 01期
关键词
Bayesian networks; clustering analysis; data integration; protein-protein interaction (PPI) networks;
D O I
10.1109/TSMCB.2007.908912
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Protein-protein interactions (PPIs) play crucial roles in virtually every aspect of cellular function within an organism. One important objective of modern biology is the extraction of functional modules, such as protein complexes from global protein interaction networks. This paper describes how seven genomic features and four experimental interaction data sets were combined using a Bayesian-networks-based data integration approach to infer PPI networks in yeast. Greater coverage and higher accuracy were achieved than in previous high-throughput studies of PPI networks in yeast. A Markov clustering algorithm was then used to extract protein complexes from the inferred protein interaction networks. The quality of the computed complexes was evaluated using the hand-curated complexes from the Munich Information Center for Protein Sequences database and gene-ontology-driven semantic similarity. The results indicated that, by integrating multiple genomic information sources, a better clustering result was obtained in terms of both statistical measures and biological relevance.
引用
收藏
页码:5 / 16
页数:12
相关论文
共 42 条
  • [1] [Anonymous], 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
  • [2] [Anonymous], 2000, INSR0012 NAT RES I M
  • [3] Azuaje F, 2006, ICDM 2006: Sixth IEEE International Conference on Data Mining, Workshops, P114
  • [4] An automated method for finding molecular complexes in large protein interaction networks
    Bader, GD
    Hogue, CW
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [5] Superparamagnetic clustering of data
    Blatt, M
    Wiseman, S
    Domany, E
    [J]. PHYSICAL REVIEW LETTERS, 1996, 76 (18) : 3251 - 3254
  • [6] Prolinks: a database of protein functional linkages derived from coevolution
    Bowers, PM
    Pellegrini, M
    Thompson, MJ
    Fierro, J
    Yeates, TO
    Eisenberg, D
    [J]. GENOME BIOLOGY, 2004, 5 (05)
  • [7] Evaluation of clustering algorithms for protein-protein interaction networks
    Brohee, Sylvain
    van Helden, Jacques
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [8] A genome-wide transcriptional analysis of the mitotic cell cycle
    Cho, RJ
    Campbell, MJ
    Winzeler, EA
    Steinmetz, L
    Conway, A
    Wodicka, L
    Wolfsberg, TG
    Gabrielian, AE
    Landsman, D
    Lockhart, DJ
    Davis, RW
    [J]. MOLECULAR CELL, 1998, 2 (01) : 65 - 73
  • [9] Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae
    Collins, Sean R.
    Kemmeren, Patrick
    Zhao, Xue-Chu
    Greenblatt, Jack F.
    Spencer, Forrest
    Holstege, Frank C. P.
    Weissman, Jonathan S.
    Krogan, Nevan J.
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (03) : 439 - 450
  • [10] Cowell R. G., 1999, PROBABILISTIC NETWOR