An improved graph entropy-based method for identifying protein complexes

被引:4
作者
Chen, Bolin [1 ]
Yan, Yan [1 ,3 ]
Shi, Jinhong [1 ]
Zhang, Shenggui [3 ]
Wu, Fang-Xiang [1 ,2 ]
机构
[1] Univ Saskatchewan, Div Biomed Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada
[2] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada
[3] Northwestern Polytech Univ, Sch Sci, Dept Appl Math, Xian 710072, Shaanxi, Peoples R China
来源
2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011) | 2011年
基金
加拿大自然科学与工程研究理事会;
关键词
protein complex; protein-protein interaction network; graph clustering algorithm; graph entropy; INTERACTION NETWORKS; MODULES; YEAST;
D O I
10.1109/BIBM.2011.66
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Protein complexes are essential entities that perform the major cellular processes and biological functions in live organisms. The identification of component proteins in a complex from protein-protein interaction (PPI) networks is an important step to understand the organization and interaction of gene products. In existing literature, methods for identifying protein complexes typically start from a selected seed, commonly a vertex (a single protein), in a PPI network. However, in many circumstances, a single protein seed is not enough to generate a meaningful complex, or more than one protein is known in a complex. In this paper, we present an improved seed-growth style algorithm to identify protein complexes from PPI networks based on the concept of graph entropy. Different from existing methods, the seed is assumed to be a clique (e.g., a vertex, an edge, a triangle) in a PPI network. The computational experiments have been conducted on PPI network of S. cerevisiae. The results have shown that the larger cliques are considered as seeds, the better the presented method performs in terms of f-score. In particular, up to K-3-cliques are included as seeds, the average f-score is 57.32%, which is better than that of existing methods.
引用
收藏
页码:123 / 126
页数:4
相关论文
共 16 条
[1]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[2]  
Dongen Van, 2000, THESIS
[3]   Proteome survey reveals modularity of the yeast cell machinery [J].
Gavin, AC ;
Aloy, P ;
Grandi, P ;
Krause, R ;
Boesche, M ;
Marzioch, M ;
Rau, C ;
Jensen, LJ ;
Bastuck, S ;
Dümpelfeld, B ;
Edelmann, A ;
Heurtier, MA ;
Hoffman, V ;
Hoefert, C ;
Klein, K ;
Hudak, M ;
Michon, AM ;
Schelder, M ;
Schirle, M ;
Remor, M ;
Rudi, T ;
Hooper, S ;
Bauer, A ;
Bouwmeester, T ;
Casari, G ;
Drewes, G ;
Neubauer, G ;
Rick, JM ;
Kuster, B ;
Bork, P ;
Russell, RB ;
Superti-Furga, G .
NATURE, 2006, 440 (7084) :631-636
[4]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[5]   Enumeration of condition-dependent dense modules in protein interaction networks [J].
Georgii, Elisabeth ;
Dietmann, Sabine ;
Uno, Takeaki ;
Pagel, Philipp ;
Tsuda, Koji .
BIOINFORMATICS, 2009, 25 (07) :933-940
[6]   Detecting protein complexes and functional modules from protein interaction networks: A graph entropy approach [J].
Kenley, Edward Casey ;
Cho, Young-Rae .
PROTEOMICS, 2011, 11 (19) :3835-3844
[7]   Protein complex prediction via cost-based clustering [J].
King, AD ;
Przulj, N ;
Jurisica, I .
BIOINFORMATICS, 2004, 20 (17) :3013-3020
[8]   Computational approaches for detecting protein complexes from protein interaction networks: a survey [J].
Li, Xiaoli ;
Wu, Min ;
Kwoh, Chee-Keong ;
Ng, See-Kiong .
BMC GENOMICS, 2010, 11
[9]  
Lian H, 2010, IEEE INT C BIOINFORM, P585, DOI 10.1109/BIBM.2010.5706633
[10]   Investigation of a protein complex network [J].
Mashaghi, AR ;
Ramezanpour, A ;
Karimipour, V .
EUROPEAN PHYSICAL JOURNAL B, 2004, 41 (01) :113-121