Identifying protein complexes in protein-protein interaction networks by using clique seeds and graph entropy

被引:30
作者
Chen, Bolin [1 ]
Shi, Jinhong [1 ]
Zhang, Shenggui [2 ]
Wu, Fang-Xiang [1 ,3 ]
机构
[1] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK, Canada
[2] Northwestern Polytech Univ, Dept Appl Math, Xian 710072, Shaanxi, Peoples R China
[3] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 0W0, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Bioinformatics; Clique seed; Graph entropy; Protein complex; Protein-protein interaction; COMPREHENSIVE RESOURCE; MODULES; CORUM;
D O I
10.1002/pmic.201200336
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The identification of protein complexes plays a key role in understanding major cellular processes and biological functions. Various computational algorithms have been proposed to identify protein complexes from proteinprotein interaction (PPI) networks. In this paper, we first introduce a new seed-selection strategy for seed-growth style algorithms. Cliques rather than individual vertices are employed as initial seeds. After that, a result-modification approach is proposed based on this seed-selection strategy. Predictions generated by higher order clique seeds are employed to modify results that are generated by lower order ones. The performance of this seed-selection strategy and the result-modification approach are tested by using the entropy-based algorithm, which is currently the best seed-growth style algorithm to detect protein complexes from PPI networks. In addition, we investigate four pairs of strategies for this algorithm in order to improve its accuracy. The numerical experiments are conducted on a Saccharomyces cerevisiae PPI network. The group of best predictions consists of 1711 clusters, with the average f-score at 0.68 after removing all similar and redundant clusters. We conclude that higher order clique seeds can generate predictions with higher accuracy and that our improved entropy-based algorithm outputs more reasonable predictions than the original one.
引用
收藏
页码:269 / 277
页数:9
相关论文
共 24 条
[1]  
[Anonymous], 2000, GRAPH CLUSTERING FLO
[2]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[3]   An improved graph entropy-based method for identifying protein complexes [J].
Chen, Bolin ;
Yan, Yan ;
Shi, Jinhong ;
Zhang, Shenggui ;
Wu, Fang-Xiang .
2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, :123-126
[4]   A gene-centered C. elegans protein-DNA interaction network [J].
Deplancke, Bart ;
Mukhopadhyay, Arnab ;
Ao, Wanyuan ;
Elewa, Ahmed M. ;
Grove, Christian A. ;
Martinez, Natalia J. ;
Sequerra, Reynaldo ;
Doucette-Stamm, Lynn ;
Reece-Hoyes, John S. ;
Hope, Ian A. ;
Tissenbaum, Heidi A. ;
Mango, Susan E. ;
Walhout, Albertha J. M. .
CELL, 2006, 125 (06) :1193-1205
[5]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[6]   Enumeration of condition-dependent dense modules in protein interaction networks [J].
Georgii, Elisabeth ;
Dietmann, Sabine ;
Uno, Takeaki ;
Pagel, Philipp ;
Tsuda, Koji .
BIOINFORMATICS, 2009, 25 (07) :933-940
[7]   Functional cartography of complex metabolic networks [J].
Guimerà, R ;
Amaral, LAN .
NATURE, 2005, 433 (7028) :895-900
[8]   Modelling and analysis of gene regulatory networks [J].
Karlebach, Guy ;
Shamir, Ron .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2008, 9 (10) :770-780
[9]   Detecting protein complexes and functional modules from protein interaction networks: A graph entropy approach [J].
Kenley, Edward Casey ;
Cho, Young-Rae .
PROTEOMICS, 2011, 11 (19) :3835-3844
[10]   Protein complex prediction via cost-based clustering [J].
King, AD ;
Przulj, N ;
Jurisica, I .
BIOINFORMATICS, 2004, 20 (17) :3013-3020