PCE-FR: A Novel Method for Identifying Overlapping Protein Complexes in Weighted Protein-Protein Interaction Networks Using Pseudo-Clique Extension Based on Fuzzy Relation

被引:12
作者
Cao, Buwen [1 ,2 ]
Luo, Jiawei [1 ]
Liang, Cheng [3 ]
Wang, Shulin [1 ]
Ding, Pingjian [1 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Hunan City Univ, Sch Informat Sci & Engn, Yiyang 413000, Peoples R China
[3] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250014, Peoples R China
基金
中国国家自然科学基金;
关键词
PPI; Protein complex; Pseudo-clique extension; Fuzzy relation; NUCLEAR-PORE COMPLEX; FUNCTIONAL MODULES; SACCHAROMYCES-CEREVISIAE; CLUSTERING-ALGORITHM; BIOLOGICAL NETWORKS; PPI NETWORKS; IDENTIFICATION; PREDICTION; DISCOVERY; DATABASE;
D O I
10.1109/TNB.2016.2611683
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identifying overlapping protein complexes in protein-protein interaction (PPI) networks can provide insight into cellular functional organization and thus elucidate underlying cellular mechanisms. Recently, various algorithms for protein complexes detection have been developed for PPI networks. However, majority of algorithms primarily depend on network topological feature and/or gene expression profile, failing to consider the inherent biological meanings between protein pairs. In this paper, we propose a novel method to detect protein complexes using pseudo-clique extension based on fuzzy relation (PCE-FR). Our algorithm operates in three stages: it first forms the nonoverlapping protein substructure based on fuzzy relation and then expands each substructure by adding neighbor proteins to maximize the cohesive score. Finally, highly overlapped candidate protein complexes are merged to form the final protein complex set. Particularly, our algorithm employs the biological significance hidden in protein pairs to construct edge weight for protein interaction networks. The experiment results show that our method can not only outperform classical algorithms such as CFinder, ClusterONE, CMC, RRW, HC-PIN, and ProRank +, but also achieve ideal overall performance in most of the yeast PPI datasets in terms of composite score consisting of precision, accuracy, and separation. We further apply our method to a human PPI network from the HPRD dataset and demonstrate it is very effective in detecting protein complexes compared to other algorithms.
引用
收藏
页码:728 / 738
页数:11
相关论文
共 57 条
  • [1] CFinder:: locating cliques and overlapping modules in biological networks
    Adamcsek, B
    Palla, G
    Farkas, IJ
    Derényi, I
    Vicsek, T
    [J]. BIOINFORMATICS, 2006, 22 (08) : 1021 - 1023
  • [2] Structure-based assembly of protein complexes in yeast
    Aloy, P
    Böttcher, B
    Ceulemans, H
    Leutwein, C
    Mellwig, C
    Fischer, S
    Gavin, AC
    Bork, P
    Superti-Furga, G
    Serrano, L
    Russell, RB
    [J]. SCIENCE, 2004, 303 (5666) : 2026 - 2029
  • [3] Development and implementation of an algorithm for detection of protein complexes in large interaction networks
    Altaf-Ul-Amin, Md
    Shinbo, Yoko
    Mihara, Kenji
    Kurokawa, Ken
    Kanaya, Shigehiko
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [4] An automated method for finding molecular complexes in large protein interaction networks
    Bader, GD
    Hogue, CW
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [5] Evaluation of clustering algorithms for protein-protein interaction networks
    Brohee, Sylvain
    van Helden, Jacques
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [6] Identifying protein complexes in protein-protein interaction networks by using clique seeds and graph entropy
    Chen, Bolin
    Shi, Jinhong
    Zhang, Shenggui
    Wu, Fang-Xiang
    [J]. PROTEOMICS, 2013, 13 (02) : 269 - 277
  • [7] Increasing the reliability of protein interactomes
    Chua, Hon Nian
    Wong, Limsoon
    [J]. DRUG DISCOVERY TODAY, 2008, 13 (15-16) : 652 - 658
  • [8] Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions
    Chua, Hon Nian
    Sung, Wing-Kin
    Wong, Limsoon
    [J]. BIOINFORMATICS, 2006, 22 (13) : 1623 - 1630
  • [9] Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae
    Collins, Sean R.
    Kemmeren, Patrick
    Zhao, Xue-Chu
    Greenblatt, Jack F.
    Spencer, Forrest
    Holstege, Frank C. P.
    Weissman, Jonathan S.
    Krogan, Nevan J.
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (03) : 439 - 450
  • [10] Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO)
    Dwight, SS
    Harris, MA
    Dolinski, K
    Ball, CA
    Binkley, G
    Christie, KR
    Fisk, DG
    Issel-Tarver, L
    Schroeder, M
    Sherlock, G
    Sethuraman, A
    Weng, S
    Botstein, D
    Cherry, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 69 - 72