Towards the identification of protein complexes and functional modules by integrating PPI network and gene expression data

被引:110
作者
Li, Min [1 ,2 ]
Wu, Xuehong [1 ]
Wang, Jianxin [1 ]
Pan, Yi [1 ,3 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
[2] Cent S Univ, State Key Lab Med Genet, Changsha 410078, Hunan, Peoples R China
[3] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
来源
BMC BIOINFORMATICS | 2012年 / 13卷
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
COMMUNITY STRUCTURE; ALGORITHM; TRANSCRIPTION; CYCLE;
D O I
10.1186/1471-2105-13-109
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Identification of protein complexes and functional modules from protein-protein interaction (PPI) networks is crucial to understanding the principles of cellular organization and predicting protein functions. In the past few years, many computational methods have been proposed. However, most of them considered the PPI networks as static graphs and overlooked the dynamics inherent within these networks. Moreover, few of them can distinguish between protein complexes and functional modules. Results: In this paper, a new framework is proposed to distinguish between protein complexes and functional modules by integrating gene expression data into protein-protein interaction ( PPI) data. A series of time-sequenced subnetworks (TSNs) is constructed according to the time that the interactions were activated. The algorithm TSN-PCD was then developed to identify protein complexes from these TSNs. As protein complexes are significantly related to functional modules, a new algorithm DFM-CIN is proposed to discover functional modules based on the identified complexes. The experimental results show that the combination of temporal gene expression data with PPI data contributes to identifying protein complexes more precisely. A quantitative comparison based on f-measure reveals that our algorithm TSN-PCD outperforms the other previous protein complex discovery algorithms. Furthermore, we evaluate the identified functional modules by using "Biological Process" annotated in GO ( Gene Ontology). The validation shows that the identified functional modules are statistically significant in terms of "Biological Process". More importantly, the relationship between protein complexes and functional modules are studied. Conclusions: The proposed framework based on the integration of PPI data and gene expression data makes it possible to identify protein complexes and functional modules more effectively. Moveover, the proposed new framework and algorithms can distinguish between protein complexes and functional modules. Our findings suggest that functional modules are closely related to protein complexes and a functional module may consist of one or multiple protein complexes. The program is available at http://netlab.csu.edu.cn/bioinfomatics/limin/DFM-CIN/index.html.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Identifying protein complexes and functional modules-from static PPI networks to dynamic PPI networks
    Chen, Bolin
    Fan, Weiwei
    Liu, Juan
    Wu, Fang-Xiang
    BRIEFINGS IN BIOINFORMATICS, 2014, 15 (02) : 177 - 194
  • [2] Identification of Overlapping Protein Complexes Using Structural and Functional Information of PPI Network
    Wang Xuesong
    Cheng Yuhu
    Sun Weifang
    CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 564 - 568
  • [3] Identifying Dynamic Protein Complexes Based on Gene Expression Profiles and PPI Networks
    Li, Min
    Chen, Weijie
    Wang, Jianxin
    Wu, Fang-Xiang
    Pan, Yi
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [4] Identification of Hierarchical and Overlapping Functional Modules in PPI Networks
    Wang, Jianxin
    Ren, Jun
    Li, Min
    Wu, Fang-Xiang
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2012, 11 (04) : 386 - 393
  • [5] CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
    Xu, Ying
    Zhou, Jiaogen
    Zhou, Shuigeng
    Guan, Jihong
    BMC SYSTEMS BIOLOGY, 2017, 11
  • [6] A comparison of the functional modules identified from time course and static PPI network data
    Tang, Xiwei
    Wang, Jianxin
    Liu, Binbin
    Li, Min
    Chen, Gang
    Pan, Yi
    BMC BIOINFORMATICS, 2011, 12
  • [7] Integrating heterogeneous gene expression data for gene regulatory network modelling
    Sirbu, Alina
    Ruskin, Heather J.
    Crane, Martin
    THEORY IN BIOSCIENCES, 2012, 131 (02) : 95 - 102
  • [8] Integrating network topology, gene expression data and GO annotation information for protein complex prediction
    Zhang, Wei
    Xu, Jia
    Li, Yuanyuan
    Zou, Xiufen
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2019, 17 (01)
  • [9] Deciphering the Arabidopsis Floral Transition Process by Integrating a Protein-Protein Interaction Network and Gene Expression Data
    He, Fei
    Zhou, Yuan
    Zhang, Ziding
    PLANT PHYSIOLOGY, 2010, 153 (04) : 1492 - 1505
  • [10] ModuleSearch: finding functional modules in a protein-protein interaction network
    Cui, Guangyu
    Shrestha, Rojan
    Han, Kyungsook
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2012, 15 (07) : 691 - 699