Identifying protein complexes based on node embeddings obtained from protein-protein interaction networks

被引:9
|
作者
Liu, Xiaoxia [1 ]
Yang, Zhihao [1 ]
Sang, Shengtian [1 ]
Zhou, Ziwei [1 ]
Wang, Lei [2 ]
Zhang, Yin [2 ]
Lin, Hongfei [1 ]
Wang, Jian [1 ]
Xu, Bo [3 ]
机构
[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116024, Liaoning, Peoples R China
[2] Beijing Inst Hlth Adm & Med Informat, Beijing 100850, Peoples R China
[3] Dalian Univ Technol, Sch Software Technol, Dalian 116024, Liaoning, Peoples R China
来源
BMC BIOINFORMATICS | 2018年 / 19卷
基金
中国国家自然科学基金;
关键词
Node embeddings; Random forest; Supervised learning method; Protein complex detection; PPI NETWORKS; ANNOTATION; DATABASE;
D O I
10.1186/s12859-018-2364-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Protein complexes are one of the keys to deciphering the behavior of a cell system. During the past decade, most computational approaches used to identify protein complexes have been based on discovering densely connected subgraphs in protein-protein interaction (PPI) networks. However, many true complexes are not dense subgraphs and these approaches show limited performances for detecting protein complexes from PPI networks. Results: To solve these problems, in this paper we propose a supervised learning method based on network node embeddings which utilizes the informative properties of known complexes to guide the search process for new protein complexes. First, node embeddings are obtained from human protein interaction network. Then the protein interactions are weighted through the similarities between node embeddings. After that, the supervised learning method is used to detect protein complexes. Then the random forest model is used to filter the candidate complexes in order to obtain the final predicted complexes. Experimental results on real human and yeast protein interaction networks show that our method effectively improves the performance for protein complex detection. Conclusions: We provided a new method for identifying protein complexes from human and yeast protein interaction networks, which has great potential to benefit the field of protein complex detection.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] DPCMNE: Detecting Protein Complexes From Protein-Protein Interaction Networks Via Multi-Level Network Embedding
    Meng, Xiangmao
    Xiang, Ju
    Zheng, Ruiqing
    Wu, Fang-Xiang
    Li, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1592 - 1602
  • [22] Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
    Wang, Jian
    Xie, Dong
    Lin, Hongfei
    Yang, Zhihao
    Zhang, Yijia
    PROTEOME SCIENCE, 2012, 10
  • [23] Analyzing Protein-Protein Interaction Networks
    Koh, Gavin C. K. W.
    Porras, Pablo
    Aranda, Bruno
    Hermjakob, Henning
    Orchard, Sandra E.
    JOURNAL OF PROTEOME RESEARCH, 2012, 11 (04) : 2014 - 2031
  • [24] A Novel Method to Predict Protein-Protein Interactions Based on the Information of Protein-Protein Interaction Networks and Protein Sequence
    Ma, Dai-Chuan
    Diao, Yuan-Bo
    Guo, Yan-Zhi
    Li, Yi-Zhou
    Zhang, Yong-Qing
    Wu, Jiang
    Li, Meng-Long
    PROTEIN AND PEPTIDE LETTERS, 2011, 18 (09) : 906 - 911
  • [25] On the Planarity of Validated Complexes of Model Organisms in Protein-Protein Interaction Networks
    Cooper, Kathryn
    Cornelius, Nathan
    Gasper, William
    Bhowmick, Sanjukta
    Ali, Hesham
    COMPUTATIONAL SCIENCE - ICCS 2020, PT I, 2020, 12137 : 652 - 666
  • [26] Module organization and variance in protein-protein interaction networks
    Lin, Chun-Yu
    Lee, Tsai-Ling
    Chiu, Yi-Yuan
    Lin, Yi-Wei
    Lo, Yu-Shu
    Lin, Chih-Ta
    Yang, Jinn-Moon
    SCIENTIFIC REPORTS, 2015, 5
  • [27] Active learning for protein function prediction in protein-protein interaction networks
    Xiong, Wei
    Xie, Luyu
    Zhou, Shuigeng
    Guan, Jihong
    NEUROCOMPUTING, 2014, 145 : 44 - 52
  • [28] Identifying the overlapping complexes in protein interaction networks
    Li, Min
    Wang, Jianxin
    Chen, Jianer
    Cai, Zhao
    Chen, Gang
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (01) : 91 - 108
  • [29] An analysis pipeline for the inference of protein-protein interaction networks
    Taylor, Ronald C.
    Singhal, Mudita
    Daly, Don S.
    Gilmore, Jason
    Cannon, William R.
    Domico, Kelly
    White, Amanda M.
    Auberry, Deanna L.
    Auberry, Kenneth J.
    Hooker, Brian S.
    Hurst, Greg
    McDermott, Jason E.
    McDonald, W. Hayes
    Pelletier, Dale A.
    Schmoyer, Denise
    Wiley, H. Steven
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2009, 3 (04) : 409 - 430
  • [30] Protein-protein interaction networks as miners of biological discovery
    Wang, Steven
    Wu, Runxin
    Lu, Jiaqi
    Jiang, Yijia
    Huang, Tao
    Cai, Yu-Dong
    PROTEOMICS, 2022, 22 (15-16)