Heterogeneous PPI Network Representation Learning for Protein Complex Identification

被引:1
作者
Zhou, Peixuan [1 ]
Zhang, Yijia [1 ]
Chen, Fei [1 ]
Pang, Kuo [1 ]
Lu, Mingyu [1 ]
机构
[1] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian 116024, Liaoning, Peoples R China
来源
BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2022 | 2022年 / 13760卷
关键词
Protein complexes identification; Heterogeneous PPI network; Network representation learning; Attention mechanism;
D O I
10.1007/978-3-031-23198-8_20
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Protein complexes are critical units for studying a cell system. How to accurately identify protein complexes has always been the focus of research. Most of the existing methods are based on the topological structure of the Protein-Protein Interaction (PPI) network and introduce some biological information to analyze the correlation between proteins to identify protein complex. However, these methods only comprise a homogenous network of biological information and protein nodes. Most of them ignore that different types of nodes have different importance for protein complex identification. Therefore, there is an urgent need for a method to integrate different types of biological information. This paper proposes a new protein complex identification method GHAE based on heterogeneous network representation learning. Firstly, GHAE combines Gene Ontology (GO) attribute information and PPI data to construct a heterogeneous PPI network. Secondly, based on the constructed network, we use the heterogeneous representation learning method to obtain the vector representation of protein nodes. Finally, we propose a complex identification method based on a heterogeneous network to identify protein complexes. Extensive experiments show that our method achieves state-of-the-art performance in most cases.
引用
收藏
页码:217 / 228
页数:12
相关论文
共 20 条
[1]   CFinder:: locating cliques and overlapping modules in biological networks [J].
Adamcsek, B ;
Palla, G ;
Farkas, IJ ;
Derényi, I ;
Vicsek, T .
BIOINFORMATICS, 2006, 22 (08) :1021-1023
[2]  
[Anonymous], 2014, P INT C BIOINFORMATI
[3]   An ensemble framework for clustering protein-protein interaction networks [J].
Asur, Sitaram ;
Ucar, Duygu ;
Parthasarathy, Srinivasan .
BIOINFORMATICS, 2007, 23 (13) :I29-I40
[4]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[5]   A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles [J].
Chin, Chia-Hao ;
Chen, Shu-Hwa ;
Ho, Chin-Wen ;
Ko, Ming-Tat ;
Lin, Chung-Yen .
BMC BIOINFORMATICS, 2010, 11
[6]  
Heimann D., 2017, PROC KDD MLG WORKSHO, P9
[7]   Protein complex prediction via cost-based clustering [J].
King, AD ;
Przulj, N ;
Jurisica, I .
BIOINFORMATICS, 2004, 20 (17) :3013-3020
[8]   Evaluation of ontology development tools for bioinformatics [J].
Lambrix, P ;
Habbouche, M ;
Pérez, M .
BIOINFORMATICS, 2003, 19 (12) :1564-1571
[9]   Predicting Protein Complexes from PPI Data: A Core-Attachment Approach [J].
Leung, Henry C. M. ;
Xiang, Qian ;
Yiu, S. M. ;
Chin, Francis Y. L. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (02) :133-144
[10]  
Li Xiao-Li, 2007, Comput Syst Bioinformatics Conf, V6, P157