A Non-local Graph Neural Network for Identification of Essential Proteins

被引:4
作者
Zhang, Houwang [1 ]
Feng, Zhenan [1 ]
Wu, Chong [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
来源
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年
关键词
Graph neural network; identification of essential proteins; protein-protein interaction network; attention mechanism; PREDICTING ESSENTIAL GENES; CENTRALITY;
D O I
10.1109/IJCNN55064.2022.9892648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of essential proteins is a hot topic in bioinformatics. In recent years, various traditional methods have been proposed, which usually take topological features to rank the proteins and then set a threshold for selecting essential proteins. Some researchers have also tried to take machine learning methods or deep learning methods for the prediction of essential proteins. However, these methods can not well extract the topological features of protein-protein interaction (PPI) network. Besides, although some scholars proposed to combine biological information with PPI network to reduce the noise of PPI data, how to well combine the biological information with PPI network remains to be a problem. In this paper, we propose a non-local graph neural network to tackle the above problems. In our algorithm, we use graph convolutional network (GCN) layers to extract graph embeddings of PPI network and take three kinds of biological information like gene expression profiles, subcellular localization data, and protein complex data as the features of proteins. Besides, based on the characteristics of our model we design a non-local module as the attention mechanism, which is widely used in convolutional neural networks, to aggregate the information of the graph embeddings. Finally, we use another GCN layer for the node classification and obtain the essential proteins. In the experiments, we evaluated our method based on the widely used S. cerevisiae (Yeast) dataset. The experimental results show that our method can obtain significant improvements over traditional topology-based methods, machine learning-based methods, and recently proposed deep learning-based methods.
引用
收藏
页数:8
相关论文
共 56 条
[1]   Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information [J].
Acencio, Marcio L. ;
Lemke, Ney .
BMC BIOINFORMATICS, 2009, 10 :290
[2]  
Belloze K., 2020, NETWORKS SYSTEMS BIO, P45, DOI [DOI 10.1007/978-3-030-51862-2_4, 10.1007/978-3-030-51862-2_4]
[3]  
BONACICH P, 1987, AM J SOCIOL, V92, P1170, DOI 10.1086/228631
[4]   A non-local algorithm for image denoising [J].
Buades, A ;
Coll, B ;
Morel, JM .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :60-65
[5]  
Chen JQ, 2014, PLOS ONE, V9, DOI [10.1371/journal.pone.0085161, 10.1371/journal.pone.0101277]
[6]  
Chen SX, 2020, IEEE W CONTR MODEL, P895
[7]   A Novel Model for Predicting Essential Proteins Based on Heterogeneous Protein-Domain Network [J].
Chen, Zhiping ;
Meng, Zixuan ;
Liu, Chaoping ;
Wang, Xiangyi ;
Kuang, Linai ;
Pei, Tingrui ;
Wang, Lei .
IEEE ACCESS, 2020, 8 :8946-8958
[8]   Saccharomyces Genome Database: the genomics resource of budding yeast [J].
Cherry, J. Michael ;
Hong, Eurie L. ;
Amundsen, Craig ;
Balakrishnan, Rama ;
Binkley, Gail ;
Chan, Esther T. ;
Christie, Karen R. ;
Costanzo, Maria C. ;
Dwight, Selina S. ;
Engel, Stacia R. ;
Fisk, Dianna G. ;
Hirschman, Jodi E. ;
Hitz, Benjamin C. ;
Karra, Kalpana ;
Krieger, Cynthia J. ;
Miyasato, Stuart R. ;
Nash, Rob S. ;
Park, Julie ;
Skrzypek, Marek S. ;
Simison, Matt ;
Weng, Shuai ;
Wong, Edith D. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D700-D705
[9]   Targeting virulence: a new paradigm for antimicrobial therapy [J].
Clatworthy, Anne E. ;
Pierson, Emily ;
Hung, Deborah T. .
NATURE CHEMICAL BIOLOGY, 2007, 3 (09) :541-548
[10]   Genome-wide screening for gene function using RNAi in mammalian cells [J].
Cullen, LM ;
Arndt, GM .
IMMUNOLOGY AND CELL BIOLOGY, 2005, 83 (03) :217-223