Graph Embedding via Graph Summarization

被引:5
作者
Yang, Jingyanning [1 ]
You, Jinguo [1 ,2 ]
Wan, Xiaorong [1 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming 650500, Yunnan, Peoples R China
关键词
Graph embedding; graph summarization; graph coarsening; random walks;
D O I
10.1109/ACCESS.2021.3067901
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph representation learning aims to represent the structural and semantic information of graph objects as dense real value vectors in low dimensional space by machine learning. It is widely used in node classification, link prediction, and recommendation systems. However, directly computing the embeddings for original graphs is prohibitively inefficient, especially for large-scale graphs. To address this issue, we present the GSE (Graph Summarization Embedding) model, a more efficient model that computes the nodes' embeddings based on graph summarization. Specifically, the model first searches for the minimum information entropy of k groups to transform the original graph into a hypergraph with higher-order structural features. Next, the summarization graph's connection probabilities are used to determine the biased random walks on the hypergraph, which then generates the sequences of the super-nodes. Finally, the node sequences are fed into the skip-gram to generate the vectors of these nodes. Our proposed model improves the efficiency of graph embedding on big data graphs and effectively alleviates the local optimal problem caused by the random walks. Experimental results demonstrate that GSE outperforms main existing clustering baselines, such as K_Means Clustering, Affinity Propagation Clustering, Canopy Clustering, and ACP Clustering. Moreover, our model can be coupled with the main graph embedding methods and improves the Macro-F1 scores and Micro-F1 scores for classification tasks on a variety of real-world graph data.
引用
收藏
页码:45163 / 45174
页数:12
相关论文
共 50 条
[1]   Node Embedding Preserving Graph Summarization [J].
Zhou, Houquan ;
Liu, Shenghua ;
Shen, Huawei ;
Cheng, Xueqi .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (06)
[2]   Unsupervised Graph Embedding via Adaptive Graph Learning [J].
Zhang, Rui ;
Zhang, Yunxing ;
Lu, Chengjun ;
Li, Xuelong .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) :5329-5336
[3]   Graph embedding via multi-scale graph representations [J].
Xie, Yu ;
Chen, Cheng ;
Gong, Maoguo ;
Li, Deyu ;
Qin, A. K. .
INFORMATION SCIENCES, 2021, 578 :102-115
[4]   Revealing Biological Modules via Graph Summarization [J].
Navlakha, Saket ;
Schatz, Michael C. ;
Kingsford, Carl .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (02) :253-264
[5]   Robust graph embedding via Attack-aid Graph Denoising [J].
Qin, Zhili ;
Wang, Han ;
Yu, Zhongjing ;
Yang, Qinli ;
Shao, Junming .
INFORMATION SCIENCES, 2024, 678
[6]   ROBUST GRAPH EMBEDDING VIA SELF-SUPERVISED GRAPH DENOISING [J].
Han, Wang .
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
[7]   Hierarchical graph embedding in vector space by graph pyramid [J].
Mousavi, Seyedeh Fatemeh ;
Safayani, Mehran ;
Mirzaei, Abdolreza ;
Bahonar, Hoda .
PATTERN RECOGNITION, 2017, 61 :245-254
[8]   Graph Summarization via Node Grouping: A Spectral Algorithm [J].
Merchant, Arpit ;
Mathioudakis, Michael ;
Wang, Yanhao .
PROCEEDINGS OF THE SIXTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2023, VOL 1, 2023, :742-750
[9]   Answering pattern match queries in large graph databases via graph embedding [J].
Zou, Lei ;
Chen, Lei ;
Oezsu, M. Tamer ;
Zhao, Dongyan .
VLDB JOURNAL, 2012, 21 (01) :97-120
[10]   Answering pattern match queries in large graph databases via graph embedding [J].
Lei Zou ;
Lei Chen ;
M. Tamer Özsu ;
Dongyan Zhao .
The VLDB Journal, 2012, 21 :97-120