Efficient Distributed Core Graph Decomposition

被引:2
作者
Zhang, Wenqian [1 ]
Yang, Zhengyi [1 ]
Wen, Dong [1 ]
Wang, Xiaoyang [1 ]
机构
[1] Univ New South Wales, Sydney, NSW, Australia
来源
2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023 | 2023年
关键词
k-Core; Core decomposition; Distributed Computing;
D O I
10.1109/ICDMW60847.2023.00135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Core decomposition is one of the most fundamental problems in graph analytics, which is associated with numerous applications, such as community detection, protein network analysis, and system structure analysis. As the sizes of graphs are becoming increasingly large, it is challenging to compute core decomposition on a single machine. In this paper, we study the problem of k-Core decomposition in the distributed environment. Specifically, we propose the distributed Filter-Array k-Core (FAkCore) algorithm, which adopts the commonly used Scatter-Gather framework. We design an auxiliary data structure of running counts for each vertex to track the statistics of its neighbors' core number. It allows us to recompute the core number of a vertex only when the value is updated. Together with an enhanced message filtering mechanism, our method significantly reduces redundant computation and communication in the existing distributed k-Core decomposition algorithm. Experiments on 10 real-world graphs show that our method outperforms the baseline algorithms by 1.4 times on average and up to 2.2 times.
引用
收藏
页码:1023 / 1031
页数:9
相关论文
共 26 条
[21]  
Wen D, 2016, PROC INT CONF DATA, P133, DOI 10.1109/ICDE.2016.7498235
[22]   Index-based Optimal Algorithm for Computing K-Cores in Large Uncertain Graphs [J].
Yang, Bohua ;
Wen, Dong ;
Qin, Lu ;
Zhang, Ying ;
Chang, Lijun ;
Li, Rong-Hua .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, :64-75
[23]   HUGE: An Efficient and Scalable Subgraph Enumeration System [J].
Yang, Zhengyi ;
Lai, Longbin ;
Lin, Xuemin ;
Hao, Kongzhang ;
Zhang, Wenjie .
SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, :2049-2062
[24]   On Querying Historical K-Cores [J].
Yu, Michael ;
Wen, Dong ;
Qin, Lu ;
Zhang, Ying ;
Zhang, Wenjie ;
Lin, Xuemin .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11) :2033-2045
[25]   Apache Spark: A Unified Engine for Big Data Processing [J].
Zaharia, Matei ;
Xin, Reynold S. ;
Wendell, Patrick ;
Das, Tathagata ;
Armbrust, Michael ;
Dave, Ankur ;
Meng, Xiangrui ;
Rosen, Josh ;
Venkataraman, Shivaram ;
Franklin, Michael J. ;
Ghodsi, Ali ;
Gonzalez, Joseph ;
Shenker, Scott ;
Stoica, Ion .
COMMUNICATIONS OF THE ACM, 2016, 59 (11) :56-65
[26]   Using the k-core decomposition to analyze the static structure of large-scale software systems [J].
Zhang, Haohua ;
Zhao, Hai ;
Cai, Wei ;
Liu, Jie ;
Zhou, Wanlei .
JOURNAL OF SUPERCOMPUTING, 2010, 53 (02) :352-369