Efficient Distributed Core Graph Decomposition

被引:2
作者
Zhang, Wenqian [1 ]
Yang, Zhengyi [1 ]
Wen, Dong [1 ]
Wang, Xiaoyang [1 ]
机构
[1] Univ New South Wales, Sydney, NSW, Australia
来源
2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023 | 2023年
关键词
k-Core; Core decomposition; Distributed Computing;
D O I
10.1109/ICDMW60847.2023.00135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Core decomposition is one of the most fundamental problems in graph analytics, which is associated with numerous applications, such as community detection, protein network analysis, and system structure analysis. As the sizes of graphs are becoming increasingly large, it is challenging to compute core decomposition on a single machine. In this paper, we study the problem of k-Core decomposition in the distributed environment. Specifically, we propose the distributed Filter-Array k-Core (FAkCore) algorithm, which adopts the commonly used Scatter-Gather framework. We design an auxiliary data structure of running counts for each vertex to track the statistics of its neighbors' core number. It allows us to recompute the core number of a vertex only when the value is updated. Together with an enhanced message filtering mechanism, our method significantly reduces redundant computation and communication in the existing distributed k-Core decomposition algorithm. Experiments on 10 real-world graphs show that our method outperforms the baseline algorithms by 1.4 times on average and up to 2.2 times.
引用
收藏
页码:1023 / 1031
页数:9
相关论文
共 26 条
[1]  
Alvarez-Hamelin J., 2005, Advances in neural information processing systems, V18
[2]  
[Anonymous], Apache Flink Gelly
[3]  
[Anonymous], Apache Flink
[4]   Large-scale identification of protein-protein interaction of Escherichia coli K-12 [J].
Arifuzzaman, M ;
Maeda, M ;
Itoh, A ;
Nishikata, K ;
Takita, C ;
Saito, R ;
Ara, T ;
Nakahigashi, K ;
Huang, HC ;
Hirai, A ;
Tsuzuki, K ;
Nakamura, S ;
Altaf-Ul-Amin, M ;
Oshima, T ;
Baba, T ;
Yamamoto, N ;
Kawamura, T ;
Ioka-Nakamichi, T ;
Kitagawa, M ;
Tomita, M ;
Kanaya, S ;
Wada, C ;
Mori, H .
GENOME RESEARCH, 2006, 16 (05) :686-691
[5]   Clique Relaxations in Social Network Analysis: The Maximum k-Plex Problem [J].
Balasundaram, Balabhaskar ;
Butenko, Sergiy ;
Hicks, Illya V. .
OPERATIONS RESEARCH, 2011, 59 (01) :133-142
[6]   Fast algorithms for determining (generalized) core groups in social networks [J].
Batagelj, Vladimir ;
Zaversnik, Matjaz .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2011, 5 (02) :129-145
[7]  
Cheng J, 2011, PROC INT CONF DATA, P51, DOI 10.1109/ICDE.2011.5767911
[8]   K-Connected Cores Computation in Large Dual Networks [J].
Cui L. ;
Yue L. ;
Wen D. ;
Qin L. .
Data Science and Engineering, 2018, 3 (4) :293-306
[9]   Local Search of Communities in Large Graphs [J].
Cui, Wanyun ;
Xiao, Yanghua ;
Wang, Haixun ;
Wang, Wei .
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, :991-1002
[10]   K-Core Decomposition on Super Large Graphs with Limited Resources [J].
Gao, Shicheng ;
Xu, Jie ;
Li, Xiaosen ;
Fu, Fangcheng ;
Zhang, Wentao ;
Ouyang, Wen ;
Tao, Yangyu ;
Cui, Bin .
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, :413-422