Practical and high-quality partitioning algorithm for large-scale and time-evolving graphs

被引:1
|
作者
Luo, Xiangyu [1 ,2 ]
Luo, Yingxiao [1 ]
Xin, Gang [3 ]
Gui, Xiaolin [2 ]
Wang, Jia [1 ]
Guo, Cheng [4 ]
机构
[1] Xian Univ Sci & Technol, Inst Syst Secur & Control, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Peoples R China
[3] AVIC Comp Tech Res Inst, Xian, Peoples R China
[4] Xian Inst Appl Opt, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Time-evolving graph; Graph partitioning; Distributed graph processing; Cross partition edge; Load balance;
D O I
10.1016/j.knosys.2021.107211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the appearance of widespread large-scale graph data, distributed graph processing grows in popularity. In order to store and analyze a large-scale graph in a distributed manner, the graph should be properly partitioned in advance. Although the partitioning of static graphs has been sufficiently investigated, the emerging graphs nowadays are typically dynamic or time-evolving. State-of-the-art partitioning algorithms for such large-scale and time-evolving graphs either are impractical because of the complexity caused by the global optimization or sacrifice partitioning quality due to a huge number of cross partition edges. This paper investigates the problem and proposes a new practical and high-quality graph partitioning algorithm. First, graph updates accumulated in a time interval are expressed as a delta graph. Second, vertices in the delta graph are classified into several subsets based on a non-overlapping community detection method. Third, vertices in the same subset are assigned to a properly chosen partition as a whole. The chosen partition is lightly loaded as well as closely related with vertices in the subset. Experimental results show that compared with the widely used hash-based and heuristic-based partitioning algorithms, our proposed algorithm gains significant decrease in terms of the number of cross partition edges while maintaining a similar level of load balance. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Incremental Partitioning of Large Time-Evolving Graphs
    Abdolrashidi, Amirreza
    Ramaswamy, Lakshmish
    2015 IEEE CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC), 2015, : 19 - 27
  • [2] Effective partitioning mechanisms for time-evolving graphs in the Flink system
    Lee, Yi-Hsuan
    Jian, Sheng-Jia
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (11): : 12336 - 12354
  • [3] Effective partitioning mechanisms for time-evolving graphs in the Flink system
    Yi-Hsuan Lee
    Sheng-Jia Jian
    The Journal of Supercomputing, 2021, 77 : 12336 - 12354
  • [4] Distributed and incremental travelling salesman algorithm on time-evolving graphs
    Sharma, Shalini
    Chou, Jerry
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (10): : 10896 - 10920
  • [5] Distributed and incremental travelling salesman algorithm on time-evolving graphs
    Shalini Sharma
    Jerry Chou
    The Journal of Supercomputing, 2021, 77 : 10896 - 10920
  • [6] TgStore: An Efficient Storage System for Large Time-Evolving Graphs
    Cheng, Yongli
    Ma, Yan
    Jiang, Hong
    Zeng, Lingfang
    Wang, Fang
    Xu, Xianghao
    Wu, Yuhang
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (02) : 158 - 173
  • [7] An efficient SSSP algorithm on time-evolving graphs with prediction of computation results
    Cheng, Yongli
    Huang, Chuanjie
    Jiang, Hong
    Xu, Xianghao
    Wang, Fang
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 186
  • [8] Adaptive Partitioning of Large-Scale Dynamic Graphs
    Vaquero, Luis M.
    Cuadrado, Felix
    Logothetis, Dionysios
    Martella, Claudio
    2014 IEEE 34TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2014), 2014, : 144 - 153
  • [9] GraphScope: Parameter-free Mining of Large Time-evolving Graphs
    Sun, Jimeng
    Yu, Philip S.
    Papadimitriou, Spiros
    Faloutsos, Christos
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 687 - 696
  • [10] Top-k Distance Queries on Large Time-Evolving Graphs
    D'Ascenzo, Andrea
    D'Emidio, Mattia
    IEEE ACCESS, 2023, 11 : 102228 - 102242