Improving vertex-frontier based GPU breadth-first search

被引:0
作者
杨博 [1 ,2 ]
卢凯 [1 ,2 ]
高颖慧 [3 ]
徐凯 [1 ,2 ]
王小平 [1 ,2 ]
程志权 [4 ]
机构
[1] Science and Technology on Parallel and Distributed Processing Laboratory,National University of Defense Technology
[2] College of Computer, National University of Defense Technology
[3] Department of Electronic Science and Engineering, National University of Defense Technology
[4] Avatar Science Company
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
breadth-first search; GPU; graph traversal; vertex frontier;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Breadth-first search(BFS) is an important kernel for graph traversal and has been used by many graph processing applications. Extensive studies have been devoted in boosting the performance of BFS. As the most effective solution, GPU-acceleration achieves the state-of-the-art result of 3.3×109 traversed edges per second on a NVIDIA Tesla C2050 GPU. A novel vertex frontier based GPU BFS algorithm is proposed, and its main features are three-fold. Firstly, to obtain a better workload balance for irregular graphs, a virtual-queue task decomposition and mapping strategy is introduced for vertex frontier expanding. Secondly, a global deduplicate detection scheme is proposed to remove reduplicative vertices from vertex frontier effectively. Finally, a GPU-based bottom-up BFS approach is employed to process large frontier. The experimental results demonstrate that the algorithm can achieve 10% improvement over the state-of-the-art method on diverse graphs. Especially, it exhibits 2-3 times speedup on low-diameter and scale-free graphs over the state-of-the-art on a NVIDIA Tesla K20 c GPU, reaching a peak traversal rate of 11.2×109 edges/s.
引用
收藏
页码:3828 / 3836
页数:9
相关论文
共 50 条
  • [21] A Low Communication Overhead Breadth-First Search Based on Global Bitmap
    Peng, Ziwei
    Lu, Yutong
    Cheng, Zhiguang
    Du, Yunfei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT II, 2018, 11335 : 114 - 129
  • [22] CyGraph: A Reconfigurable Architecture for Parallel Breadth-First Search
    Attia, Osama G.
    Johnson, Tyler
    Townsend, Kevin
    Jones, Philip
    Zambreno, Joseph
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 228 - 235
  • [23] Distributed breadth-first search LTL model checking
    Jiří Barnat
    Ivana Černá
    Formal Methods in System Design, 2006, 29 : 117 - 134
  • [24] Breadth-First Search with A Multi-Core Computer
    Belova, Maryia
    Ouyang, Ming
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 579 - 587
  • [25] An adaptive breadth-first search algorithm on integrated architectures
    Feng Zhang
    Heng Lin
    Jidong Zhai
    Jie Cheng
    Dingyi Xiang
    Jizhong Li
    Yunpeng Chai
    Xiaoyong Du
    The Journal of Supercomputing, 2018, 74 : 6135 - 6155
  • [26] Heuristic algorithms for reliability estimation based on breadth-first search of a grid tree
    Chen, Xuyong
    Xu, Zhifeng
    Wu, Yushun
    Wu, Qiaoyun
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 232
  • [27] A fewest-turn-and-shortest path algorithm based on breadth-first search
    Zhou, Yan
    Wang, Weisheng
    He, Di
    Wang, Zhe
    GEO-SPATIAL INFORMATION SCIENCE, 2014, 17 (04) : 201 - 207
  • [28] WAVE: designing a heuristics-based three-way breadth-first search on GPUs
    Daegun Yoon
    Minjoong Jeong
    Sangyoon Oh
    The Journal of Supercomputing, 2023, 79 : 6889 - 6917
  • [29] WAVE: designing a heuristics-based three-way breadth-first search on GPUs
    Yoon, Daegun
    Jeong, Minjoong
    Oh, Sangyoon
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (06) : 6889 - 6917
  • [30] Fast and Scalable NUMA-based Thread Parallel Breadth-first Search
    Yasui, Yuichiro
    Fujisawa, Katsuki
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2015), 2015, : 377 - 385