On the architectural requirements for efficient execution of graph algorithms

被引:35
|
作者
Bader, DA [1 ]
Cong, GJ [1 ]
Feo, J [1 ]
机构
[1] Univ New Mexico, Dept Elect & Comp Engn, Albuquerque, NM 87131 USA
来源
2005 International Conference on Parallel Processsing, Proceedings | 2005年
关键词
list ranking; connected components; graph algorithms; shared memory; multithreading;
D O I
10.1109/ICPP.2005.55
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Combinatorial problems such as those from graph theory pose serious challenges for parallel machines due to non-contiguous, concurrent accesses to global data structures with low degrees of locality. The hierarchical memory systems of symmetric multiprocessor (SMP) clusters optimize for local, contiguous memory accesses, and so are inefficient platforms for such algorithms. Few parallel graph algorithms outperform their best sequential implementation on SMP clusters due to long memory latencies and high synchronization costs. In this paper, we consider the performance and scalability of two graph algorithms, list ranking and connected components, on two classes of shared-memory computers: symmetric multiprocessors such as the Sun Enterprise servers and multithreaded architectures (MTA) such as the Cray MTA-2. While previous studies have shown that parallel graph algorithms can speedup on SMPs, the systems' reliance on cache microprocessors limits performance. The MTA's latency tolerant processors and hardware support for fine-grain synchronization makes performance a function of parallelism. Since parallel graph algorithms have an abundance of parallelism, they perform and scale significantly better on the MTA. We describe and give a performance model for each architecture. We analyze the performance of the two algorithms and discuss how the features of each architecture affects algorithm development, ease of programming, performance, and scalability.
引用
收藏
页码:547 / 556
页数:10
相关论文
共 50 条
  • [21] Branch-Avoiding Graph Algorithms
    Green, Oded
    Dukhan, Marat
    Vuduc, Richard
    SPAA'15: PROCEEDINGS OF THE 27TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2015, : 212 - 223
  • [22] Editorial: Special Issue on Graph Algorithms
    Jansson, Jesper
    ALGORITHMS, 2013, 6 (03) : 457 - 458
  • [23] Exploring the Design Space of Static and Incremental Graph Connectivity Algorithms on GPUs
    Hong, Changwan
    Dhulipala, Laxman
    Shun, Julian
    PACT '20: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2020, : 55 - 69
  • [24] PARALLEL GRAPH ALGORITHMS FOR HYPERCUBE COMPUTERS
    DAS, SK
    DEO, N
    PRASAD, S
    PARALLEL COMPUTING, 1990, 13 (02) : 143 - 158
  • [25] Locality Analysis of Graph Reordering Algorithms
    Esfahani, Mohsen Koohi
    Kilpatrick, Peter
    Vandierendonck, Hans
    2021 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2021), 2021, : 101 - 112
  • [26] Graph Algorithms with Small Communication Costs
    Jieliang Zhou
    Patrick Dymond
    Xiaotie Deng
    Journal of Combinatorial Optimization, 2000, 4 : 291 - 305
  • [27] Lipschitz Continuous Algorithms for Graph Problems
    Kumabe, Soh
    Yoshida, Yuichi
    2023 IEEE 64TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, FOCS, 2023, : 762 - 797
  • [28] Graph algorithms with small communication costs
    Zhou, JL
    Dymond, P
    Deng, XT
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2000, 4 (03) : 291 - 305
  • [29] Space efficient execution of deterministic parallel programs
    Simpson, DJ
    Burton, FW
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (06) : 870 - 882
  • [30] SEP-Graph: Finding Shortest Execution Paths for Graph Processing under a Hybrid Framework on GPU
    Wang, Hao
    Geng, Liang
    Lee, Rubao
    Hou, Kaixi
    Zhang, Yanfeng
    Zhang, Xiaodong
    PROCEEDINGS OF THE 24TH SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '19), 2019, : 38 - 52