Hierarchical Dynamic Loop Self-Scheduling on Distributed-Memory Systems Using an MPI+MPI Approach

Cited by: 3
Authors
Eleliemy, Ahmed [1]
Ciorba, Florina M. [1]
Affiliations
[1] Univ Basel, Dept Math & Comp Sci, Basel, Switzerland
Source
2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW) | 2019
Funding
Swiss National Science Foundation
Keywords
Dynamic loop self-scheduling; Hierarchical dynamic loop self-scheduling; MPI; OpenMP; MPI+OpenMP; MPI+MPI; PARALLEL; LIBRARY; SCHEME
DOI
10.1109/IPDPSW.2019.00117
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Computationally-intensive loops are the primary source of parallelism in scientific applications. Such loops are often irregular and a balanced execution of their loop iterations is critical for achieving high performance. However, several factors may lead to an imbalanced load execution, such as problem characteristics, algorithmic, and systemic variations. Dynamic loop self-scheduling (DLS) techniques are devised to mitigate these factors, and consequently, improve application performance. On distributed-memory systems, DLS techniques can be implemented using a hierarchical master-worker execution model and are, therefore, called hierarchical DLS techniques. These techniques self-schedule loop iterations at two levels of hardware parallelism: across and within compute nodes. Hybrid programming approaches that combine the message passing interface (MPI) with open multi-processing (OpenMP) dominate the implementation of hierarchical DLS techniques. The MPI-3 standard includes the feature of sharing memory regions among MPI processes. This feature introduced the MPI+MPI approach that simplifies the implementation of parallel scientific applications. The present work designs and implements hierarchical DLS techniques by exploiting the MPI+MPI approach. Four well-known DLS techniques are considered in the evaluation proposed herein. The results indicate certain performance advantages of the proposed approach compared to the hybrid MPI+OpenMP approach.
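To make the two-level idea in the abstract concrete, the following is a minimal sketch of hierarchical self-scheduling, written as a sequential Python simulation rather than MPI code. It uses guided self-scheduling (chunk size = ceil(remaining iterations / number of requesters)) as the chunk rule at both levels; that choice is an assumption made for illustration, since the abstract does not name the four techniques evaluated. The function names `gss_chunks` and `hierarchical_schedule` are hypothetical and do not come from the paper. In a real run, chunks would be handed to whichever node or process becomes idle first; here they are simply enumerated in order.

```python
from math import ceil


def gss_chunks(total_iterations, num_requesters):
    """Yield (start, size) chunks using guided self-scheduling.

    Each chunk takes ceil(remaining / num_requesters) iterations,
    so chunk sizes shrink as the loop nears completion.
    """
    start = 0
    remaining = total_iterations
    while remaining > 0:
        size = ceil(remaining / num_requesters)
        yield start, size
        start += size
        remaining -= size


def hierarchical_schedule(total_iterations, num_nodes, workers_per_node):
    """Split the loop at two levels: across nodes, then within a node.

    Returns a list of (node_chunk, local_chunks) pairs, where node_chunk
    is the (start, size) range obtained at the inter-node level and
    local_chunks are the sub-ranges handed out inside that node.
    """
    schedule = []
    for node_start, node_size in gss_chunks(total_iterations, num_nodes):
        local_chunks = [
            (node_start + offset, size)
            for offset, size in gss_chunks(node_size, workers_per_node)
        ]
        schedule.append(((node_start, node_size), local_chunks))
    return schedule


if __name__ == "__main__":
    # Hypothetical problem size: 100 iterations, 2 nodes, 4 workers per node.
    for node_chunk, local_chunks in hierarchical_schedule(100, 2, 4):
        print(node_chunk, local_chunks)
```

In an MPI+MPI implementation, the inner level would presumably operate on a work queue held in an MPI-3 shared-memory window on each node, while the outer level would exchange chunks between nodes; the sketch above only illustrates how the iteration space is partitioned.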
Pages: 689-697
Number of pages: 9