An Efficient MPI Message Queue Mechanism for Large-scale Jobs

被引:12
|
作者
Zounmevo, Judicael A. [1 ]
Afsahi, Ahmad [1 ]
机构
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON, Canada
来源
PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012) | 2012年
关键词
MPI; Message Queues; Multidimensional Searches; Scalability; Exascale;
D O I
10.1109/ICPADS.2012.70
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Message Passing Interface (MPI) message queues have been shown to grow proportionately to the job size for many applications. With such a behaviour and knowing that message queues are used very frequently, ensuring fast queue operations at large scales is of paramount importance in the current and the upcoming exascale computing eras. Scalability, however, is two-fold. With the growing processor core density per node, and the expected smaller memory density per core at larger scales, a queue mechanism that is blind on memory requirements poses another scalability issue even if it solves the speed of operation problem. In this work we propose a multidimensional queue traversal mechanism whose operation time and memory overhead grow sub-linearly with the job size. We compare our proposal with a linked list-based approach which is not scalable in terms of speed of operation, and with an array-based method which is not scalable in terms of memory consumption. Our proposed multidimensional approach yields queue operation time speedups that translate to up to 4-fold execution time improvement over the linked list design for the applications studied in this work. It also shows a consistent lower memory footprint compared to the array-based design.
引用
收藏
页码:464 / 471
页数:8
相关论文
共 50 条
  • [1] A fast and resource-conscious MPI message queue mechanism for large-scale jobs
    Zounmevo, Judicael A.
    Afsahi, Ahmad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING AND ESCIENCE, 2014, 30 : 265 - 290
  • [2] MapReduce in MPI for Large-scale graph algorithms
    Plimpton, Steven J.
    Devine, Karen D.
    PARALLEL COMPUTING, 2011, 37 (09) : 610 - 632
  • [3] Efficient MPI-AllReduce for large-scale deep learning on GPU-clusters
    Truong Thao Nguyen
    Wahib, Mohamed
    Takano, Ryousei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (12)
  • [4] WEB PORTAL FOR LARGE-SCALE COMPUTATIONS BASED ON GRID AND MPI
    Akzhalova, Assel Zh.
    Aizhulov, Daniar Y.
    Seralin, Galymzhan
    Balakayeva, Gulnar
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2008, 9 (02): : 135 - 142
  • [5] Interoperability strategies for GASPI and MPI in large-scale scientific applications
    Simmendinger, Christian
    Iakymchuk, Roman
    Cebamanos, Luis
    Akhmetova, Dana
    Bartsch, Valeria
    Rotaru, Tiberiu
    Rahn, Mirko
    Laure, Erwin
    Markidis, Stefano
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (03) : 554 - 568
  • [6] EFFICIENT SIMULATIONS OF LARGE-SCALE CONVECTIVE HEAT TRANSFER PROBLEMS
    Goik, Damian
    Banas, Krzysztof
    Bielanski, Jan
    Chlon, Kazimierz
    COMPUTER SCIENCE-AGH, 2021, 22 (04): : 517 - 538
  • [7] A Large-Scale Study of MPI Usage in Open-Source HPC Applications
    Laguna, Ignacio
    Marshall, Ryan
    Mohror, Kathryn
    Ruefenacht, Martin
    Skjellum, Anthony
    Sultana, Nawrin
    PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
  • [8] Web portal to make large-scale scientific computations based on Grid computing and MPI
    Akzhalova, Assel Zh.
    Aizhulov, Daniar Y.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, 2008, 4967 : 888 - 893
  • [9] On Efficient Network Planning and Routing in Large-Scale MANETs
    El-Hajj, Wassim
    Al-Fuqaha, Ala
    Guizani, Mohsen
    Chen, Hsiao-Hwa
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2009, 58 (07) : 3796 - 3801
  • [10] Hermes: Enabling efficient large-scale simulation in MATSim
    Graur, Dan
    Bruno, Rodrigo
    Bischoff, Joschka
    Rieser, Marcel
    Scherr, Wolfgang
    Hoefler, Torsten
    Alonso, Gustavo
    12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 635 - 641