Communication-aware Job Scheduling using SLURM

被引:2
|
作者
Mishra, Priya [1 ]
Agrawal, Tushar [1 ]
Malakar, Preeti [1 ]
机构
[1] Indian Inst Technol Kanpur, Kanpur, Uttar Pradesh, India
来源
49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOP PROCEEDINGS, ICPP 2020 | 2020年
关键词
job scheduling; communication-aware; job-aware; SLURM; PERFORMANCE; OPERATIONS;
D O I
10.1145/3409390.3409410
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Job schedulers play an important role in selecting optimal resources for the submitted jobs. However, most of the current job schedulers do not consider job-specific characteristics such as communication patterns during resource allocation. This often leads to sub-optimal node allocations. We propose three node allocation algorithms that consider the job's communication behavior to improve the performance of communication-intensive jobs. We develop our algorithms for tree-based network topologies. The proposed algorithms aim at minimizing network contention by allocating nodes on the least contended switches. We also show that allocating nodes in powers of two leads to a decrease in inter-switch communication for MPI communications, which further improves performance. We implement and evaluate our algorithms using SLURM, a widely-used and well-known job scheduler. We show that the proposed algorithms can reduce the execution times of communication-intensive jobs by 9% (326 hours) on average. The average wait time of jobs is reduced by 31% across three supercomputer job logs.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] CA-DAG: Modeling Communication-Aware Applications for Scheduling in Cloud Computing
    Dzmitry Kliazovich
    Johnatan E. Pecero
    Andrei Tchernykh
    Pascal Bouvry
    Samee U. Khan
    Albert Y. Zomaya
    Journal of Grid Computing, 2016, 14 : 23 - 39
  • [32] Work-in-Progress: Communication-Aware Scheduling of Data-Parallel Tasks
    Shimada, Kana
    Taniguchi, Ittetsu
    Tomiyama, Hiroyuki
    2018 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2018,
  • [33] Reconfiguration and Communication-Aware Task Scheduling for High-Performance Reconfigurable Computing
    Huang, Miaoqing
    Narayana, Vikram K.
    Simmler, Harald
    Serres, Olivier
    El-Ghazawi, Tarek
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2010, 3 (04)
  • [34] Exact Memory- and Communication-aware Scheduling of DNNs on Pipelined Edge TPUs
    Yin, Jiaqi
    Zhang, Zhiru
    Yu, Cunxi
    2022 IEEE/ACM 7TH SYMPOSIUM ON EDGE COMPUTING (SEC 2022), 2022, : 203 - 215
  • [35] Communication-Aware Robotics: Exploiting Motion for Communication
    Muralidharan, Arjun
    Mostofi, Yasamin
    ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 4, 2021, 2021, 4 : 115 - 139
  • [36] Communication-aware task placement for workflow scheduling on DaaS-based Cloud
    Croubois, Hadrien
    Caron, Eddy
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 452 - 461
  • [37] Communication-aware thread mapping using the translation lookaside buffer
    Cruz, Eduardo H. M.
    Diener, Matthias
    Navaux, Philippe O. A.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 4970 - 4992
  • [38] Communication-Aware UAV Path Planning
    Mardani, Afshin
    Chiaberge, Marcello
    Giaccon, Paolo
    IEEE ACCESS, 2019, 7 : 52609 - 52621
  • [39] A Communication-aware Scheduling Algorithm for Hardware Task Scheduling Model on FPGA-based Reconfigurable Systems
    Sheng, Yingying
    Liu, Yan
    Li, Renfa
    Xiao, Xiongren
    JOURNAL OF COMPUTERS, 2014, 9 (11) : 2552 - 2558
  • [40] Communication-aware message matching in MPI
    Ghazimirsaeed, S. Mahdieh
    Mirsadeghi, Seyed H.
    Afsahi, Ahmad
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (03):