Communication-aware Job Scheduling using SLURM

被引:4
作者
Mishra, Priya [1 ]
Agrawal, Tushar [1 ]
Malakar, Preeti [1 ]
机构
[1] Indian Inst Technol Kanpur, Kanpur, Uttar Pradesh, India
来源
49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOP PROCEEDINGS, ICPP 2020 | 2020年
关键词
job scheduling; communication-aware; job-aware; SLURM; PERFORMANCE; OPERATIONS;
D O I
10.1145/3409390.3409410
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Job schedulers play an important role in selecting optimal resources for the submitted jobs. However, most of the current job schedulers do not consider job-specific characteristics such as communication patterns during resource allocation. This often leads to sub-optimal node allocations. We propose three node allocation algorithms that consider the job's communication behavior to improve the performance of communication-intensive jobs. We develop our algorithms for tree-based network topologies. The proposed algorithms aim at minimizing network contention by allocating nodes on the least contended switches. We also show that allocating nodes in powers of two leads to a decrease in inter-switch communication for MPI communications, which further improves performance. We implement and evaluate our algorithms using SLURM, a widely-used and well-known job scheduler. We show that the proposed algorithms can reduce the execution times of communication-intensive jobs by 9% (326 hours) on average. The average wait time of jobs is reduced by 31% across three supercomputer job logs.
引用
收藏
页数:10
相关论文
共 50 条
[41]   Cost-aware real-time job scheduling for hybrid cloud using deep reinforcement learning [J].
Long Cheng ;
Archana Kalapgar ;
Amogh Jain ;
Yue Wang ;
Yongtai Qin ;
Yuancheng Li ;
Cong Liu .
Neural Computing and Applications, 2022, 34 :18579-18593
[42]   Cost-aware real-time job scheduling for hybrid cloud using deep reinforcement learning [J].
Cheng, Long ;
Kalapgar, Archana ;
Jain, Amogh ;
Wang, Yue ;
Qin, Yongtai ;
Li, Yuancheng ;
Liu, Cong .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (21) :18579-18593
[43]   Locality/Fairness-Aware Job Scheduling in Distributed Stream Processing Engines [J].
Son, Siwoon ;
Moon, Yang-Sae .
ELECTRONICS, 2020, 9 (11) :1-24
[44]   Network and Data Location Aware Job Scheduling in Grid: Improvement to GridWay Metascheduler [J].
Kumar, Saumesh ;
Kumar, Naveen .
INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2012, 5 (01) :87-99
[45]   Application and Storage-Aware Data Placement and Job Scheduling for Hadoop Clusters [J].
Li, Tao ;
He, Shuibing ;
Chen, Ping ;
Yang, Siling ;
Yin, Yanlong ;
Xu, Cheng .
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2020, 29 (16)
[46]   QoS-aware Parallel Job Scheduling Framework for Simulation Execution as a Service [J].
Li, Zhen ;
Chen, Bin ;
Liu, Xiaocheng ;
Ning, Dandan ;
Duan, Wei ;
Qiu, Xiaogang ;
Xu, Chengda .
2017 IEEE/ACM 21ST INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2017, :208-211
[47]   QoS-aware simulation job scheduling algorithm in virtualized cloud environment [J].
Li, Zhen ;
Chen, Bin ;
Liu, Xiaocheng ;
Ning, Dandan ;
Qiu, Xiaogang .
INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2020, 11 (05)
[48]   Parallel job scheduling using marginal analysis [J].
Patrick, BG ;
Jack, M .
PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, :602-608
[49]   FPIA: Communication-Aware Multi-Chiplet Integration With Field-Programmable Interconnect Fabric on Reusable Silicon Interposer [J].
Jiao, Bo ;
Xu, Lei ;
Yu, Xinyu ;
Yang, Haitao ;
Zhu, Haozhe ;
Wang, Yu ;
Zhu, Jundong ;
Wen, Dexin ;
Wang, Lingli ;
Tao, Jun ;
Chen, Chixiao ;
Han, Yinhe ;
Liu, Qi ;
Sun, Ninghui ;
Liu, Ming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (09) :4156-4168
[50]   Behavior-Based Communication-Aware Formation Control in Dynamic Multi-Agent Systems for Jamming Detection and Avoidance [J].
Peccoud, Samuel ;
Xing, Sang ;
Yang, Tianyu ;
Stansbury, Richard S. .
SOUTHEASTCON 2024, 2024, :552-558