The Aggressive Oversubscribing Scheduling for Interactive Jobs on a Supercomputing System

Cited by: 1
Authors
Minami, Shohei [1, 2]
Endo, Toshio [2 ]
Nomura, Akihiro [2 ]
Affiliations
[1] Prometech Software Inc, Tokyo, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
Source
2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC | 2023
Keywords
Job scheduling; Simulator; Oversubscribing; Interactive Jobs; Supercomputing systems;
DOI
10.1109/HPEC58863.2023.10363580
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812
Abstract
As interactive use of supercomputing systems becomes popular, especially in the AI and machine learning (ML) fields, these systems are expected to provide resources in real time. Because interactive jobs have different characteristics from traditional batch jobs, the systems should be designed to accept both types of jobs efficiently. This paper shows that aggressive oversubscribing scheduling, in which multiple jobs share computational resources regardless of job type, can effectively process hybrid jobs. It investigates the behavior of real interactive jobs with fluctuating CPU utilization and describes a simulation method that combines existing workload trace data with CPU utilization data. Through the evaluation, we demonstrate that oversubscribing scheduling achieves a short response time for interactive jobs. Our solution also eliminates the need to configure dedicated queues for each job type and is robust to changes in demand for interactive jobs.
Pages: 7