The Aggressive Oversubscribing Scheduling for Interactive Jobs on a Supercomputing System

被引:1
作者
Minami, Shohei [1 ,2 ]
Endo, Toshio [2 ]
Nomura, Akihiro [2 ]
机构
[1] Prometech Softwere Inc, Tokyo, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
来源
2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC | 2023年
关键词
Job scheduling; Simulator; Oversubscribing; Interactive Jobs; Supercomputing systems;
D O I
10.1109/HPEC58863.2023.10363580
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As interactive usages of supercomputing systems become popular, especially in the AI and machine learning (ML) fields, the systems are expected to provide resources in real time. As interactive jobs have different features from traditional batch jobs, the systems should be designed to accept both types of jobs efficiently. This paper shows that the aggressive oversubscribing scheduling, in which multiple jobs share computational resources regardless of job types, can effectively process hybrid jobs. This paper investigates behaviors of the real interactive jobs with fluctuating CPU utilization. And a simulation method is described, which combines existing workload trace data and data on CPU utilization. Through the evaluation, we demonstrate oversubscribing scheduling achieves a short response time for interactive jobs. Also our solution eliminates the necessity of configuring dedicated queues for job types and achieves robustness towards the change of demand of interactive jobs.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] A performance comparison of data-aware heuristics for scheduling jobs in mobile Grids
    Hirsch, Matias
    Mateos, Cristian
    Rodriguez, Juan M.
    Zunino, Alejandro
    Gari, Yisel
    Monge, David A.
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [42] Heuristic Scheduling Strategies for Linear-Dependent and Independent Jobs on Heterogeneous Grids
    Tsai, Min-Yi
    Chiang, Ping-Fang
    Chang, Yen-Jan
    Wang, Wei-Jen
    GRID AND DISTRIBUTED COMPUTING, 2011, 261 : 496 - 505
  • [43] Smart-mDAG: An Intelligent Scheduling Method for Multi-DAG Jobs
    Zhu, Yifan
    Hu, Bo
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 110 - 115
  • [44] A fine-grained GPU sharing and job scheduling for deep learning jobs on the cloud
    Chung, Wu-Chun
    Tong, Jyun-Sen
    Chen, Zhi-Hao
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (02)
  • [45] A framework for scheduling IoT application jobs on fog computing infrastructure based on QoS parameters
    Kaur, Mandeep
    Sandhu, Rajinder
    Mohana, Rajni
    INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS, 2023, 19 (03) : 364 - 385
  • [46] Scheduling multi-colour print jobs with sequence-dependent setup times
    A. P. Burger
    C. G. Jacobs
    J. H. van Vuuren
    S. E. Visagie
    Journal of Scheduling, 2015, 18 : 131 - 145
  • [47] Minimizing the expected cybersecurity loss in a software supply chain through scheduling scanning jobs
    Wang, Jen-Ya
    COMPUTERS & OPERATIONS RESEARCH, 2025, 180
  • [48] Scheduling multi-colour print jobs with sequence-dependent setup times
    Burger, A. P.
    Jacobs, C. G.
    van Vuuren, J. H.
    Visagie, S. E.
    JOURNAL OF SCHEDULING, 2015, 18 (02) : 131 - 145
  • [49] DRAGON: A Dynamic Scheduling and Scaling Controller for Managing Distributed Deep Learning Jobs in Kubernetes Cluster
    Lin, Chan-Yi
    Yeh, Ting-An
    Chou, Jerry
    CLOSER: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2019, : 569 - 577
  • [50] Grouping-based Scheduling with Load Balancing for Fine-Grained Jobs in Grid Computing
    Ezzat, Rabab Mohamed
    Aboutabl, Amal Elsayed
    Mostafa, Mostafa Sami
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2013, 4 (11) : 67 - 75