DRL-based and Bsld-Aware Job Scheduling for Apache Spark Cluster in Hybrid Cloud Computing Environments

被引:1
|
作者
Shi, Wenhu [1 ]
Li, Hongjian [1 ]
Zeng, Hang [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Nanshan St, Chongqing 400000, Peoples R China
关键词
Spark; Job scheduling; Deep reinforcement learning; Hybrid cloud environments; ALGORITHM;
D O I
10.1007/s10723-022-09630-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spark is one of the most important big data computing engines, favored by academia and industry for its low latency and ease of use. The explosive growth in data volumes is causing computing tasks that could otherwise run on local or on-premise resources to become infeasible. The emergence of public clouds has solved the problem of shortage of local or on-premise resources. However, deploying clusters only on public clouds can be costly on the one hand and wasteful of available local resources on the other. Therefore, deploying a Spark cluster on both local and public cloud resources becomes a good solution to save cost and not waste local resources. When Spark is deployed in hybrid cloud environments, its default scheduling policy ignores job and environment characteristics leading to performance degradation and increased cluster usage costs. In this paper, A deep reinforcement learning-based (DRL-based) Spark job scheduler is proposed to improve cluster performance and reduce the total cost of cluster usage in hybrid cloud environments. Specifically, the proposed DRL agent can adaptively learn the characteristics of different types of jobs and hybrid cloud environments to rationally schedule Spark jobs to reduce the total cluster usage cost and the average bounded slowdown of jobs. A simulation environment is built to train the proposed scheduling agent, and the Spark Core module is extended to verify the effectiveness of the proposed scheduling agent. Experimental results show that the DRL-based algorithm improves performance by 5.55% and reduces the total cluster usage cost by 13.9% on average compared to the baseline algorithm in burst arrival mode.
引用
收藏
页数:23
相关论文
共 22 条
  • [1] DRL-based and Bsld-Aware Job Scheduling for Apache Spark Cluster in Hybrid Cloud Computing Environments
    Wenhu Shi
    Hongjian Li
    Hang Zeng
    Journal of Grid Computing, 2022, 20
  • [2] Adaptive DRL-Based Task Scheduling for Energy-Efficient Cloud Computing
    Kang, Kaixuan
    Ding, Ding
    Xie, Huamao
    Yin, Qian
    Zeng, Jing
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4948 - 4961
  • [3] Context-aware Job Scheduling for Cloud Computing Environments
    Assuncao, Marcos D.
    Netto, Marco A. S.
    Koch, Fernando
    Bianchi, Silvia
    2012 IEEE/ACM FIFTH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC 2012), 2012, : 255 - 262
  • [4] SLA-Based Scheduling of Spark Jobs in Hybrid Cloud Computing Environments
    Islam, Muhammed Tawfiqul
    Wu, Huaming
    Karunasekera, Shanika
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (05) : 1117 - 1132
  • [5] DRL-based Task Scheduling Scheme in Vehicular Fog Computing: Cooperative and mobility aware approach
    Raju, Mekala Ratna
    Mothku, Sai Krishna
    Somesula, Manoj Kumar
    AD HOC NETWORKS, 2025, 173
  • [6] Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments
    Islam, Muhammed Tawfiqul
    Karunasekera, Shanika
    Buyya, Rajkumar
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (07) : 1695 - 1710
  • [7] Cost-based job scheduling strategy in cloud computing environments
    Mansouri, N.
    Javidi, M. M.
    DISTRIBUTED AND PARALLEL DATABASES, 2020, 38 (02) : 365 - 400
  • [8] Cost-based job scheduling strategy in cloud computing environments
    N. Mansouri
    M. M. Javidi
    Distributed and Parallel Databases, 2020, 38 : 365 - 400
  • [9] Energy-aware scheduling for spark job based on deep reinforcement learning in cloud
    Li, Hongjian
    Lu, Liang
    Shi, Wenhu
    Tan, Gangfan
    Luo, Hao
    COMPUTING, 2023, 105 (08) : 1717 - 1743
  • [10] Energy-aware scheduling for spark job based on deep reinforcement learning in cloud
    Hongjian Li
    Liang Lu
    Wenhu Shi
    Gangfan Tan
    Hao Luo
    Computing, 2023, 105 : 1717 - 1743