Spark workflow task scheduling with deadline and privacy constraints in hybrid cloud networks

Cited by: 0
Authors
Kamran Yaseen Rajput [1]
Li Xiaoping [1]
Abdullah Lakhan [2]
Affiliations
[1] Southeast University, School of Computer Science and Engineering
[2] Kristiania University College, School of Economics, Innovations, and Technology
[3] Dawood University of Engineering and Technology, Department of Cybersecurity
Keywords
Spark workflow tasks; Heuristic algorithm; Privacy; Deadline; Hybrid cloud;
DOI
10.1007/s00500-025-10486-2
Abstract
Organizations increasingly adopt hybrid clouds because they can supplement private cloud resources with public cloud capacity when required. However, scheduling distributed applications, such as workflow tasks, on hybrid cloud resources presents new and intricate challenges. A significant concern is the potential exposure of private data and tasks on third-party public cloud infrastructure, especially in sensitive domains such as healthcare. The complexity increases when resources must be selected from multiple cloud providers, because computation prices and data transmission costs fluctuate. This paper presents the Spark Workflow Task Scheduling to Hybrid Cloud (SWSHC) framework, designed to schedule Spark workflows precisely while adhering to deadline and task privacy constraints in a hybrid cloud setting. The approach comprises three components: deadline division, stage order optimization, and task scheduling. The workflow deadline is divided among the stages to bridge the gaps between stages effectively, and jobs are prioritized using the maximum rank rule. The proposed solution considers interval pricing variations, heterogeneous VM instances, intra- and inter-cloud bandwidth, and the efficient utilization of private cloud resources. After calibrating the algorithm and experimenting with various realistic workflows, the authors report that SWSHC outperforms existing solutions in the literature by up to 40–70% in terms of cost efficiency and resource utilization.
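As a rough illustration of two ideas named in the abstract, deadline division and the task privacy constraint, the following is a minimal sketch only. The abstract does not state the division rule, so the proportional split by estimated runtime used here is an assumption, and the `Stage` class, the `divide_deadline` and `eligible_clouds` helpers, and all numbers are hypothetical and not taken from SWSHC.

```python
from dataclasses import dataclass


@dataclass
class Stage:
    name: str
    est_runtime: float  # estimated runtime in seconds (hypothetical input)
    private: bool       # True if the stage handles privacy-sensitive data


def divide_deadline(stages, deadline):
    """Split a workflow deadline across sequential stages in proportion
    to each stage's estimated runtime (an assumed rule, not the paper's)."""
    total = sum(s.est_runtime for s in stages)
    if total <= 0:
        raise ValueError("total estimated runtime must be positive")
    return {s.name: deadline * s.est_runtime / total for s in stages}


def eligible_clouds(stage):
    """Privacy constraint: sensitive stages may run only on the private cloud."""
    return ["private"] if stage.private else ["private", "public"]


# Hypothetical three-stage workflow with a 200-second deadline.
stages = [
    Stage("load", 20.0, private=True),
    Stage("transform", 60.0, private=False),
    Stage("aggregate", 20.0, private=True),
]
sub_deadlines = divide_deadline(stages, deadline=200.0)

for s in stages:
    print(s.name, sub_deadlines[s.name], eligible_clouds(s))
```

In this sketch a scheduler would then choose, for each stage, the cheapest resource among the eligible clouds that still meets the stage's sub-deadline; that selection step, along with pricing and bandwidth modelling, is left out here.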
Pages: 783-801
Page count: 18