Cloud-nativeWorkflow Scheduling using a Hybrid Priority Rule and Dynamic Task Parallelism

被引:2
作者
Shin, Jungeun [1 ]
Arroyo, Diana [2 ]
Tantawi, Asser [2 ]
Wang, Chen [2 ]
Youssef, Alaa [2 ]
Nagi, Rakesh [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY USA
来源
PROCEEDINGS OF THE 13TH SYMPOSIUM ON CLOUD COMPUTING, SOCC 2022 | 2022年
关键词
MACHINE; ALGORITHM;
D O I
10.1145/3542929.3563495
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Demand for efficient cloud-native workflow scheduling is growing as many data science workloads are composed of several tasks with dependencies. As container technology becomes more prevalent in cloud communities, containerized workflow orchestration tools are introduced and become standard for scheduling workflows. However, current schedulers use simple heuristics and rely on the user's choice on priority and parallelism level of tasks without accounting for workflow-specific information. We introduce aworkflow-aware scheduling algorithm that usesworkflowinformation for scheduling tasks, without user input, with an objective of improving resource utilization and minimizing weighted workflow completion time, duration multiplied by user specific workflow priority. Our scheduler comprises of two strategies, a hybrid priority rule inspired by production planning ideas, and a task splitting rule based on a convex task processing time curve for the parallelism level. Using simulation, we demonstrate that our algorithm (1) produces an efficient balance of weighted workflow completion time and resource utilization and (2) outperforms deterministic parallelism.
引用
收藏
页码:72 / 77
页数:6
相关论文
共 18 条
[1]  
Agrawal K., 2006, Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming PPoPP'06, P100, DOI 10.1145/1122971.1122988
[2]  
Agrawal Kunal., 2016, Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2016, Arlington, VA, USA, January 10-12, 2016, P176
[3]  
[Anonymous], 2014, Introduction to Probability Models, DOI DOI 10.1016/C2012-0-03564-8
[4]   Parallel machine scheduling with fuzzy processing times using a robust genetic algorithm and simulation [J].
Balin, Savas .
INFORMATION SCIENCES, 2011, 181 (17) :3551-3569
[5]   KubeSphere: An Approach to Multi-Tenant Fair Scheduling for Kubernetes Clusters [J].
Beltre, Angel ;
Saha, Pankaj ;
Govindaraju, Madhusudhan .
2019 3RD IEEE INTERNATIONAL CONFERENCE ON CLOUD AND FOG COMPUTING TECHNOLOGIES AND APPLICATIONS (IEEE CLOUD SUMMIT 2019), 2019, :14-20
[6]   Containers and Cloud: From LXC to Docker to Kubernetes [J].
Bernstein, David .
IEEE CLOUD COMPUTING, 2014, 1 (03) :81-84
[7]   SCARL: Attentive Reinforcement Learning-Based Scheduling in a Multi-Resource Heterogeneous Cluster [J].
Cheong, Mukoe ;
Lee, Hyunsung ;
Yeom, Ikjun ;
Woo, Honguk .
IEEE ACCESS, 2019, 7 (153432-153444) :153432-153444
[8]   A complete immunoglobulin-based artificial immune system algorithm for two-stage assembly flowshop scheduling problem with part splitting and distinct due windows [J].
Chung, Tsui-Ping ;
Chen, Feng .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2019, 57 (10) :3219-3237
[9]   Batch splitting in an assembly scheduling environment [J].
Dastidar, Satyaki Ghosh ;
Nagi, Rakesh .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2007, 105 (02) :372-384
[10]  
Docker, 2021, What is a Container?