Mathematical modeling for further improving task scheduling on Big Data systems

被引:0
作者
Stavros Souravlas
Sofia Anastasiadou
Angelo Sifaleras
机构
[1] University of Macedonia,Department of Applied Informatics
[2] University of Western Macedonia,Faculty of Health Sciences, Department of Midwifery
来源
Computational Management Science | 2023年 / 20卷
关键词
Task scheduling; Big data streams; Task redistribution; Scheduling;
D O I
暂无
中图分类号
学科分类号
摘要
In the big data era which we have entered, the development of smart scheduler has become a necessity. A Distributed Stream Processing System (DSPS) has the role of assigning processing tasks to the available resources (dynamically or not) and route streaming data between them. Smart and efficient task scheduling can reduce latencies and eliminate network congestions. The most commonly used scheduler is the default Storm scheduler, which has proven to have certain disadvantages, like the inability to handle system changes in a dynamic environment. In such cases, rescheduling is necessary. This paper is an extension of a previous work on dynamic task scheduling. In such a scenario, some type of rescheduling is necessary to have the system working in the most efficient way. In this paper, we extend our previous works Souravlas and Anastasiadou (Appl Sci 10(14):4796, 2020); Souravlas et al. (Appl Sci 11(1):61, 2021) and present a mathematical model that offers better balance and produces fewer communication steps. The scheduler is based on the idea of generating larger sets of communication steps among the system nodes, which we call superclasses. Our experiments have shown that this scheme achieves better balancing and reduces the overall latency.
引用
收藏
相关论文
共 17 条
[1]  
Al-Sinayyid A(2020)Job scheduler for streaming applications in heterogeneous distributed processing systems J Supercomput 20 9609-9628
[2]  
Zhu M(2017)Dhalion: self-regulating stream processing in Heron Proc VLDB Endow 10 1825-1836
[3]  
Floratou A(2018)Model-driven scheduling for distributed stream processing systems J Parall Distrib Comput 117 98-114
[4]  
Agrawal A(2020)Pipelined dynamic scheduling of big data streams Appl Sci 10 4796-601
[5]  
Graham B(2021)More on pipelined dynamic scheduling of big data streams Appl Sci 11 61-undefined
[6]  
Rao S(2020)A review on big data real-time stream processing and its scheduling techniques Int J Parall Emerg Distrib Syst 35 571-undefined
[7]  
Ramasamy K(undefined)undefined undefined undefined undefined-undefined
[8]  
Shukla A(undefined)undefined undefined undefined undefined-undefined
[9]  
Simmhan Y(undefined)undefined undefined undefined undefined-undefined
[10]  
Souravlas S(undefined)undefined undefined undefined undefined-undefined