Efficient deadline-aware scheduling for the analysis of Big Data streams in public Cloud

被引:13
作者
Mortazavi-Dehkordi, Mahmood [1 ]
Zamanifar, Kamran [1 ]
机构
[1] Univ Isfahan, Comp Engn Fac, Software Dept, Esfahan, Iran
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2020年 / 23卷 / 01期
关键词
Streaming Big Data analysis query; Deadline-aware scheduling; Cloud-based stream processing; REAL-TIME; RESOURCE-MANAGEMENT; SIMULATION;
D O I
10.1007/s10586-019-02908-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of Big Data has had a profound impact on how data are analyzed. Open source distributed stream processing platforms have gained popularity for analyzing streaming Big Data as they provide low latency required for streaming Big Data applications using Cloud resources. However, existing resource schedulers are still lacking the efficiency and deadline meeting that Big Data analytical applications require. Recent works have already considered streaming Big Data characteristics to improve the efficiency and the likelihood of deadline meeting for scheduling in the platforms. Nevertheless, they have not taken into account the specific attributes of analytical application, public Cloud utilization cost and delays caused by performance degradation of leasing public Cloud resources. This study, therefore, presents BCframework, an efficient deadline-aware scheduling framework used by streaming Big Data analysis applications based on public Cloud resources. BCframework proposes a scheduling model which considers public Cloud utilization cost, performance variation, deadline meeting and latency reduction requirements of streaming Big Data analytical applications. Furthermore, it introduces two operator scheduling algorithms based on both a novel partitioning algorithm and an operator replication method. BCframework is highly adaptable to the fluctuation of streaming Big Data and the performance degradation of public Cloud resources. Experiments with the benchmark and real-world queries show that BCframework can significantly reduce the latency and utilization cost and also minimize deadline violations and provisioned virtual machine instances.
引用
收藏
页码:241 / 263
页数:23
相关论文
共 47 条
[41]   An Efficient Task Scheduling for Cloud Computing Platforms Using Energy Management Algorithm: A Comparative Analysis of Workflow Execution Time [J].
Ahmed, Adeel ;
Adnan, Muhammad ;
Abdullah, Saima ;
Ahmad, Israr ;
Alturki, Nazik ;
Jamel, Leila .
IEEE ACCESS, 2024, 12 :34208-34221
[42]   Cost-Efficient and Quality-of-Experience-Aware Player Request Scheduling and Rendering Server Allocation for Edge-Computing-Assisted Multiplayer Cloud Gaming [J].
Gao, Yongqiang ;
Zhang, Chaoyu ;
Xie, Zhulong ;
Qi, Zhengwei ;
Zhou, Jiantao .
IEEE INTERNET OF THINGS JOURNAL, 2021, 9 (14) :12029-12040
[43]   Cost-Efficient and Quality of Experience-Aware Provisioning of Virtual Machines for Multiplayer Cloud Gaming in Geographically Distributed Data Centers [J].
Gao, Yongqiang ;
Wang, Lin ;
Zhou, Jiantao .
IEEE ACCESS, 2019, 7 :142574-142585
[44]   Novel fuzzy multi objective DVFS-aware consolidation heuristics for energy and SLA efficient resource management in cloud data centers [J].
Arianyan, Ehsan ;
Taheri, Hassan ;
Khoshdel, Vahid .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 78 :43-61
[45]   EMO-TS: An Enhanced Multi-Objective Optimization Algorithm for Energy-Efficient Task Scheduling in Cloud Data Centers [J].
Nambi, S. ;
Thanapal, P. .
IEEE ACCESS, 2025, 13 :8187-8200
[46]   Data Analysis-Oriented Stochastic Scheduling for Cost Efficient Resource Allocation in NFV Based MEC Network [J].
Li, Baozhu ;
Hou, Fen ;
Yang, Gangqiang ;
Zhao, Hui ;
Chen, Shanzhi .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) :6695-6708
[47]   A decentralized adaptation of model-free Q-learning for thermal-aware energy-efficient virtual machine placement in cloud data centers [J].
Aghasi, Ali ;
Jamshidi, Kamal ;
Bohlooli, Ali ;
Javadi, Bahman .
COMPUTER NETWORKS, 2023, 224