Efficient deadline-aware scheduling for the analysis of Big Data streams in public Cloud

被引:14
|
作者
Mortazavi-Dehkordi, Mahmood [1 ]
Zamanifar, Kamran [1 ]
机构
[1] Univ Isfahan, Comp Engn Fac, Software Dept, Esfahan, Iran
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2020年 / 23卷 / 01期
关键词
Streaming Big Data analysis query; Deadline-aware scheduling; Cloud-based stream processing; REAL-TIME; RESOURCE-MANAGEMENT; SIMULATION;
D O I
10.1007/s10586-019-02908-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of Big Data has had a profound impact on how data are analyzed. Open source distributed stream processing platforms have gained popularity for analyzing streaming Big Data as they provide low latency required for streaming Big Data applications using Cloud resources. However, existing resource schedulers are still lacking the efficiency and deadline meeting that Big Data analytical applications require. Recent works have already considered streaming Big Data characteristics to improve the efficiency and the likelihood of deadline meeting for scheduling in the platforms. Nevertheless, they have not taken into account the specific attributes of analytical application, public Cloud utilization cost and delays caused by performance degradation of leasing public Cloud resources. This study, therefore, presents BCframework, an efficient deadline-aware scheduling framework used by streaming Big Data analysis applications based on public Cloud resources. BCframework proposes a scheduling model which considers public Cloud utilization cost, performance variation, deadline meeting and latency reduction requirements of streaming Big Data analytical applications. Furthermore, it introduces two operator scheduling algorithms based on both a novel partitioning algorithm and an operator replication method. BCframework is highly adaptable to the fluctuation of streaming Big Data and the performance degradation of public Cloud resources. Experiments with the benchmark and real-world queries show that BCframework can significantly reduce the latency and utilization cost and also minimize deadline violations and provisioned virtual machine instances.
引用
收藏
页码:241 / 263
页数:23
相关论文
共 46 条
  • [1] Efficient deadline-aware scheduling for the analysis of Big Data streams in public Cloud
    Mahmood Mortazavi-Dehkordi
    Kamran Zamanifar
    Cluster Computing, 2020, 23 : 241 - 263
  • [2] Brief Announcement: Deadline-Aware Scheduling of Big-Data Processing Jobs
    Bodik, Peter
    Menache, Ishai
    Naor, Joseph
    Yaniv, Jonathan
    PROCEEDINGS OF THE 26TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'14), 2014, : 211 - 213
  • [3] Deadline-aware Scheduling in Cloud-Fog-Edge Systems
    Postoaca, Andrei-Vlad
    Negru, Catalin
    Pop, Florin
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 691 - 698
  • [4] Urgent point aware energy-efficient scheduling of tasks with hard deadline on virtualized cloud system
    Ghose, Manojit
    Sahu, Aryabartta
    Karmakar, Sushanta
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2020, 28 (28):
  • [5] Deadline-aware advance reservation scheduling algorithms for media production networks
    Barshan, Maryam
    Moens, Hendrik
    Famaey, Jeroen
    De Turck, Filip
    COMPUTER COMMUNICATIONS, 2016, 77 : 26 - 40
  • [6] FlowTime: Dynamic Scheduling of Deadline-Aware Workflows and Ad-hoc Jobs
    Hu, Zhiming
    Li, Baochun
    Chen, Chen
    Ke, Xiaodi
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 929 - 938
  • [7] RADL: a resource and deadline-aware dynamic load-balancer for cloud tasks
    Nabi, Said
    Aleem, Muhammad
    Ahmed, Masroor
    Islam, Muhammad Arshad
    Iqbal, Muhammad Azhar
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (12): : 14231 - 14265
  • [8] DALBFog: Deadline-Aware and Load-Balanced Task Scheduling for the Internet of Things in Fog Computing
    Ibrahim, Muhammad
    Lee, Yunjung
    Kim, Do-Hyuen
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2024, 10 (01): : 62 - 71
  • [9] Earliest Deadline Scheduling for Continuous Queries over Data Streams
    Li, Xin
    Jia, Zhiping
    Ma, Li
    Zhang, Ruihua
    Wang, Haiyang
    2009 INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, PROCEEDINGS, 2009, : 57 - +
  • [10] DEEDSP: Deadline-aware and energy-efficient dynamic service placement in integrated Internet of Things and fog computing environments
    Raghavendra, Meeniga Sri
    Chawla, Priyanka
    Gill, Sukhpal Singh
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (12)