Cost-Aware Streaming Workflow Allocation on Geo-Distributed Data Centers

被引:34
作者
Chen, Wuhui [1 ]
Paik, Incheon [1 ]
Li, Zhenni [1 ]
机构
[1] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima 9658580, Japan
关键词
Big data processing; streaming workflow; geo-distributed DCs; streaming workflow allocation; VIRTUAL MACHINE PLACEMENT; ENERGY;
D O I
10.1109/TC.2016.2595579
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The virtual machine (VM) allocation problem in cloud computing has been widely studied in recent years, and many algorithms have been proposed in the literature. Most of them have been successfully applied to batch processing models such as MapReduce; however, none of them can be applied to streaming workflow well because of the following weaknesses: 1) failure to capture the characteristics of tasks in streaming workflow for the short life cycle of data streams; 2) most algorithms are based on the assumptions that the price of VMs and traffic among data centers (DCs) are static and fixed. In this paper, we propose a streaming workflow allocation algorithm that takes into consideration the characteristics of streaming work and the price diversity among geo-distributed DCs, to further achieve the goal of cost minimization for streaming big data processing. First, we construct an extended streaming workflow graph (ESWG) based on the task semantics of streaming workflow and the price diversity of geo-distributed DCs, and the streaming workflow allocation problem is formulated into mixed integer linear programming based on the ESWG. Second, we propose two heuristic algorithms to reduce the computational space based on task combination and DC combination in order to meet the strict latency requirement. Finally, our experimental results demonstrate significant performance gains with lower total cost and execution time.
引用
收藏
页码:256 / 271
页数:16
相关论文
共 38 条
[1]   Data-Intensive Workflow Optimization based on Application Task Graph Partitioning in Heterogeneous Computing Systems [J].
Ahmad, Saima Gulzar ;
Liew, Chee Sun ;
Rafique, M. Mustafa ;
Munir, Ehsan Ullah ;
Khan, Samee U. .
2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, :129-136
[2]   Virtual Machine Placement optimization supporting performance SLAs [J].
Anand, Ankit ;
Lakshmi, J. ;
Nandy, S. K. .
2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), VOL 1, 2013, :298-305
[3]  
[Anonymous], 2009, INT GEOGR DISP DAT C
[4]  
[Anonymous], 2015, P 9 ACM INT C DISTRI
[5]  
[Anonymous], TRINF20120402UNIPMN
[6]  
Batista DM, 2007, APPLIED COMPUTING 2007, VOL 1 AND 2, P209, DOI 10.1145/1244002.1244057
[7]   Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in Cloud data centers [J].
Beloglazov, Anton ;
Buyya, Rajkumar .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (13) :1397-1420
[8]   Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing [J].
Beloglazov, Anton ;
Abawajy, Jemal ;
Buyya, Rajkumar .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (05) :755-768
[9]  
Bittencourt L. F., 2012, P 10 INT WORKSH MIDD, P1
[10]  
Bobroff N, 2007, 2007 10TH IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2009), VOLS 1 AND 2, P119, DOI 10.1109/INM.2007.374776