Fast Big Data Analysis in Geo-Distributed Cloud

被引:2
作者
Li, Yue [1 ]
Zhao, Laiping [2 ]
Cui, Chenzhou [3 ]
Yu, Ce [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Tianjin Univ, Sch Comp Software, Tianjin, Peoples R China
[3] CAS NAOC, Natl Astron Observ, Beijing, Peoples R China
来源
2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2016年
关键词
D O I
10.1109/CLUSTER.2016.28
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As cloud services grow to span more and more globally distributed datacenters, there is an increasingly need for scheduling algorithms to automatically place tasks across these datacenters. In geo-distributed cloud, the limited WAN bandwidth has become the major bottleneck in fast big data analytics. The scheduling algorithm needs to minimize the global completion time, by jointly optimizing task scheduling and WAN data transfer. In this paper, we model the task scheduling as a community detection problem, with respect to the dependency relations between task, data, and datacenters, and propose a Community Detection-based Scheduling (CDS) algorithm, which is able to minimize the WAN data transfer volume. We utilize the real China-Astronomy-Cloud network to evaluate the proposed algorithms. Experimental results show that we can reduce the total data transfer volume by up to 40.7%, and the global completion time by up to 35.8%, compared with the Hypergraph Partition-based scheduling algorithm and the greedy scheduling algorithm.
引用
收藏
页码:388 / 391
页数:4
相关论文
共 14 条
[1]  
Alicherry M, 2012, IEEE INFOCOM SER, P963, DOI 10.1109/INFCOM.2012.6195847
[2]   Greenhead: Virtual Data Center Embedding across Distributed Infrastructures [J].
Amokrane, Ahmed ;
Zhani, Mohamed Faten ;
Langar, Rami ;
Boutaba, Raouf ;
Pujolle, Guy .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2013, 1 (01) :36-49
[3]  
Catalyurek U.V., 2011, P 4 INT WORKSHOP DAT, P45
[4]  
Danna E, 2012, IEEE INFOCOM SER, P846, DOI 10.1109/INFCOM.2012.6195833
[5]   Community detection in graphs [J].
Fortunato, Santo .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2010, 486 (3-5) :75-174
[6]   Achieving High Utilization with Software-Driven WAN [J].
Hong, Chi-Yao ;
Kandula, Srikanth ;
Mahajan, Ratul ;
Zhang, Ming ;
Gill, Vijay ;
Nanduri, Mohan ;
Wattenhofer, Roger .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) :15-26
[7]   B4: Experience with a Globally-Deployed Software Defined WAN [J].
Jain, Sushant ;
Kumar, Alok ;
Mandal, Subhasree ;
Ong, Joon ;
Poutievski, Leon ;
Singh, Arjun ;
Venkata, Subbaiah ;
Wanderer, Jim ;
Zhou, Junlan ;
Zhu, Min ;
Zolla, Jonathan ;
Hoelzle, Urs ;
Stuart, Stephen ;
Vahdat, Amin .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) :3-14
[8]  
Khanna G, 2005, 2005 IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, P792
[9]   Finding and evaluating community structure in networks [J].
Newman, MEJ ;
Girvan, M .
PHYSICAL REVIEW E, 2004, 69 (02) :026113-1
[10]  
Vulimiri A., 2015, US C NETW SYST DES I