Genetic Algorithm based Data-aware Group Scheduling for Big Data Clouds

被引:8
作者
Kune, Raghavendra [1 ]
Konugurthi, Pramod Kumar [1 ]
Agarwal, Arun [2 ]
Chillarige, Raghavendra Rao [2 ]
Buyya, Rajkumar [3 ]
机构
[1] Dept Space, Adv Data Proc Res Inst, Hyderabad, Andhra Pradesh, India
[2] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad, Andhra Pradesh, India
[3] Univ Melbourne, Dept Comp & Informat Syst, CLOUDS Lab, Melbourne, Vic, Australia
来源
2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC) | 2014年
关键词
Big Data; Cloud computing; Data Intensive; Scheduling; Genetic algorithms; Big Data Clouds;
D O I
10.1109/BDC.2014.15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing is a promising cost efficient service oriented computing platform in the fields of science, engineering, business and social networking for delivering the resources on demand. Big Data Clouds is a new generation data analytics platform using Cloud computing as a back end technologies, for information mining, knowledge discovery and decision making based on statistical and empirical tools. MapReduce scheduling models for Big Data computing operate in the cluster mode, where the data nodes are pre-configured with the computing facility for processing. These MapReduce models are based on compute push model-pushing the logic to the data node for analysis, which is primarily for minimizing or eliminating data migration overheads between computing resources and data nodes. Such models, however, substantially perform well in the cluster setups, but are infelicitous for the platforms having the decoupled data storage and computing resources. In this paper, we propose a Genetic Algorithm based scheduler for such Big Data Cloud where decoupled computational and data services are offered as services. The approach is based on evolutionary methods focussed on data dependencies, computational resources and effective utilization of bandwidth thus achieving higher throughputs.
引用
收藏
页码:96 / 104
页数:9
相关论文
共 13 条
  • [1] [Anonymous], P 11 IEEE S HIGH PER
  • [2] [Anonymous], 1992, GENETIC ALGORITHMS D, DOI DOI 10.1007/978-3-662-03315-9
  • [3] Big Data computing and clouds: Trends and future directions
    Assuncao, Marcos D.
    Calheiros, Rodrigo N.
    Bianchi, Silvia
    Netto, Marco A. S.
    Buyya, Rajkumar
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 79-80 : 3 - 15
  • [4] CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms
    Calheiros, Rodrigo N.
    Ranjan, Rajiv
    Beloglazov, Anton
    De Rose, Cesar A. F.
    Buyya, Rajkumar
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (01) : 23 - 50
  • [5] Chaudhri A., 2012, INT C ADV CLOUD COMP
  • [6] Goldberg DE., 1989, GENETIC ALGORITHMS S, V13
  • [7] Gupta S., 2013, P INT C AUT COMP ICA
  • [8] Java GA Lib, 2014, GENETIC ALGORITHM
  • [9] MOHAMED H, 2004, P 2004 IEEE INT C CL
  • [10] OpenStack Swift, 2014, OBJ BAS STOR REST SE