Analyzing Hadoop power consumption and impact on application QoS

被引:22
作者
Conejero, Javier [1 ]
Rana, Omer [2 ]
Burnap, Peter [2 ]
Morgan, Jeffrey [3 ]
Caminero, Blanca [1 ]
Carrion, Carmen [1 ]
机构
[1] Univ Castilla La Mancha, Comp Syst Dept, Albacete 02071, Spain
[2] Cardiff Univ, Sch Comp & Informat, Cardiff CF24 3M, S Glam, Wales
[3] Cardiff Univ, Sch Social Sci, Cardiff CF10 3WT, S Glam, Wales
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2016年 / 55卷
关键词
Cloud computing; Power consumption; Hadoop; OpenNebula; Social media analysis;
D O I
10.1016/j.future.2015.03.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Energy efficiency is often identified as one of the key reasons for migrating to Cloud environments. It is stated that a data center hosting the Cloud environment is likely to achieve greater energy efficiency (at a reduced cost) compared to a local deployment. With increasing energy prices, it is also estimated that a large percentage of operational costs within a Cloud environment can be attributed to energy. In this work, we investigate and measure energy consumption of a number of virtual machines running the Hadoop system, over an OpenNebula Cloud. Our workload is based on sentiment analysis undertaken over Twitter messages. Our objective is to understand the tradeoff between energy efficiency and performance for such a workload. From our results we generalize and speculate on how such an analysis could be used as a basis to establish a Service Level Agreement (SLA) with a Cloud provider especially where there is likely to be a high level of variability (both in performance and energy use) over multiple runs of the same application (at different times). Among the service level objectives that might be included in a SLA, Quality of Service (QoS) related metrics (i.e., latency) are one of the most challenging to support. This work provides some insight on the relationship between power consumption and QoS related metrics, describing how a combined consideration of these two metrics could be supported for a particular workload. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:213 / 223
页数:11
相关论文
共 16 条
  • [1] [Anonymous], 2012, EuroSys, DOI [DOI 10.1145/2168836.2168843, 10.1145/2168836, DOI 10.1145/2168836]
  • [2] [Anonymous], 2009, Hadoop: The Definitive Guide
  • [3] A taxonomy and survey on Green Data Center Networks
    Bilal, Kashif
    Malik, Saif Ur Rehman
    Khalid, Osman
    Hameed, Abdul
    Alvarez, Enrique
    Wijaysekara, Vidura
    Irfan, Rizwana
    Shrestha, Sarjan
    Dwivedy, Debjyoti
    Ali, Mazhar
    Khan, Usman Shahid
    Abbas, Assad
    Jalil, Nauman
    Khan, Samee U.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 36 : 189 - 208
  • [4] Bing Shi, 2010, 2010 International Conference on Green Computing (Green Comp), P73, DOI 10.1109/GREENCOMP.2010.5598262
  • [5] Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
  • [6] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [7] Garg S., 2012, HARNESSING GREEN IT
  • [8] Ghamkhari M., 2012, P IEEE INT C COMM IC
  • [9] Kaushik R.T., 2010, Proceedings of the 2010 international conference on Power aware computing and systems, HotPower'10, P1
  • [10] Lam C., 2010, HADOOP ACTION