An Approach for Characterizing Workloads in Google Cloud to Derive Realistic Resource Utilization Models

被引:111
作者
Moreno, Ismael Solis [1 ]
Garraghan, Peter [1 ]
Townend, Paul [1 ]
Xu, Jie [1 ]
机构
[1] Univ Leeds, Sch Comp, Leeds, W Yorkshire, England
来源
2013 IEEE SEVENTH INTERNATIONAL SYMPOSIUM ON SERVICE-ORIENTED SYSTEM ENGINEERING (SOSE 2013) | 2013年
基金
英国工程与自然科学研究理事会;
关键词
Cloud computing workload patterns; MapReduce analysis; resource usage patterns; workload characterization; MAPREDUCE;
D O I
10.1109/SOSE.2013.24
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Analyzing behavioral patterns of workloads is critical to understanding Cloud computing environments. However, until now only a limited number of real-world Cloud datacenter tracelogs have been available for analysis. This has led to a lack of methodologies to capture the diversity of patterns that exist in such datasets. This paper presents the first large-scale analysis of real-world Cloud data, using a recently released dataset that features traces from over 12,000 servers over the period of a month. Based on this analysis, we develop a novel approach for characterizing workloads that for the first time considers Cloud workload in the context of both user and task in order to derive a model to capture resource estimation and utilization patterns. The derived model assists in understanding the relationship between users and tasks within workload, and enables further work such as resource optimization, energy-efficiency improvements, and failure correlation. Additionally, it provides a mechanism to create patterns that randomly fluctuate based on realistic parameters. This is critical to emulating dynamic environments instead of statically replaying records in the tracelog. Our approach is evaluated by contrasting the logged data against simulation experiments, and our results show that the derived model parameters correctly describe the operational environment within a 5% of error margin, confirming the great variability of patterns that exist in Cloud computing.
引用
收藏
页码:49 / 60
页数:12
相关论文
共 31 条
  • [1] Aggarwal S., 2010, Proceedings of the 2010 IEEE 2nd International Conference on Cloud Computing Technology and Science (CloudCom 2010), P748, DOI 10.1109/CloudCom.2010.20
  • [2] [Anonymous], 2012, ISTCCCTR12101
  • [3] [Anonymous], 2011, Characterizing Task Usage Shapes in Google Compute Clusters
  • [4] [Anonymous], 2010, UCBEECS201014
  • [5] Bahga A., 2011, Journal of Software Engineering Applications, V4, P396, DOI [DOI 10.4236/JSEA.2011.47046, DOI 10.4236/jsea.2011.47046]
  • [6] Buyya Rajkumar, 2009, 2009 International Conference on High Performance Computing & Simulation (HPCS), P1, DOI 10.1109/HPCSIM.2009.5192685
  • [7] Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility
    Buyya, Rajkumar
    Yeo, Chee Shin
    Venugopal, Srikumar
    Broberg, James
    Brandic, Ivona
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (06): : 599 - 616
  • [8] Calheiros Rodrigo N., 2010, SOFTWARE PRACTICE EX
  • [9] Chen Y., 2010, AN LESS PUBL AV GOOG
  • [10] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137