Evaluating MapReduce on Virtual Machines: The Hadoop Case

被引:0
作者
Ibrahim, Shadi [1 ]
Jin, Hai [1 ]
Lu, Lu [1 ]
Qi, Li [2 ]
Wu, Song [1 ]
Shi, Xuanhua [1 ]
机构
[1] Huazhong Univ Sci & Technol, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab, Wuhan 430074, Peoples R China
[2] China Dev Bank, Operat Ctr, Beijing, Peoples R China
来源
CLOUD COMPUTING, PROCEEDINGS | 2009年 / 5931卷
关键词
Cloud Computing; Data Intensive; MapReduce; Hadoop; Distributed File System; Virtual Machine;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce is emerging as an important programming model for large scale parallel application. Meanwhile, Hadoop is an open source implementation of MapReduce enjoying wide popularity for developing data intensive applications in the cloud. As, in the cloud, the computing unit is virtual machine (VM) based; it is feasible to demonstrate the applicability of MapReduce on virtualized data center. Although the potential for poor performance and heavy load no doubt exists, virtual machines can instead be used to fully utilize the system resources, ease the management of such systems, improve the reliability, and save the power. In this paper, a series of experiments are conducted to measure and analyze the performance of Hadoop on VMs. Our experiments are used as a basis for outlining several issues that will need to be considered when implementing MapReduce to tit completely in the cloud.
引用
收藏
页码:519 / +
页数:2
相关论文
共 19 条
[1]  
*AM, AM EL CLOUD COMP
[2]  
[Anonymous], 2008, IRPTR0805
[3]  
[Anonymous], AM SIMPL STOR SERV
[4]  
[Anonymous], 2005, P USENIX S NETW SYST
[5]  
[Anonymous], Cnet news
[6]  
[Anonymous], AM EL MAPREDUCE
[7]  
BRYANT RE, 2007, CMUCS07128
[8]  
Dean J., 2004, P 6 C OP SYST DES IM
[9]  
FIGUEIREDO R, 2003, P 23 INT C DISTR COM, P550, DOI DOI 10.1109/ICDCS.2003.1203506
[10]  
Ghemawat S., 2003, Operating Systems Review, V37, P29, DOI 10.1145/1165389.945450