A Survey on MapReduce Implementations

Cited by: 3
Authors
Al-Badarneh, Amer [1]
Mohammad, Amr [1]
Harb, Salah [1]
Affiliations
[1] Jordan Univ Sci & Technol, Irbid, Jordan
Keywords
Big Data Management; Cloud Computing; Distributed File System; Incremental Data Processing; Multiprocessing; Parallel Processing
DOI
10.4018/IJCAC.2016010104
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
MapReduce, a notably successful platform for parallel data processing, is gaining significant momentum in both academia and industry as the volume of data to capture, transform, and analyse grows rapidly. Although MapReduce is used in many applications to analyse large-scale data sets, scientists and researchers still debate its efficiency, performance, and usability for supporting more classes of applications. This survey presents a comprehensive review of various implementations of the MapReduce framework. The authors first give an overview of the MapReduce programming model. They then describe the technical aspects of the most successful implementations of the MapReduce framework reported in the literature and discuss their main strengths and weaknesses. Finally, the authors compare the MapReduce implementations and discuss open issues and challenges in enhancing MapReduce.
Pages: 59-87
Number of pages: 29