Applying MapReduce Framework to Peer-to-Peer Overlay Network

被引:1
作者
Xiao, Pei [1 ]
Zhang, Xiaolu [1 ]
Wang, Jin [1 ]
Zhang, Jixian [1 ]
Han, Qiang [1 ]
Zhang, Xuejie [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming, Peoples R China
来源
PROCEEDINGS 2014 INTERNATIONAL CONFERENCE ON SERVICE SCIENCES (ICSS 2014) | 2014年
关键词
MapReduce; Peer-to-Peer; Distributed Computing; SERVICE;
D O I
10.1109/ICSS.2014.21
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
MapReduce is a programming framework widely used in cloud computing environments for processing large amount of data in a highly parallel way. However, current MapReduce model do not cope well with its scalability, which means that under certain hardware configuration, it can only support limited scale of cluster due to the overloading of center node. In this paper, we present a prototype based on DHTs Peer-to-Peer MapReduce system, which removed the MapReduce task centralized scheduling's master node and bottom file system management's name node on the basis of remaining original MapReduce workflow unchanged. In the system, the distributed file system in bottom layer queries data through distributed hashing, while the MapReduce system in upper layer invoke and schedule the tasks by distributed notification mechanism. In this way, the system can theoretically achieve the scalability of Peer-to-Peer system. The scalability evaluation of the system has been experimented in the network scenarios using the prevailing word count problem.
引用
收藏
页码:96 / 100
页数:5
相关论文
共 11 条
[1]  
[Anonymous], 2010, P 19 ACM INT S HIGH, DOI DOI 10.1145/1851476.1851593
[2]  
[Anonymous], 2010, P USENIX S OP SYST D
[3]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[4]   Sector and Sphere: the design and implementation of a high-performance data cloud [J].
Gu, Yunhong ;
Grossman, Robert L. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2009, 367 (1897) :2429-2445
[5]  
Dang HT, 2012, LECT NOTES COMPUT SC, V7654, P69, DOI 10.1007/978-3-642-34707-8_8
[6]  
Isard M., 2007, Operating Systems Review, V41, P59, DOI 10.1145/1272998.1273005
[7]  
Karger DavidR., 1997, P 29 ANN ACM S THEOR, P654
[8]   P2P-MapReduce: Parallel data processing in dynamic Cloud environments [J].
Marozzo, Fabrizio ;
Talia, Domenico ;
Trunfio, Paolo .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2012, 78 (05) :1382-1402
[9]  
Power R., 2010, P 9 USENIX C OPERATI, P1
[10]  
Ranjan R, 2010, COMPUT COMMUN NETW S, P195, DOI 10.1007/978-1-84996-241-4_12