The Load Rebalancing Problem in Distributed File Systems

被引:6
作者
Chung, Hsueh-Yi [1 ]
Chang, Che-Wei [1 ]
Hsiao, Hung-Chang [1 ]
Chao, Yu-Chang [2 ]
机构
[1] Natl Cheng Kung Univ, Tainan 70101, Taiwan
[2] Ind Technol Res Inst S, Tainan 709, Taiwan
来源
2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2012年
关键词
D O I
10.1109/CLUSTER.2012.31
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed file systems (DFS) are key building blocks for cloud computing applications based on the MapReduce programming paradigm. In such file systems, nodes simultaneously serve computing and storage functions; a file is partitioned into a number of chunks allocated in distinct nodes so that MapReduce tasks can be performed in parallel over the nodes. However, in a cloud computing environment, failure is the norm, and nodes may be upgraded, replaced, and added in the system. Files can also be dynamically created, deleted, and appended. This results in load imbalance; that is, the file chunks are not distributed as uniformly as possible in the nodes. Although distributed load balancing algorithms exist in the literature to deal with the load imbalance problem, emerging DFSs in production systems strongly depend on a central node for chunk reallocation. This dependence is clearly inadequate in a large-scale, failure-prone environment because the central load balancer is put under considerable workload that is linearly scaled with the system size, and may thus become the performance bottleneck and the single point of failure. In this paper, we illustrate and define the load rebalancing problem in cloud DFSs. We advocate file systems in clouds shall incorporate decentralized load rebalancing algorithms to eliminate the performance bottleneck and the single point of failure. Simulation results for a potential distributed load balancing algorithm are illustrated. The performance of our proposal implemented in the Hadoop distributed file system is also demonstrated.
引用
收藏
页码:117 / 125
页数:9
相关论文
共 24 条
[1]   Symbiotic Routing in Future Data Centers [J].
Abu-Libdeh, Hussam ;
Costa, Paolo ;
Rowstron, Antony ;
O'Shea, Greg ;
Donnelly, Austin .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) :51-62
[2]  
[Anonymous], LNCS
[3]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[4]  
[Anonymous], BEOWULF CLUSTER COMP
[5]  
[Anonymous], 2003, PROC 19 ACM S OPERAT
[6]  
Byers J, 2003, LECT NOTES COMPUT SC, V2735, P80
[7]  
COPELAND G, 1988, P ACM SIGMOD INT C M, P99
[8]  
DeCandia Giuseppe, 2007, Operating Systems Review, V41, P205, DOI 10.1145/1323293.1294281
[9]  
Eastlake D, 2001, RFC3174: US Secure Hash Algorithm 1 (SHA1)
[10]  
Ganesan P., 2004, VLDB, P444