MORM: A Multi-objective Optimized Replication Management strategy for cloud storage cluster

被引:93
作者
Long, Sai-Qin [1 ]
Zhao, Yue-Long [1 ]
Chen, Wei [1 ]
机构
[1] S China Univ Technol, Sch Engn & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Replication management; Cloud storage; Multi-objective optimization; Artificial immune algorithm; COMPUTING ENVIRONMENTS; PERFORMANCE; ACCESS;
D O I
10.1016/j.sysarc.2013.11.012
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Effective data management is an important issue for a large-scale distributed environment such as data cloud. This can be achieved by using file replication, which efficiently reduces file service time and access latency, increases file availability and improves system load balancing. However, replication entails various costs such as storage and energy consumption for holding replicas. This article proposes a multi-objective offline optimization approach for replica management, in which we view the various factors influencing replication decisions such as mean file unavailability, mean service time, load variance, energy consumption and mean access latency as five objectives. It makes decisions of replication factor and replication layout with an improved artificial immune algorithm that evolves a set of solution candidates through clone, mutation and selection processes. The proposed algorithm named Multi-objective Optimized Replication Management (MORM) seeks the near optimal solutions by balancing the trade-offs among the five optimization objectives. The article reports a series of experiments that show the effectiveness of the MORM. Experimental results conclusively demonstrate that our MORM is energy effective and outperforms default replication management of HDFS (Hadoop Distributed File System) and MOE (Multi-objective Evolutionary) algorithm in terms of performance and load balancing for large-scale cloud storage cluster. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:234 / 244
页数:11
相关论文
共 34 条
[1]  
Aazami A., 2004, P INT WORKSH MULT IN
[2]  
[Anonymous], 2008, S3 SIMPL STOR SERV
[3]  
Bonvin N., 2009, ACDC 09
[4]   CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms [J].
Calheiros, Rodrigo N. ;
Ranjan, Rajiv ;
Beloglazov, Anton ;
De Rose, Cesar A. F. ;
Buyya, Rajkumar .
SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (01) :23-50
[5]   A dynamic data replication strategy using access-weights in data grids [J].
Chang, Ruay-Shiung ;
Chang, Hui-Ping .
JOURNAL OF SUPERCOMPUTING, 2008, 45 (03) :277-295
[6]   Dynamic hybrid replication effectively combining tree and grid topology [J].
Choi, Sung Chune ;
Youn, Hee Yong .
JOURNAL OF SUPERCOMPUTING, 2012, 59 (03) :1289-1311
[7]   Reliability in grid computing systems [J].
Dabrowski, Christopher .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2009, 21 (08) :927-959
[8]  
Dhruba B., 2010, HDFS ARCHITECTURE GU
[9]   A study on performance of dynamic file replication algorithms for real-time file access in Data Grids [J].
Dogan, Atakan .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (08) :829-839
[10]  
Ghemawat S., 2003, Operating Systems Review, V37, P29, DOI 10.1145/1165389.945450