MORM: A Multi-objective Optimized Replication Management strategy for cloud storage cluster

被引:93
作者
Long, Sai-Qin [1 ]
Zhao, Yue-Long [1 ]
Chen, Wei [1 ]
机构
[1] S China Univ Technol, Sch Engn & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Replication management; Cloud storage; Multi-objective optimization; Artificial immune algorithm; COMPUTING ENVIRONMENTS; PERFORMANCE; ACCESS;
D O I
10.1016/j.sysarc.2013.11.012
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Effective data management is an important issue for a large-scale distributed environment such as data cloud. This can be achieved by using file replication, which efficiently reduces file service time and access latency, increases file availability and improves system load balancing. However, replication entails various costs such as storage and energy consumption for holding replicas. This article proposes a multi-objective offline optimization approach for replica management, in which we view the various factors influencing replication decisions such as mean file unavailability, mean service time, load variance, energy consumption and mean access latency as five objectives. It makes decisions of replication factor and replication layout with an improved artificial immune algorithm that evolves a set of solution candidates through clone, mutation and selection processes. The proposed algorithm named Multi-objective Optimized Replication Management (MORM) seeks the near optimal solutions by balancing the trade-offs among the five optimization objectives. The article reports a series of experiments that show the effectiveness of the MORM. Experimental results conclusively demonstrate that our MORM is energy effective and outperforms default replication management of HDFS (Hadoop Distributed File System) and MOE (Multi-objective Evolutionary) algorithm in terms of performance and load balancing for large-scale cloud storage cluster. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:234 / 244
页数:11
相关论文
共 34 条
[21]  
Moran M.J., 1995, FUNDAMENTALS ENG THE
[22]   Data Replication in Data Intensive Scientific Applications with Performance Guarantee [J].
Nukarapu, Dharma Teja ;
Tang, Bin ;
Wang, Liqiang ;
Lu, Shiyong .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (08) :1299-1306
[23]  
Phan D.H., 2012, GECCO 12 JUL 7 11 PH
[24]  
Qingsong Wei, 2010, Proceedings of the 2010 IEEE International Conference on Cluster Computing (CLUSTER 2010), P188, DOI 10.1109/CLUSTER.2010.24
[25]  
Qiu LL, 2001, IEEE INFOCOM SER, P1587, DOI 10.1109/INFCOM.2001.916655
[26]  
Rahman RM, 2006, SIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, P434
[27]  
RANGANATHAN K, 2001, P 2 INT WORKSH GRID
[28]  
Shang Rong-Hua, 2007, Journal of Software, V18, P2700, DOI 10.1360/josl82700
[29]  
Shvachko K., 2010, SYMPOSIUM, P1, DOI DOI 10.1109/MSST.2010.5496972
[30]   Benefit-based data caching in ad hoc networks [J].
Tang, Bin ;
Gupta, Himanshu ;
Das, Samir R. .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2008, 7 (03) :289-304