Efficient Replica Migration Scheme for Distributed Cloud Storage Systems

Cited by: 16
Authors
Mseddi, Amina [1]
Salahuddin, Mohammad A. [2]
Zhani, Mohamed Faten [3]
Elbiaze, Halima [1]
Glitho, Roch H. [4, 5]
Affiliations
[1] Univ Quebec Montreal, Dept Comp Sci, Montreal, PQ H2L 2C4, Canada
[2] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[3] Ecole Technol Super ETS Montreal, Dept Software & IT Engn, Montreal, PQ H3C 1K3, Canada
[4] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
[5] Univ Western Cape, Comp Sci Programme, ZA-7535 Cape Town, South Africa
Keywords
Cloud storage; data availability; data migration; replica management
DOI
10.1109/TCC.2018.2858792
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the wide adoption of large-scale Internet services and big data, the cloud has become the ideal environment to satisfy the ever-growing storage demand. In this context, data replication has been touted as the ultimate solution to improve data availability and reduce access time. However, replica management systems usually need to create and migrate a large number of data replicas over time, both between and within data centers, incurring a large overhead in terms of network load and availability. In this paper, we propose CRANE, an effiCient Replica migrAtion scheme for distributed cloud Storage systEms. CRANE complements any replica placement algorithm by efficiently managing replica creation in geo-distributed infrastructures in order to (1) minimize the time needed to copy the data to the new replica location, (2) avoid network congestion, and (3) ensure the minimum desired availability for the data. Through simulation and experimental results, we show that CRANE provides a sub-optimal solution for the replica migration problem with lower computational complexity than its integer linear program formulation. We also show that, compared to OpenStack Swift, CRANE reduces replica creation and migration time by up to 60 percent and inter-data center network traffic by up to 50 percent, while ensuring the minimum required data availability.
Pages: 155-167
Number of pages: 13