FAST CLOUD: Pushing the Envelope on Delay Performance of Cloud Storage With Coding

Cited by: 48
Authors
Liang, Guanfeng [1 ]
Kozat, Ulas C. [1 ]
Affiliations
[1] DOCOMO Innovat Inc, Palo Alto, CA 94304 USA
Keywords
Cloud storage; delay; forward error correction (FEC); queueing;
DOI
10.1109/TNET.2013.2289382
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Our paper presents solutions that can significantly improve the delay performance of putting and retrieving data in and out of cloud storage. We first focus on measuring the delay performance of a very popular cloud storage service, Amazon S3. We establish that there is significant randomness in service times for reading and writing small and medium-sized objects when assigned distinct keys. We further demonstrate that using erasure coding, parallel connections to the storage cloud, and limited chunking (i.e., dividing the object into a few smaller objects) together pushes the envelope on service time distributions significantly (e.g., 76%, 80%, and 85% reductions in mean, 90th, and 99th percentiles for 2-MB files) at the expense of additional storage (e.g., 1.75x). However, chunking and erasure coding increase the load and hence the queuing delays while reducing the supportable rate region in number of requests per second per node. Thus, in the second part of our paper, we focus on analyzing the delay performance when chunking, forward error correction (FEC), and parallel connections are used together. Based on this analysis, we develop load-adaptive algorithms that can pick the best code rate on a per-request basis by using offline computed queue backlog thresholds. The solutions work with homogeneous services with fixed object sizes, chunk sizes, and operation types (e.g., read or write) as well as heterogeneous services with a mixture of object sizes, chunk sizes, and operation types. We also present a simple greedy solution that opportunistically uses idle connections and picks the erasure coding rate accordingly on the fly. Both backlog-based and greedy solutions support the full rate region and provide the best mean delay performance when compared to the best fixed coding rate policy. Our evaluations show that backlog-based solutions achieve better delay performance at higher percentile values than the greedy solution.
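The two ideas in the abstract, finishing a request once any k of n coded chunks arrive and picking the code rate from the queue backlog, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes independent, exponentially distributed chunk service times (the abstract only states that service times are random), and the function names, the (k, n) = (4, 7) example, and the threshold table are hypothetical. The pair (4, 7) is chosen only because 7/4 matches the 1.75x storage overhead quoted above.

```python
import random


def completion_time(k, n, rng, mean_chunk_delay=1.0):
    """Time until any k of n parallel chunk transfers have finished.

    Assumption: chunk delays are i.i.d. exponential with the given mean.
    """
    delays = sorted(rng.expovariate(1.0 / mean_chunk_delay) for _ in range(n))
    return delays[k - 1]  # k-th smallest delay


def simulate(k, n, trials=20000, seed=0):
    """Return (mean, 99th percentile) of the completion time by Monte Carlo."""
    rng = random.Random(seed)
    samples = sorted(completion_time(k, n, rng) for _ in range(trials))
    mean = sum(samples) / trials
    p99 = samples[int(0.99 * trials) - 1]
    return mean, p99


def pick_n(backlog, k, thresholds):
    """Choose how many coded chunks to request for the current backlog.

    thresholds: hypothetical list of (max_backlog, n) pairs, sorted by
    max_backlog. A short queue allows more redundancy; beyond the last
    threshold the request falls back to n = k (no redundancy).
    """
    for max_backlog, n in thresholds:
        if backlog <= max_backlog:
            return n
    return k


if __name__ == "__main__":
    print("uncoded (4 of 4):", simulate(4, 4))
    print("coded   (4 of 7):", simulate(4, 7))
    print("n at backlog 3:", pick_n(3, 4, [(2, 7), (5, 6), (10, 5)]))
```

Under these assumptions the coded configuration waits only for the fourth-fastest of seven transfers, so both its mean and its tail are lower than in the uncoded case, while the threshold lookup reduces redundancy as the queue grows so that the extra load does not shrink the supportable request rate.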
Pages: 2012-2025
Page count: 14
Related papers
20 in total
[1]  
[Anonymous], 2010, 2010 IEEE International Symposium on Parallel Distributed Processing (IPDPS), DOI 10.1109/INFCOM.2010.5462196
[2]  
[Anonymous], ARXIV12115405
[3]   Accessing multiple mirror sites in parallel: Using tornado codes to speed up downloads [J].
Byers, JW ;
Luby, M ;
Mitzenmacher, M .
IEEE INFOCOM '99 - THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-3, PROCEEDINGS: THE FUTURE IS NOW, 1999, :275-283
[4]   On the Delay of Network Coding over Line Networks [J].
Dikaliotis, Theodoros K. ;
Dimakis, Alexandros G. ;
Ho, Tracey ;
Effros, Michelle .
2009 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1- 4, 2009, :1408-1412
[5]   Network Coding for Distributed Storage Systems [J].
Dimakis, Alexandros G. ;
Godfrey, P. Brighten ;
Wu, Yunnan ;
Wainwright, Martin J. ;
Ramchandran, Kannan .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (09) :4539-4551
[6]   On the Delay and Throughput Gains of Coding in Unreliable Networks [J].
Eryilmaz, Atilla ;
Ozdaglar, Asuman ;
Medard, Muriel ;
Ahmed, Ebad .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, 54 (12) :5511-5524
[7]  
Ferner UJ, 2012, ANN ALLERTON CONF, P517, DOI 10.1109/Allerton.2012.6483262
[8]  
Gabrielyan E., 2006, COMPUT RES REPOS
[9]  
Garfinkel S. L., 2007, EVALUATION AMAZONS G
[10]  
Huang Cheng, 2012, USENIX ANN TECHN C A, P15