Incremental encoding for erasure-coded cross-datacenters cloud storage

Cited by: 4
Authors
Xu, Fangliang [1, 2]
Wang, Yijie [1, 2]
Ma, Xingkong [2]
Affiliations
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018 / Vol. 87
Funding
National Natural Science Foundation of China;
Keywords
Cloud storage systems; Cross datacenters; Erasure code; Data encoding;
DOI
10.1016/j.future.2018.04.047
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
As a storage-efficient approach, erasure coding has been adopted by many large-scale cloud storage systems to protect data from server and datacenter failures. For erasure-coded storage systems, it is critical to encode newly written data blocks and generate parity blocks efficiently. Existing encoding approaches include Striping Encoding and Replicating Encoding. They either incur excessive network traffic or seriously degrade I/O performance. In this paper, we propose Incremental Encoding, a decentralized encoding framework for all linear erasure codes. To achieve optimal write performance, Incremental Encoding forwards newly written data blocks to multiple servers in a pipelined manner. To reduce network traffic, Incremental Encoding incrementally combines newly written data blocks as they flow through servers to generate parity blocks. Incremental Encoding also caches intermediate parity blocks in memory to further reduce disk I/O. We evaluate Incremental Encoding by theoretically analyzing the encoding overheads and conducting a series of experiments in both a single-datacenter environment and a cross-datacenter environment. Analysis and experiments show that Incremental Encoding achieves a much better trade-off between network traffic and I/O performance. Specifically, compared with Replicating Encoding, which has the optimal I/O performance, Incremental Encoding has nearly the same I/O performance with 44.5%-48.4% less encoding traffic. Compared with Striping Encoding, Incremental Encoding has up to 90% better write performance and up to 108% better read performance with 56.25%-73.6% more encoding traffic. © 2018 Elsevier B.V. All rights reserved.
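The idea of combining data blocks incrementally into a parity block can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the simplest linear code (a single XOR parity over GF(2)), and the names `ParityServer`, `absorb`, and `batch_parity` are hypothetical. It shows only that a parity block accumulated one data block at a time, with the intermediate result held in memory, equals the parity computed after all blocks have arrived.

```python
from functools import reduce


def xor_blocks(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length blocks (addition in GF(2))."""
    return bytes(x ^ y for x, y in zip(a, b))


def batch_parity(blocks: list[bytes]) -> bytes:
    """Reference result: parity computed once all data blocks are available."""
    return reduce(xor_blocks, blocks)


class ParityServer:
    """Hypothetical server that folds each passing data block into a parity."""

    def __init__(self, block_size: int) -> None:
        # Intermediate parity kept in memory; all zeros is the XOR identity.
        self.partial = bytes(block_size)
        self.seen = 0

    def absorb(self, block: bytes) -> None:
        # Because the code is linear, the parity can be updated per block.
        self.partial = xor_blocks(self.partial, block)
        self.seen += 1


# Four 4-byte data blocks arrive one at a time.
blocks = [bytes([value] * 4) for value in (1, 2, 4, 8)]
server = ParityServer(block_size=4)
for block in blocks:
    server.absorb(block)

incremental = server.partial
print(incremental == batch_parity(blocks))
```

A general linear erasure code would replace the XOR with a multiplication by a per-block coefficient in a larger Galois field before the addition; the accumulation pattern stays the same.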
Pages: 527-537
Page count: 11