Enabling Efficient and Reliable Transition from Replication to Erasure Coding for Clustered File Systems

Cited by: 49
Authors
Li, Runhui [1]
Hu, Yuchong [2]
Lee, Patrick P. C. [1]
Affiliations
[1] Chinese Univ Hong Kong, Shatin, Hong Kong, Peoples R China
[2] Huazhong Univ Sci & Technol, Wuhan 430074, Hubei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Replication; erasure codes; distributed storage systems; experiments and implementation; CODES
DOI
10.1109/TPDS.2017.2678505
Chinese Library Classification (CLC) number
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
To balance performance and storage efficiency, modern clustered file systems often first store data with replication, followed by encoding the replicated data with erasure coding. We argue that the commonly used random replication does not take into account erasure coding in its design, thereby raising both performance and availability issues in the subsequent encoding operation. We propose encoding-aware replication, which carefully places the replicas so as to (i) eliminate cross-rack downloads of data blocks during the encoding operation, (ii) preserve availability without data relocation after the encoding operation, and (iii) maintain load balancing across replicas as in random replication before the encoding operation. We conduct extensive HDFS-based testbed experiments and discrete-event simulations, and demonstrate the performance gains of encoding-aware replication over random replication.
Pages: 2500-2513
Page count: 14