Enabling Concurrent Failure Recovery for Regenerating-Coding-Based Storage Systems: From Theory to Practice

Cited by: 22
Authors
Li, Runhui [1]
Lin, Jian [1]
Lee, Patrick P. C. [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Shatin, Hong Kong, Peoples R China
Keywords
Regenerating codes; failure recovery; distributed storage systems; coding theory; experiments and implementation; DISTRIBUTED STORAGE; CODES; CONSTRUCTION; REPAIR
DOI
10.1109/TC.2014.2349518
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Data availability is critical in distributed storage systems, especially when node failures are prevalent in real life. A key requirement is to minimize the amount of data transferred among nodes when recovering the lost or unavailable data of failed nodes. This paper explores recovery solutions based on regenerating codes, which have been designed to provide fault-tolerant storage and minimum bandwidth. Existing optimal regenerating codes are designed for single node failures. We build a system called CORE, which augments existing optimal regenerating codes for the recovery of a general number of failures including single and concurrent failures. We show theoretically that CORE achieves the minimum possible bandwidth for most cases. We implement a CORE prototype and evaluate it atop an HDFS cluster testbed with up to 20 storage nodes. We demonstrate that our CORE prototype conforms to our theoretical findings and achieves bandwidth savings when compared to the conventional recovery approach based on erasure codes.
Pages: 1898-1911
Number of pages: 14