GFCache: A Greedy Failure Cache Considering Failure Recency and Failure Frequency for an Erasure-Coded Storage System

被引:4
作者
Deng, Mingzhu [1 ]
Liu, Fang [2 ]
Zhao, Ming [3 ]
Chen, Zhiguang [2 ]
Xiao, Nong [1 ,2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, 109 Deya Rd, Changsha 410073, Hunan, Peoples R China
[2] Sun Yat Sen Univ, Sch Data & Comp Sci, 132 East Outer Ring Rd,Univ City Rd, Guangzhou 510006, Guangdong, Peoples R China
[3] Arizona State Univ, BYENG 460,699 S Mill Ave, Tempe, AZ 85281 USA
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2019年 / 58卷 / 01期
基金
中国国家自然科学基金;
关键词
Failure cache; greedy recovery; erasure coding; failure recency; failure frequency; RECOVERY; SCHEME;
D O I
10.32604/cmc.2019.03585
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the big data era, data unavailability, either temporary or permanent, becomes a normal occurrence on a daily basis. Unlike the permanent data failure, which is fixed through a background job, temporarily unavailable data is recovered on-the-fly to serve the ongoing read request. However, those newly revived data is discarded after serving the request, due to the assumption that data experiencing temporary failures could come back alive later. Such disposal of failure data prevents the sharing of failure information among clients, and leads to many unnecessary data recovery processes, (e.g. caused by either recurring unavailability of a data or multiple data failures in one stripe), thereby straining system performance. To this end, this paper proposes GFCache to cache corrupted data for the dual purposes of failure information sharing and eliminating unnecessary data recovery processes. GFCache employs a greedy caching approach of opportunism to promote not only the failed data, but also sequential failure-likely data in the same stripe. Additionally, GFCache includes a FARC (Failure ARC) catch replacement algorithm, which features a balanced consideration of failure recency, frequency to accommodate data corruption with good hit ratio. The stored data in GFCache is able to support fast read of the normal data access. Furthermore, since GFCache is a generic failure cache, it can be used anywhere erasure coding is deployed with any specific coding schemes and parameters. Evaluations show that GFCache achieves good hit ratio with our sophisticated caching algorithm and manages to significantly boost system performance by reducing unnecessary data recoveries with vulnerable data in the cache.
引用
收藏
页码:153 / 167
页数:15
相关论文
共 31 条
[1]  
[Anonymous], 2008, CS08627 U TENN
[2]  
[Anonymous], IEEE T PARALLEL DIST
[3]  
[Anonymous], 2018, Security and Communication Networks, DOI DOI 10.18653/V1/N18-1121
[4]  
[Anonymous], 2011, P 9 USENIX C FIL STO
[5]  
[Anonymous], SENSORS
[6]  
[Anonymous], SECURITY COMMUNICATI
[7]  
[Anonymous], 2018, SECURITY COMMUNICATI
[8]   A Distributed TCAM Coprocessor Architecture for Integrated Longest Prefix Matching, Policy Filtering, and Content Filtering [J].
Cai, Zhiping ;
Wang, Zhijun ;
Zheng, Kai ;
Cao, Jiannong .
IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (03) :417-427
[9]   Network Coding for Distributed Storage Systems [J].
Dimakis, Alexandros G. ;
Godfrey, P. Brighten ;
Wu, Yunnan ;
Wainwright, Martin J. ;
Ramchandran, Kannan .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (09) :4539-4551
[10]   A Stack-Based Single Disk Failure Recovery Scheme for Erasure Coded Storage Systems [J].
Fu, Yingxun ;
Shu, Jiwu ;
Luo, Xianghong .
2014 IEEE 33RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2014, :136-145