Data repair accelerating scheme for erasure-coded storage system based on FPGA and hierarchical parallel decoding structure

被引:1
作者
Chen, Junqi [1 ]
Yang, Sijie [1 ]
Wang, Yong [1 ,2 ]
Ye, Miao [1 ,3 ]
Lei, Fan [1 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Engn Technol Res Ctr Cloud Secur & Cloud S, Guilin 541004, Peoples R China
[3] Guilin Univ Elect Technol, Guangxi Key Lab Wireless Wideband Commun & Signal, Guilin 541004, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2024年 / 27卷 / 06期
基金
中国国家自然科学基金;
关键词
Data repair; Fault-tolerant storage; Erasure code; Hardware acceleration; FPGA;
D O I
10.1007/s10586-024-04401-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Erasure coding has been widely used in commodity datacenter to tolerate faults, due to its ability to simultaneously provide high storage space utilization and data reliability. However, when data loss occurs, the extra data decoding and traffic overhead makes it difficult to improve their data repair efficiency, limiting their further application in the hot data storage systems. In this paper, we proposed an FPGA-based Data Repair accelerating scheme (FPGA-ECDR) for erasure-coded storage system, which employs the Cauchy Reed-Solomon(CRS) code, to overcome the aforementioned limitation. In FPGA-ECDR, multiple modules are designed to work collaboratively, enhancing the efficiency of data flow and ensuring the reliability of the data repair process. Then, a CRS decoding algorithm based on check matrix is used to reduce the complexity of matrix inversion in the decoding process, and hardware acceleration of the algorithm is realized by FPGA. Moreover, we proposed a Hierarchical Parallel Decoding Structure (HPDS) to optimize cache data reading timing and XOR logic operations in the decoding process. HPDS can effectively reduce the impact of Column Address Strobe (CAS) latency and improve repair efficiency. Finally, we conducted FPGA board-level verification of the proposed scheme, testing on CRS codes with different data sizes and parameters. The experimental results show that compared with the schemes of the current mainstream open-source erasure coding library Jerasure and Intel ISA-L acceleration library, as well as the Xilinx RS code decoding acceleration scheme based on the same FPGA platform, our proposed scheme has lower decoding latency and can improve the data decoding rate by 3.2 to 148.5 times and enhances the repair throughput by up to 21.4 times in fault-tolerant storage system.
引用
收藏
页码:7803 / 7823
页数:21
相关论文
共 38 条
  • [1] [Anonymous], 2023, APACHE HADOOP 335
  • [2] Erasure coding for distributed storage: an overview
    Balaji, S. B.
    Krishnan, M. Nikhil
    Vajha, Myna
    Ramkumar, Vinayak
    Sasidharan, Birenjith
    Kumar, P. Vijay
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (10)
  • [3] Bao H., 2023, J COMPUT RES DEV, DOI [10.7544/issn1000-1239.202220580, DOI 10.7544/ISSN1000-1239.202220580]
  • [4] The Future of FPGA Acceleration in Datacenters and the Cloud
    Bobda, Christophe
    Mbongue, Joel Mandebi
    Chow, Paul
    Ewais, Mohammad
    Tarafdar, Naif
    Vega, Juan Camilo
    Eguro, Ken
    Koch, Dirk
    Handagala, Suranga
    Leeser, Miriam
    Herbordt, Martin
    Shahzad, Hafsah
    Hofste, Peter
    Ringlein, Burkhard
    Szefer, Jakub
    Sanaullah, Ahmed
    Tessier, Russell
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (03)
  • [5] Towards Optimal Outsourcing of Service Function Chain Across Multiple Clouds
    Chen, Huan
    Xu, Shizhong
    Wang, Xiong
    Zhao, Yangming
    Li, Ke
    Wang, Yang
    Wang, Wei
    Li, Le Min
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [6] FPGA HLS Today: Successes, Challenges, and Opportunities
    Cong, Jason
    Lau, Jason
    Liu, Gai
    Neuendorffer, Stephen
    Pan, Peichen
    Vissers, Kees
    Zhang, Zhiru
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (04)
  • [7] Design of FPGA-Implemented Reed-Solomon Erasure Code (RS-EC) Decoders With Fault Detection and Location on User Memory
    Gao, Zhen
    Zhang, Lingling
    Cheng, Yinghao
    Guo, Kangkang
    Ullah, Anees
    Reviriego, Pedro
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (06) : 1073 - 1082
  • [8] Heterogeneous multicore SDRAM interference analysis
    Gonzalez, Alfonso Mascarenas
    Boniol, Frederic
    Bouchebaba, Youcef
    Bussenot, Jean-Loup
    Chaudron, Jean-Baptiste
    [J]. 29TH INTERNATIONAL CONFERENCE ON REAL TIME NETWORKS AND SYSTEMS (RTNS 2021), 2021, : 12 - 23
  • [9] Rack-Aware Regenerating Codes for Data Centers
    Hou, Hanxu
    Lee, Patrick P. C.
    Shum, Kenneth W.
    Hu, Yuchong
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (08) : 4730 - 4745
  • [10] Intel, 2023, ISA L