Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques

被引:21
|
作者
Zhou, Tianli [1 ]
Tian, Chao [1 ]
机构
[1] Texas A&M Univ, Wisenbaker Engn Bldg 3128,188 Bizzell St, College Stn, TX 77843 USA
关键词
Erasure code; performance; SCHEME; RAID;
D O I
10.1145/3375554
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various techniques have been proposed in the literature to improve erasure code computation efficiency, including optimizing bitmatrix design and computation schedule, common XOR (exclusive-OR) operation reduction, caching management techniques, and vectorization techniques. These techniques were largely proposed individually, and, in this work, we seek to use them jointly. To accomplish this task, these techniques need to be thoroughly evaluated individually and their relation better understood. Building on extensive testing, we develop methods to systematically optimize the computation chain together with the underlying bitmatrix. This led to a simple design approach of optimizing the bitmatrix by minimizing a weighted computation cost function, and also a straightforward coding procedure-follow a computation schedule produced from the optimized bitmatrix to apply XOR-level vectorization. This procedure provides better performances than most existing techniques (e.g., those used in ISA-L and Jerasure libraries), and sometimes can even compete against well-known but less general codes such as EVENODD, RDP, and STAR codes. One particularly important observation is that vectorizing the XOR operations is a better choice than directly vectorizing finite field operations, not only because of the flexibility in choosing finite field size and the better encoding throughput, but also its minimal migration efforts onto newer CPUs.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques
    Zhou, Tianli
    Tian, Chao
    PROCEEDINGS OF THE 17TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2019, : 317 - 329
  • [2] Erasure Coding-Oriented Data Update for Cloud Storage: A Survey
    Xiao, Yifei
    Zhou, Shijie
    Zhong, Linpeng
    IEEE ACCESS, 2020, 8 (08): : 227982 - 227998
  • [3] Exploring Erasure Coding Techniques for High Availability of Intermediate Data
    Zhang, Zhe
    Bockelman, Brian
    Weitzel, Derek
    Swanson, David
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 865 - 872
  • [4] P-Schedule: Erasure Coding Schedule Strategy in Big Data Storage System
    Yin, Chao
    Lv, Haitao
    Li, Tongfang
    Liu, Yan
    Qu, Xiaoping
    Yuan, Sihao
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT III, 2018, 11336 : 270 - 279
  • [5] Cerasure: Fast Acceleration Strategies For XOR-Based Erasure Codes
    Niu, Tianyang
    Lyu, Min
    Wang, Wei
    Li, Qiliang
    Xu, Yinlong
    2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 535 - 542
  • [6] ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO
    Chang, Chan Jung
    Chou, Jerry
    Chou, Yu-Ching
    Chung, I-Hsin
    2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 349 - 358
  • [7] Erasure Codes for Cold Data in Distributed Storage Systems
    Yin, Chao
    Xu, Zhiyuan
    Li, Wei
    Li, Tongfang
    Yuan, Sihao
    Liu, Yan
    APPLIED SCIENCES-BASEL, 2023, 13 (04):
  • [8] In-network block repairing for erasure coding storage systems
    Xia, Junxu
    Guo, Deke
    Cheng, Geyao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (24)
  • [9] A reliable and energy-efficient storage system with erasure coding cache
    Wan, Ji-guang
    Li, Da-ping
    Qu, Xiao-yang
    Yin, Chao
    Wang, Jun
    Xie, Chang-sheng
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (09) : 1370 - 1384
  • [10] EC-FRM: An Erasure Coding Framework to Speed up Reads for Erasure Coded Cloud Storage Systems
    Fu, Yingxun
    Shu, Jiwu
    Shen, Zhirong
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 480 - 489