Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques

被引：21

作者：

Zhou, Tianli ^{[1
]}

Tian, Chao ^{[1
]}

机构：

[1] Texas A&M Univ, Wisenbaker Engn Bldg 3128,188 Bizzell St, College Stn, TX 77843 USA

来源：

ACM TRANSACTIONS ON STORAGE | 2020年 / 16卷 / 01期

关键词：

Erasure code; performance; SCHEME; RAID;

D O I：

10.1145/3375554

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Various techniques have been proposed in the literature to improve erasure code computation efficiency, including optimizing bitmatrix design and computation schedule, common XOR (exclusive-OR) operation reduction, caching management techniques, and vectorization techniques. These techniques were largely proposed individually, and, in this work, we seek to use them jointly. To accomplish this task, these techniques need to be thoroughly evaluated individually and their relation better understood. Building on extensive testing, we develop methods to systematically optimize the computation chain together with the underlying bitmatrix. This led to a simple design approach of optimizing the bitmatrix by minimizing a weighted computation cost function, and also a straightforward coding procedure-follow a computation schedule produced from the optimized bitmatrix to apply XOR-level vectorization. This procedure provides better performances than most existing techniques (e.g., those used in ISA-L and Jerasure libraries), and sometimes can even compete against well-known but less general codes such as EVENODD, RDP, and STAR codes. One particularly important observation is that vectorizing the XOR operations is a better choice than directly vectorizing finite field operations, not only because of the flexibility in choosing finite field size and the better encoding throughput, but also its minimal migration efforts onto newer CPUs.

引用

页数：24

共 50 条

[1] Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques
Zhou, Tianli
Tian, Chao
PROCEEDINGS OF THE 17TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2019, : 317 - 329
[2] Erasure Coding-Oriented Data Update for Cloud Storage: A Survey
Xiao, Yifei
Zhou, Shijie
Zhong, Linpeng
IEEE ACCESS, 2020, 8 (08): : 227982 - 227998
[3] Exploring Erasure Coding Techniques for High Availability of Intermediate Data
Zhang, Zhe
Bockelman, Brian
Weitzel, Derek
Swanson, David
2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 865 - 872
[4] P-Schedule: Erasure Coding Schedule Strategy in Big Data Storage System
Yin, Chao
Lv, Haitao
Li, Tongfang
Liu, Yan
Qu, Xiaoping
Yuan, Sihao
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT III, 2018, 11336 : 270 - 279
[5] Cerasure: Fast Acceleration Strategies For XOR-Based Erasure Codes
Niu, Tianyang
Lyu, Min
Wang, Wei
Li, Qiliang
Xu, Yinlong
2023 IEEE 41ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, ICCD, 2023, : 535 - 542
[6] ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO
Chang, Chan Jung
Chou, Jerry
Chou, Yu-Ching
Chung, I-Hsin
2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 349 - 358
[7] Erasure Codes for Cold Data in Distributed Storage Systems
Yin, Chao
Xu, Zhiyuan
Li, Wei
Li, Tongfang
Yuan, Sihao
Liu, Yan
APPLIED SCIENCES-BASEL, 2023, 13 (04):
[8] In-network block repairing for erasure coding storage systems
Xia, Junxu
Guo, Deke
Cheng, Geyao
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (24)
[9] A reliable and energy-efficient storage system with erasure coding cache
Wan, Ji-guang
Li, Da-ping
Qu, Xiao-yang
Yin, Chao
Wang, Jun
Xie, Chang-sheng
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (09) : 1370 - 1384
[10] EC-FRM: An Erasure Coding Framework to Speed up Reads for Erasure Coded Cloud Storage Systems
Fu, Yingxun
Shu, Jiwu
Shen, Zhirong
2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 480 - 489

← 1 2 3 4 5 →