CPU: Cross-Rack-Aware Pipelining Update for Erasure-Coded Storage

被引：2

作者：

Wu, Haiqiao ^{[1
,2
]}

Du, Wan ^{[2
]}

Gong, Peng ^{[1
]}

Wu, Dapeng Oliver ^{[3
]}

机构：

[1] Beijing Inst Technol, Sch Mechatron Engn, Beijing Beijing, Peoples R China

[2] Univ Calif Merced, Dept Comp Sci & Engn, Merced, CA 95343 USA

[3] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA

来源：

IEEE TRANSACTIONS ON CLOUD COMPUTING | 2022年 / 10卷 / 04期

关键词：

Distributed storage system; cross-rack-aware updates; pipelining; erasure coding; CAPACITY;

D O I：

10.1109/TCC.2020.3035526

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Erasure coding is widely used in distributed storage systems (DSSs) to efficiently achieve fault tolerance. However, when the original data need to be updated, erasure coding must update every encoded block, resulting in long update time and high bandwidth consumption. Exiting solutions are mainly focused on coding schemes to minimize the size of transmitted update information, while ignoring more efficient utilization of bandwidth among update racks. In this article, we propose a parallel Cross-rack Pipelining Update scheme (CPU), which divides the update information into small-size units and transmits these units in parallel along with an update pipeline path among multiple racks. The performance of CPU is mainly determined by slice size and update path. More slices bring finer-grained parallel transmissions over cross-rack links, but also introduces more overheads. An update path that traverses all racks with large-bandwidth links provide short update time. We formulate the proposed pipelining update scheme as an optimization problem, based on a new theoretical pipelining update model. We prove the optimization problem is NP-hard and develop a heuristic algorithm to solve it based on the features of practical DSSs and our implementations, including Big chunk and Small overhead. Specifically, we determine the best update path first by solving a max-min problem and then decide the slice size. We further simplify the slice size selection by offline learning a range of interesting (RoI), in which all slice sizes provide similar performance. We implement CPU and conduct experiments on Amazon EC2 under a variety of scenarios. The results show that CPU can reduce the average update time by 48.2 percent, compared with the state-of-the-art update schemes.

引用

页码：2424 / 2436

页数：13

共 53 条

[1] EC-Store: Bridging the Gap Between Storage and Latency in Distributed Erasure Coded Systems [J].

Abebe, Michael ;

Daudjee, Khuzaima ;

Glasbergen, Brad ;

Tian, Yuanfeng .

2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, :255-266

[2] Analysis of Workload Behavior in Scientific and Historical Long-Term Data Repositories [J].

Adams, Ian F. ;

Storer, Mark W. ;

Miller, Ethan L. .

ACM TRANSACTIONS ON STORAGE, 2012, 8 (02)

[3]

Ahmad Faraz, 2014, Proceedings of USENIX ATC '14: 2014 USENIX Annual Technical Conference. ATC '14, P1

[4]

Borthakur D., 2008, Hadoop apache project, V53, P2

[5] THE TICKERTAIP PARALLEL RAID ARCHITECTURE [J].

CAO, P ;

LIM, SB ;

VENKATARAMAN, S ;

WILKES, J .

ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1994, 12 (03) :236-269

[6]

Chan Jeremy C. W., 2014, Proceedings of the 12th USENIX Conference on File and Storage Technologies. FAST '14, P163

[7] Secure Cloud Storage Meets with Secure Network Coding [J].

Chen, Fei ;

Xiang, Tao ;

Yang, Yuanyuan ;

Chow, Sherman S. M. .

IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (06) :1936-1948

[8] Leveraging Endpoint Flexibility in Data-Intensive Clusters [J].

Chowdhury, Mosharaf ;

Kandula, Srikanth ;

Stoica, Ion .

ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) :231-242

[9] Is it time to revisit Erasure Coding in Data-intensive clusters? [J].

Darrous, Jad ;

Ibrahim, Shadi ;

Perez, Christian .

2019 IEEE 27TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2019), 2019, :165-178

[10] Network Coding for Distributed Storage Systems [J].

Dimakis, Alexandros G. ;

Godfrey, P. Brighten ;

Wu, Yunnan ;

Wainwright, Martin J. ;

Ramchandran, Kannan .

IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (09) :4539-4551

← 1 2 3 4 5 6 →