CPU: Cross-Rack-Aware Pipelining Update for Erasure-Coded Storage

被引:1
作者
Wu, Haiqiao [1 ,2 ]
Du, Wan [2 ]
Gong, Peng [1 ]
Wu, Dapeng Oliver [3 ]
机构
[1] Beijing Inst Technol, Sch Mechatron Engn, Beijing Beijing, Peoples R China
[2] Univ Calif Merced, Dept Comp Sci & Engn, Merced, CA 95343 USA
[3] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
关键词
Distributed storage system; cross-rack-aware updates; pipelining; erasure coding; CAPACITY;
D O I
10.1109/TCC.2020.3035526
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Erasure coding is widely used in distributed storage systems (DSSs) to efficiently achieve fault tolerance. However, when the original data need to be updated, erasure coding must update every encoded block, resulting in long update time and high bandwidth consumption. Exiting solutions are mainly focused on coding schemes to minimize the size of transmitted update information, while ignoring more efficient utilization of bandwidth among update racks. In this article, we propose a parallel Cross-rack Pipelining Update scheme (CPU), which divides the update information into small-size units and transmits these units in parallel along with an update pipeline path among multiple racks. The performance of CPU is mainly determined by slice size and update path. More slices bring finer-grained parallel transmissions over cross-rack links, but also introduces more overheads. An update path that traverses all racks with large-bandwidth links provide short update time. We formulate the proposed pipelining update scheme as an optimization problem, based on a new theoretical pipelining update model. We prove the optimization problem is NP-hard and develop a heuristic algorithm to solve it based on the features of practical DSSs and our implementations, including Big chunk and Small overhead. Specifically, we determine the best update path first by solving a max-min problem and then decide the slice size. We further simplify the slice size selection by offline learning a range of interesting (RoI), in which all slice sizes provide similar performance. We implement CPU and conduct experiments on Amazon EC2 under a variety of scenarios. The results show that CPU can reduce the average update time by 48.2 percent, compared with the state-of-the-art update schemes.
引用
收藏
页码:2424 / 2436
页数:13
相关论文
共 53 条
  • [1] EC-Store: Bridging the Gap Between Storage and Latency in Distributed Erasure Coded Systems
    Abebe, Michael
    Daudjee, Khuzaima
    Glasbergen, Brad
    Tian, Yuanfeng
    [J]. 2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 255 - 266
  • [2] Analysis of Workload Behavior in Scientific and Historical Long-Term Data Repositories
    Adams, Ian F.
    Storer, Mark W.
    Miller, Ethan L.
    [J]. ACM TRANSACTIONS ON STORAGE, 2012, 8 (02)
  • [3] Ahmad Faraz., 2014, 2014 USENIX ANN TECH, P1
  • [4] Borthakur D., 2008, HDFS ARCHITECTURE GU
  • [5] THE TICKERTAIP PARALLEL RAID ARCHITECTURE
    CAO, P
    LIM, SB
    VENKATARAMAN, S
    WILKES, J
    [J]. ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1994, 12 (03): : 236 - 269
  • [6] Chan J.C., 2014, 12th USENIX Conference on File and Storage Technologies (FAST 14), P163
  • [7] Secure Cloud Storage Meets with Secure Network Coding
    Chen, Fei
    Xiang, Tao
    Yang, Yuanyuan
    Chow, Sherman S. M.
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (06) : 1936 - 1948
  • [8] Leveraging Endpoint Flexibility in Data-Intensive Clusters
    Chowdhury, Mosharaf
    Kandula, Srikanth
    Stoica, Ion
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) : 231 - 242
  • [9] Is it time to revisit Erasure Coding in Data-intensive clusters?
    Darrous, Jad
    Ibrahim, Shadi
    Perez, Christian
    [J]. 2019 IEEE 27TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2019), 2019, : 165 - 178
  • [10] Network Coding for Distributed Storage Systems
    Dimakis, Alexandros G.
    Godfrey, P. Brighten
    Wu, Yunnan
    Wainwright, Martin J.
    Ramchandran, Kannan
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (09) : 4539 - 4551