Joint Scheduling and Source Selection for Background Traffic in Erasure-Coded Storage

被引:4
|
作者
Li, Shijing [1 ]
Lan, Tian [1 ]
Ra, Moo-Ryong [2 ]
Panta, Rajesh [2 ]
机构
[1] George Washington Univ, ECE, Washington, DC 20052 USA
[2] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
Traffic scheduling; erasure code; storage; LINEAR-TIME; ALGORITHM;
D O I
10.1109/TPDS.2018.2845845
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Erasure-coded storage systems have gained considerable adoption recently since they can provide the same level of reliability with significantly lower storage overhead compared to replicated systems. However, background traffic of such systems - e.g., repair, rebalance, backup and recovery traffic - often has large volume and consumes significant network resources. Independently scheduling such tasks and selecting their sources can easily create interference among data flows, causing severe deadline violation. We show that the well-known heuristic scheduling algorithms fail to consider important constraints, thus resulting in unsatisfactory performance. In this paper, we claim that an optimal scheduling algorithm, which aims to maximize the number of background tasks completed before deadlines, must simultaneously consider task deadline, network topology, chunk placement, and time-varying resource availability. We first show that the corresponding optimization problem is NP-hard. Then we propose a novel algorithm, called Linear Programming for Selected Tasks (LPST) to maximize the number of successful tasks and improve overall utilization of the datacenter network. It jointly schedules tasks and selects their sources based on a notion of Remaining Time Flexibility, which measures the slackness of the starting time of a task. We evaluated the efficacy of our algorithm using extensive simulations and validate the results with experiments in a real cloud environment. Our results show that, under certain scenarios, LPSTcan perform 7x similar to 10x better than the heuristics which blindly treat the infrastructure as a collection of homogeneous resources, and 21.7 similar to 65.9 percent better than the algorithms that only take the network topology into account.
引用
收藏
页码:2826 / 2837
页数:12
相关论文
共 20 条
  • [1] Joint Latency and Cost Optimization for Erasure-Coded Data Center Storage
    Xiang, Yu
    Lan, Tian
    Aggarwal, Vaneet
    Chen, Yih-Farn R.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (04) : 2443 - 2457
  • [2] Modeling and Optimization of Latency in Erasure-coded Storage Systems
    Aggarwal, Vaneet
    Lan, Tian
    FOUNDATIONS AND TRENDS IN COMMUNICATIONS AND INFORMATION THEORY, 2021, 18 (03): : 380 - 525
  • [3] An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm
    Wang, Zizhong
    Wang, Haixia
    Shao, Airan
    Wang, Dongsheng
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [4] Reliability Evaluation of Erasure-coded Storage Systems with Latent Errors
    Iliadis, Ilias
    ACM TRANSACTIONS ON STORAGE, 2023, 19 (01)
  • [5] An Efficient Parallel Coding Scheme in Erasure-Coded Storage Systems
    Dong, Wenrui
    Liu, Guangming
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (03): : 627 - 643
  • [6] Repair Pipelining for Erasure-Coded Storage Based on Load-Balanced
    Jiang X.-Y.
    Li G.-Y.
    Zhou Y.
    Hu J.-P.
    Li H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (05): : 930 - 936
  • [7] Incremental encoding for erasure-coded cross-datacenters cloud storage
    Xu, Fangliang
    Wang, Yijie
    Ma, Xingkong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 527 - 537
  • [8] Aggregation Encoding: Reducing Network Traffic for Big Data Erasure-Coded Storages
    Zhang, Jing
    Li, Shanshan
    Liao, Xiangke
    Liu, Xiaodong
    2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 205 - 208
  • [9] Parallelized In-Network Aggregation for Failure Repair in Erasure-Coded Storage Systems
    Xia, Junxu
    Luo, Lailong
    Sun, Bowen
    Cheng, Geyao
    Guo, Deke
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (04) : 2888 - 2903
  • [10] Tight Bound of Parallel Request Latency for Erasure-Coded Distributed Storage System
    Zou, Xingshun
    Li, Wei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I, 2020, 12452 : 355 - 368