Parallelized In-Network Aggregation for Failure Repair in Erasure-Coded Storage Systems

Cited by: 3
Authors
Xia, Junxu [1]
Luo, Lailong [1]
Sun, Bowen [1]
Cheng, Geyao [1]
Guo, Deke [2]
Affiliations
[1] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China
[2] Xiangjiang Lab, Changsha 410073, Hunan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Erasure code; distributed storage system; programmable switch; fault tolerance
DOI
10.1109/TNET.2024.3367995
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
To repair a failed block in an erasure-coded storage system, multiple related blocks have to be retrieved from other storage nodes across the network. This process can cause significant incast-type repair traffic and delay. Existing efforts mainly schedule the transmission of the requested blocks across different storage nodes to avoid network congestion. At their core, they use some of the involved hosts to relay or aggregate the file blocks from the others. We observe, however, that the programmability and capability of today's network devices (i.e., routers and switches) offer a great opportunity to further speed up the repair process by aggregating the file blocks on such devices. By migrating the aggregation operations from the network edge to the network core, it is possible to save more time and bandwidth. With this intuition, we propose Paint, a parallelized in-network aggregation framework for failure repair. Paint uses programmable switches to aggregate relevant data and improves repair performance by running multiple parallelized repair pipelines. We propose a series of novel, time-efficient algorithms to construct the routing paths for Paint, and we design the Aggregation Control Protocol to implement Paint in production clusters. To the best of our knowledge, this is the first work to explore and implement parallelized in-network repair with programmable switches. Extensive experiments on a prototype system and real-world datasets indicate that Paint can significantly improve repair performance while effectively reducing bandwidth overhead.
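The property that makes aggregation inside the network possible can be sketched in a few lines. In a linear erasure code, the repaired block is a linear combination of helper blocks over a finite field, and addition in that field is XOR, so partial sums may be formed at any intermediate hop in any order. The sketch below is illustrative only and is not the Paint algorithm or its Aggregation Control Protocol: the field GF(2^8) with reduction polynomial 0x11D, the helper blocks, the decoding coefficients, and the two-level "switch" tree are all assumptions chosen for the example.

```python
from functools import reduce


def gf_mul(a: int, b: int) -> int:
    """Multiply two bytes in GF(2^8), reducing by the polynomial 0x11D (assumed)."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11D
        b >>= 1
    return result


def scale(coeff: int, block: bytes) -> bytes:
    """Multiply every byte of a block by a decoding coefficient."""
    return bytes(gf_mul(coeff, x) for x in block)


def xor_blocks(x: bytes, y: bytes) -> bytes:
    """Add two equal-length blocks in GF(2^8); field addition is bytewise XOR."""
    return bytes(p ^ q for p, q in zip(x, y))


# Hypothetical helper blocks and decoding coefficients (not taken from the paper).
blocks = [bytes([1, 2, 3, 4]), bytes([5, 6, 7, 8]),
          bytes([9, 10, 11, 12]), bytes([13, 14, 15, 16])]
coeffs = [3, 7, 1, 9]

# Each storage node scales its own block before sending it.
scaled = [scale(c, b) for c, b in zip(coeffs, blocks)]

# Edge aggregation: the requester receives all four blocks and sums them itself.
at_requester = reduce(xor_blocks, scaled)

# In-network aggregation: two hypothetical switches each merge two flows,
# and a third hop merges the two partial sums before delivery.
switch_a = xor_blocks(scaled[0], scaled[1])
switch_b = xor_blocks(scaled[2], scaled[3])
in_network = xor_blocks(switch_a, switch_b)

# Both orders give the same repaired block.
assert in_network == at_requester
```

In the in-network variant the requester's downlink carries one block instead of four, which is the incast reduction the abstract refers to; how the aggregation tree is actually built and scheduled is left to the paper.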
Pages: 2888-2903
Number of pages: 16