Asymptotically Optimal Coded Distributed Computing via Combinatorial Designs

被引:0
作者
Cheng, Minquan [1 ,2 ]
Wu, Youlong [3 ]
Li, Xianxian [1 ,2 ]
Wu, Dianhua [4 ]
机构
[1] Guangxi Normal Univ, Minist Educ, Key Lab Educ Blockchain & Intelligent Technol, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Miningand Sec, Guilin 541004, Peoples R China
[3] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
[4] Guangxi Normal Univ, Sch Math & Stat, Guilin 541006, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributed computing; Costs; Task analysis; Encoding; Vectors; Technological innovation; Symbols; Coded distributed computing; asymptotically optimal; t-design; t-GDD; COMPUTATION;
D O I
10.1109/TNET.2024.3372698
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Coded distributed computing (CDC) introduced by Li et al. can greatly reduce the communication load for MapReduce computing systems. In the cascaded CDC with K workers, N input files and Q output functions, each input file will be mapped by r workers and each output function will be computed by s workers such that coding techniques can be applied to create multicast opportunities. The main drawback of most existing CDC schemes is that they require the original data to be split into a large number of input files that grows exponentially with K , which would significantly increase the coding complexity and degrade the system performance. In this paper, we first use a classical combinatorial structure t -design, for any integer t >= 2 >= 2 , to develop a low-complexity and communication-efficient CDC with r=s. Our scheme has much smaller N and Q than the existing schemes under the same parameters K , r , and s ; and achieves smaller communication loads compared with the state-of-the-art schemes when K is relatively large. Remarkably, unlike the previous schemes that realize on large operation fields, our scheme operates in one-shot communication on the minimum binary field F-2 . With a derived lower bound on the communication load under one-shot linear delivery, we show that the t -design scheme is asymptotically optimal. Furthermore, we show that our construction method can incorporate the other combinatorial structures that have a similar property to t -design. For instance, we use t -GDD to obtain another one-shot asymptotically optimal CDC scheme over F-2 that has different parameters from t -design. Finally, we show that our construction method can also be used to construct CDC schemes with r not equal s that have small file number and output function number. Like what you're reading?
引用
收藏
页码:3018 / 3033
页数:16
相关论文
共 26 条
  • [1] Tarazu: Optimizing MapReduce On Heterogeneous Clusters
    Ahmad, Faraz
    Chakradhar, Srimat
    Raghunathan, Anand
    Vijaykumar, T. N.
    [J]. ACM SIGPLAN NOTICES, 2012, 47 (04) : 61 - 74
  • [2] [Anonymous], 2010, 2 USENIX WORKSH HOT
  • [3] Chen Y., 2021, 2020 IEEE INF THEOR, P1
  • [4] Managing Data Transfers in Computer Clusters with Orchestra
    Chowdhury, Mosharaf
    Zaharia, Matei
    Ma, Justin
    Jordan, Michael I.
    Stoica, Ion
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2011, 41 (04) : 98 - 109
  • [5] Colbourn, 2006, HDB COMBINATORIAL DE
  • [6] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [7] Glock D., 2023, MEMOIRS AM MATH SOC, V284
  • [8] iShuffle: Improving Hadoop Performance with Shuffle-on-Write
    Guo, Yanfei
    Rao, Jia
    Cheng, Dazhao
    Zhou, Xiaobo
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1649 - 1662
  • [9] Horii S, 2020, IEEE INT SYMP INFO, P179, DOI [10.1109/ISIT44484.2020.9174132, 10.1109/isit44484.2020.9174132]
  • [10] Fundamental Limits of Caching in Wireless D2D Networks
    Ji, Mingyue
    Caire, Giuseppe
    Molisch, Andreas F.
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (02) : 849 - 869