Coded Computing for Multi-Cluster Distributed Computations

被引:0
作者
Wu, Youlong [1 ]
Li, Chenglin [1 ]
Hu, Haoyang [1 ]
Song, Xiyu [2 ]
Ma, Shuai [3 ]
Shi, Yuanming [1 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
[2] Guilin Univ Elect Technol, Sch Informat & Commun, Guilin 541004, Guangxi, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
关键词
Task analysis; Symbols; Servers; Switches; Encoding; Downlink; Uplink; Distributed computing; coded computation; communication load; MapReduce; DESIGN;
D O I
10.1109/TCOMM.2024.3446641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Distributed computing, which leverages distributed storage and computing resources, is a promising paradigm for handling large-scale computational tasks. However, its potential is often hindered by high communication latency due to limited network bandwidth. In this paper, we study the computation-communication tradeoff of multi-cluster MapReduce systems where a central server connects to multiple clusters, each comprising a set of workers that jointly perform a MapReduce task. Workers can exchange information directly within their cluster (inner-cluster communication) or indirectly through the central server (cross-cluster communication). To reduce the communication load, we propose a nested coded distributed computing (CDC) scheme that is feasible for the heterogeneous scenario where different clusters could have arbitrary numbers of workers and computation loads. It is shown that our scheme can greatly reduce communication load compared to all existing schemes, and could achieve the optimal cross-cluster communication load. In addition, the proposed scheme can significantly reduce the computational complexity of the conventional CDC schemes, whose computational complexity exponentially increases with the computation load.
引用
收藏
页码:1114 / 1127
页数:14
相关论文
共 35 条
[1]   DoF of a Cooperative X-Channel with an Application to Distributed Computing [J].
Bi, Yue ;
Ciblat, Philippe ;
Wigger, Michele ;
Wu, Yue .
2022 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, ISIT, 2022, :566-571
[2]   On the Optimality of Data Exchange for Master-Aided Edge Computing Systems [J].
Chen, Haoning ;
Long, Junfeng ;
Ma, Shuai ;
Tang, Mingjian ;
Wu, Youlong .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (03) :1364-1376
[3]   Coded Computing for Master-Aided Distributed Computing Systems [J].
Chen, Haoning ;
Wu, Youlong .
2020 IEEE INFORMATION THEORY WORKSHOP (ITW), 2021,
[4]  
Cheng MQ, 2023, Arxiv, DOI arXiv:2302.05826
[5]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[6]  
Gupta S, 2017, INFO THEOR WORKSH, P459, DOI 10.1109/ITW.2017.8277996
[7]   Communication-Efficient Coded Computing for Distributed Multi-Task Learning [J].
Hu, Haoyang ;
Wu, Youlong ;
Shi, Yuanming ;
Li, Songze ;
Jiang, Chunxiao ;
Zhang, Wei .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (07) :3861-3875
[8]   Coded Distributed Computing for Hierarchical Multi-task Learning [J].
Hu, Haoyang ;
Li, Songze ;
Cheng, Minquan ;
Wu, Youlong .
2023 IEEE INFORMATION THEORY WORKSHOP, ITW, 2023, :480-485
[9]   Cascaded Coded Distributed Computing Schemes Based on Placement Delivery Arrays [J].
Jiang, Jing ;
Qu, Lingxiao .
IEEE ACCESS, 2020, 8 :221385-221395
[10]   Hierarchical Coded Matrix Multiplication [J].
Kianidehkordi, Shahrzad ;
Ferdinand, Nuwan ;
Draper, Stark C. .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (02) :726-754