Distributed Decoding for Coded Distributed Computing

被引:2
作者
Yazdanialahabadi, Arash [1 ]
Ardakani, Masoud [1 ]
机构
[1] Univ Alberta, Elect & Comp Engn Dept, Edmonton, AB T6G 2R3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Decoding; Task analysis; Codes; Encoding; Distributed computing; Internet of Things; Complexity theory; Distributed coding; distributed computing; edge computing; heterogeneous network; matrix multiplication;
D O I
10.1109/JIOT.2021.3138855
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In distributed computing, when a large number of helper nodes assist a master node to finish a large task, the main challenge is when some of these helpers straggle. Several coded distributed computing schemes exist that resolve the straggling problem. In these solutions, however, the decoding remains a responsibility of the master node. The complexity of decoding can be significant and increase the execution time considerably. In this work, we propose a multilayer coding strategy that allows some helpers to assist with the decoding. With this multilayer structure, the original decoding is performed by some decoding helpers, and the master only minimally participates in decoding to make every layer reliable. The optimization problem to minimize the overall completion time is also studied and shown to have a simple solution in almost all practical scenarios.
引用
收藏
页码:12555 / 12562
页数:8
相关论文
共 32 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Ananthanarayanan Ganesh, 2013, Proceedings of NSDI '13: 10th USENIX Symposium on Networked Systems Design and Implementation. NSDI '13, P185
[3]  
[Anonymous], 2019, ARXIV190106811
[4]  
[Anonymous], 2018, ARXIV180410331
[5]  
[Anonymous], 2017, P ADV NEUR INF PROC
[6]   Accelerating matrix product on reconfigurable hardware for image processing applications [J].
Bensaali, F ;
Amira, A ;
Bouridane, A .
IEE PROCEEDINGS-CIRCUITS DEVICES AND SYSTEMS, 2005, 152 (03) :236-246
[7]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[8]  
Da Wang, 2015, ACM SIGMETRICS Performance Evaluation Review, V43, P7, DOI 10.1145/2847220.2847223
[9]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[10]   The Tail at Scale [J].
Dean, Jeffrey ;
Barroso, Luiz Andre .
COMMUNICATIONS OF THE ACM, 2013, 56 (02) :74-80