A Worst-Case Latency and Age Analysis of Coded Distributed Computing With Unreliable Workers and Periodic Tasks

被引:0
作者
Chiariotti, Federico [1 ]
Soret, Beatriz [2 ]
Popovski, Petar [3 ]
机构
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
[2] Univ Malaga, Telecommun Res Inst, Malaga 29010, Spain
[3] Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark
来源
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY | 2024年 / 5卷
关键词
Computational modeling; Distributed computing; Reliability; Queueing analysis; Probability density function; Solid modeling; Optimization; Coded distributed computing; latency analysis; age of information; fork-join queues; PEAK AGE; INFORMATION; COMPUTATION;
D O I
10.1109/OJCOMS.2024.3458802
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Over the past decade, the deep learning revolution has led to ever-increasing demands for computing power and working memory to support larger and larger neural networks. As this coincided with the end of Moore's law, distributed solutions have emerged as a natural answer: in particular, the novel Coded Distributed Computing (CDC) paradigm exploits results from coding theory to divide large tasks into redundant sets of smaller subtasks to be processed across multiple workers, making the computation more robust to stragglers and malicious worker nodes. Optimizing the use of these distributed computing resources is critical, as excessive redundancy might impact on performance and energy consumption. This work considers a CDC system receiving periodic tasks, deriving the full distribution of the latency, reliability, and Peak Age of Information (PAoI) under worker diversity and random failures. The CDC system is modeled as a fork-join D/M/(K, N)/L queue, where only K of the coded N subtasks are necessary to solve the overall task, and workers can hold up to L subtasks in their queues. Our results are useful for resource optimization, showing the relationship between system load, redundancy, and latency, as well as the trade-off between latency, reliability, and age performance.
引用
收藏
页码:5874 / 5889
页数:16
相关论文
共 41 条
[1]   On the Role of Age of Information in the Internet of Things [J].
Abd-Elmagid, Mohamed A. ;
Pappas, Nikolaos ;
Dhillon, Arpreet S. .
IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (12) :72-77
[2]   Secure Coded Multi-Party Computation for Massive Matrix Operations [J].
Akbari-Nodehi, Hanzaleh ;
Maddah-Ali, Mohammad Ali .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (04) :2379-2398
[3]  
[Anonymous], 2022, 3GPP TS 22.186
[4]   THE FORK-JOIN QUEUE AND RELATED SYSTEMS WITH SYNCHRONIZATION CONSTRAINTS - STOCHASTIC ORDERING AND COMPUTABLE BOUNDS [J].
BACCELLI, F ;
MAKOWSKI, AM ;
SHWARTZ, A .
ADVANCES IN APPLIED PROBABILITY, 1989, 21 (03) :629-660
[5]  
Baccelli F., 2012, Lecture Notes in Statistics., V41
[6]   Minimizing the Age of Information Through Queues [J].
Bedewy, Ahmed M. ;
Sun, Yin ;
Shroff, Ness B. .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (08) :5215-5232
[7]   The Age of Information in Multihop Networks [J].
Bedewy, Ahmed M. ;
Sun, Yin ;
Shroff, Ness B. .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2019, 27 (03) :1248-1257
[8]   Minimizing Latency for Secure Coded Computing Using Secret Sharing via Staircase Codes [J].
Bitar, Rawad ;
Parag, Parimal ;
El Rouayheb, Salim .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (08) :4609-4619
[9]   Timely Distributed Computation With Stragglers [J].
Buyukates, Baturalp ;
Ulukus, Sennur .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (09) :5273-5282
[10]   Latency and Peak Age of Information in Multipath Coded Communications [J].
Chiariotti, Federico ;
Soret, Beatriz ;
Popovski, Petar .
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, :4971-4976