Elastic Resource Allocation for Coded Distributed Computing Over Heterogeneous Wireless Edge Networks

被引:8
作者
Nguyen, Cong T. [1 ,2 ]
Nguyen, Diep N. [1 ]
Hoang, Dinh Thai [1 ]
Phan, Khoa Tran [3 ]
Niyato, Dusit [4 ]
Pham, Hoang-Anh [2 ]
Dutkiewicz, Eryk [1 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Sydney, NSW 2007, Australia
[2] Vietnam Natl Univ Ho Chi Minh City VNU HCM, Ho Chi Minh City Univ Technol HCMUT, Fac Comp Sci & Engn, Ho Chi Minh City 700000, Vietnam
[3] La Trobe Univ, Sch Engn & Math Sci, Melbourne, VIC 3086, Australia
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore City 639798, Singapore
基金
澳大利亚研究理事会; 新加坡国家研究基金会;
关键词
Task analysis; Servers; Optimization; Distributed computing; Wireless communication; Energy consumption; Codes; Coded distributed computing; maximum distance separable code; resource allocation; INLP; Lyapunov optimization; edge computing; straggling effects; COMPUTATION; CLOUD; MAPREDUCE; TRADEOFF;
D O I
10.1109/TWC.2022.3213256
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Coded distributed computing (CDC) has recently emerged to be a promising solution to address the straggling effects in conventional distributed computing systems. By assigning redundant workloads to the computing nodes, CDC can significantly enhance the performance of the whole system. However, since the core idea of CDC is to introduce redundancies to compensate for uncertainties, it may lead to a large amount of wasted energy at the edge nodes. It can be observed that the more redundant workload added, the less impact the straggling effects have on the system. However, at the same time, the more energy is needed to perform redundant tasks. In this work, we develop a novel framework, namely CERA, to elastically allocate computing resources for CDC processes. Particularly, CERA consists of two stages. In the first stage, we model a joint coding and node selection optimization problem to minimize the expected processing time for a CDC task. Since the problem is NP-hard, we propose a linearization approach and a hybrid algorithm to quickly obtain the optimal solutions. In the second stage, we develop a smart online approach based on Lyapunov optimization to dynamically turn off straggling nodes based on their actual performance. As a result, wasteful energy consumption can be significantly reduced with minimal impact on the total processing time. Simulations using real-world datasets have shown that our proposed approach can reduce the system's total processing time by more than 200% compared to that of the state-of-the-art approach, even when the nodes' actual performance is not known in advance. Moreover, the results have shown that CERA's online optimization stage can reduce the energy consumption by up to 37.14% without affecting the total processing time.
引用
收藏
页码:2636 / 2649
页数:14
相关论文
共 34 条
[1]  
Attia MA, 2017, Arxiv, DOI arXiv:1711.08452
[2]   Block-Division-Based Wireless Coded Computation [J].
Chen, Li ;
Han, Kaifeng ;
Du, Ying ;
Wang, Zhiqin .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (02) :283-287
[3]   Toward Optimal Rate-Delay Tradeoff for Computation Over Multiple Access Channel [J].
Chen, Li ;
Zhao, Nan ;
Chen, Yunfei ;
Yu, F. Richard ;
Wei, Guo .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (07) :4335-4346
[4]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[5]   Optimal Incentive and Load Design for Distributed Coded Machine Learning [J].
Ding, Ningning ;
Fang, Zhixuan ;
Duan, Lingjie ;
Huang, Jianwei .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (07) :2090-2104
[6]   Edge of Things: The Big Picture on the integration of Edge, IoT and the Cloud in a Distributed Computing Environment [J].
El-Sayed, Hesham ;
Sankar, Sharmi ;
Prasad, Mukesh ;
Puthal, Deepak ;
Gupta, Akshansh ;
Mohanty, Manoranjan ;
Lin, Chin-Teng .
IEEE ACCESS, 2018, 6 :1706-1717
[7]  
Feeney LM, 2001, IEEE INFOCOM SER, P1548, DOI 10.1109/INFCOM.2001.916651
[8]  
Goemans M. X., 1994, Advanced Algorithms.
[9]  
Halbawi W, 2018, IEEE INT SYMP INFO, P2027, DOI 10.1109/ISIT.2018.8437467
[10]   Coded Wireless Distributed Computing With Packet Losses and Retransmissions [J].
Han, Dong-Jun ;
Sohn, Jy-Yong ;
Moon, Jaekyun .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (12) :8204-8217