Edge/Cloud Infinite-Time Horizon Resource Allocation for Distributed Machine Learning and General Tasks

被引:1
作者
Sartzetakis, Ippokratis [1 ,2 ]
Soumplis, Polyzois [1 ,2 ]
Pantazopoulos, Panagiotis [2 ]
Katsaros, Konstantinos V. [2 ]
Sourlas, Vasilis [2 ]
Varvarigos, Emmanouel [1 ,2 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[2] Natl Tech Univ Athens, Inst Commun & Comp Syst, Athens 15773, Greece
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2024年 / 21卷 / 01期
基金
欧盟地平线“2020”;
关键词
Cloud and edge computing; distributed computing; distributed machine learning; inference; training; resource allocation; INTERNET; IOT;
D O I
10.1109/TNSM.2023.3312593
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge computing has emerged as a computing paradigm where the application and data processing takes place close to the end devices. It decreases the distances over which data transfers are made, offering reduced delay and fast speed of action for general data processing and store/retrieve jobs. The benefits of edge computing can also be reaped for distributed computation algorithms, where the cloud also plays an assistive role. In this context, an important challenge is to allocate the required resources at both edge and cloud to carry out the processing of data that are generated over a continuous ("infinite") time horizon. This is a complex problem due to the variety of requirements (resource needs, accuracy, delay, etc.) that may be posed by each computation algorithm, as well as the heterogeneous resources' features (e.g., processing, bandwidth). In this work, we develop a solution for serving weakly coupled general distributed algorithms, with emphasis on machine learning algorithms, at the edge and/or the cloud. We present a dual-objective Integer Linear Programming formulation that optimizes monetary cost and computation accuracy. We also introduce efficient heuristics to perform the resource allocation. We examine various distributed ML allocation scenarios using realistic parameters from actual vendors. We quantify trade-offs related to accuracy, performance and cost of edge/cloud bandwidth and processing resources. Our results indicate that among the many parameters of interest, the processing costs seem to play the most important role for the allocation decisions. Finally, we explore interesting interactions between target accuracy, monetary cost and delay.
引用
收藏
页码:697 / 713
页数:17
相关论文
共 50 条
[31]   Edge-Cloud Solutions for Big Data Analysis and Distributed Machine Learning-1 [J].
Belcastro, Loris ;
Carretero, Jesus ;
Talia, Domenico .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 159 :323-326
[32]   Joint Task and Computing Resource Allocation in Distributed Edge Computing Systems via Multi-Agent Deep Reinforcement Learning [J].
Chen, Yan ;
Sun, Yanjing ;
Yu, Hao ;
Taleb, Tarik .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (04) :3479-3494
[33]   Distributed Deep Reinforcement Learning Based Mode Selection and Resource Allocation for VR Transmission in Edge Networks [J].
Luo, Jie ;
Liu, Bei ;
Gao, Hui ;
Su, Xin .
COMMUNICATIONS AND NETWORKING (CHINACOM 2021), 2022, :153-167
[34]   Computation offloading and resource allocation based on distributed deep learning and software defined mobile edge computing [J].
Wang, Zhongyu ;
Lv, Tiejun ;
Chang, Zheng .
COMPUTER NETWORKS, 2022, 205
[35]   Deep Learning Modified Reinforcement Learning with Virtual Machine Consolidation for Energy-Efficient Resource Allocation in Cloud Computing [J].
Dutta, Chiranjit ;
Rani, R. M. ;
Jain, Amar ;
Poonguzhali, I. ;
Salunke, Dipmala ;
Patel, Ruchi .
INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2024,
[36]   Offloading and Resource Allocation With General Task Graph in Mobile Edge Computing: A Deep Reinforcement Learning Approach [J].
Yan, Jia ;
Bi, Suzhi ;
Zhang, Ying-Jun Angela .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (08) :5404-5419
[37]   A Truthful Reverse Auction Mechanism for Federated Learning Utility Maximization Resource Allocation in Edge-Cloud Collaboration [J].
Liu, Linjie ;
Zhang, Jixian ;
Wang, Zhemin ;
Xu, Jia .
MATHEMATICS, 2023, 11 (24)
[38]   Distributed Fixed-Time Resource Allocation Algorithm for the General Linear Multi-Agent Systems [J].
Shi, Xiasheng ;
Xu, Lei ;
Yang, Tao ;
Lin, Zhiyun ;
Wang, Xuesong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (06) :2867-2871
[39]   Single machine due window assignment and resource allocation scheduling problems with learning and general positional effects [J].
Liu, Lu ;
Wang, Jian-Jun ;
Liu, Feng ;
Liu, Ming .
JOURNAL OF MANUFACTURING SYSTEMS, 2017, 43 :1-14
[40]   Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach [J].
Huang, Chong ;
Chen, Gaojie ;
Xiao, Pei ;
Xiao, Yue ;
Han, Zhu ;
Chambers, Jonathon A. .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (05) :1029-1043