Edge/Cloud Infinite-Time Horizon Resource Allocation for Distributed Machine Learning and General Tasks

被引:2
作者
Sartzetakis, Ippokratis [1 ,2 ]
Soumplis, Polyzois [1 ,2 ]
Pantazopoulos, Panagiotis [2 ]
Katsaros, Konstantinos V. [2 ]
Sourlas, Vasilis [2 ]
Varvarigos, Emmanouel [1 ,2 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Athens 15773, Greece
[2] Natl Tech Univ Athens, Inst Commun & Comp Syst, Athens 15773, Greece
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2024年 / 21卷 / 01期
基金
欧盟地平线“2020”;
关键词
Cloud and edge computing; distributed computing; distributed machine learning; inference; training; resource allocation; INTERNET; IOT;
D O I
10.1109/TNSM.2023.3312593
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge computing has emerged as a computing paradigm where the application and data processing takes place close to the end devices. It decreases the distances over which data transfers are made, offering reduced delay and fast speed of action for general data processing and store/retrieve jobs. The benefits of edge computing can also be reaped for distributed computation algorithms, where the cloud also plays an assistive role. In this context, an important challenge is to allocate the required resources at both edge and cloud to carry out the processing of data that are generated over a continuous ("infinite") time horizon. This is a complex problem due to the variety of requirements (resource needs, accuracy, delay, etc.) that may be posed by each computation algorithm, as well as the heterogeneous resources' features (e.g., processing, bandwidth). In this work, we develop a solution for serving weakly coupled general distributed algorithms, with emphasis on machine learning algorithms, at the edge and/or the cloud. We present a dual-objective Integer Linear Programming formulation that optimizes monetary cost and computation accuracy. We also introduce efficient heuristics to perform the resource allocation. We examine various distributed ML allocation scenarios using realistic parameters from actual vendors. We quantify trade-offs related to accuracy, performance and cost of edge/cloud bandwidth and processing resources. Our results indicate that among the many parameters of interest, the processing costs seem to play the most important role for the allocation decisions. Finally, we explore interesting interactions between target accuracy, monetary cost and delay.
引用
收藏
页码:697 / 713
页数:17
相关论文
共 50 条
[41]   A Truthful Reverse Auction Mechanism for Federated Learning Utility Maximization Resource Allocation in Edge-Cloud Collaboration [J].
Liu, Linjie ;
Zhang, Jixian ;
Wang, Zhemin ;
Xu, Jia .
MATHEMATICS, 2023, 11 (24)
[42]   Deep-Reinforcement-Learning-Based Task Offloading and Resource Allocation in Mobile Edge Computing Network With Heterogeneous Tasks [J].
Jiang, Tao ;
Chen, Zhaoping ;
Zhao, Zilong ;
Feng, Mingjie ;
Zhou, Jiaxi .
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (08) :10899-10906
[43]   Distributed Fixed-Time Resource Allocation Algorithm for the General Linear Multi-Agent Systems [J].
Shi, Xiasheng ;
Xu, Lei ;
Yang, Tao ;
Lin, Zhiyun ;
Wang, Xuesong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (06) :2867-2871
[44]   Single machine due window assignment and resource allocation scheduling problems with learning and general positional effects [J].
Liu, Lu ;
Wang, Jian-Jun ;
Liu, Feng ;
Liu, Ming .
JOURNAL OF MANUFACTURING SYSTEMS, 2017, 43 :1-14
[45]   Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach [J].
Huang, Chong ;
Chen, Gaojie ;
Xiao, Pei ;
Xiao, Yue ;
Han, Zhu ;
Chambers, Jonathon A. .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (05) :1029-1043
[46]   Task Placement and Resource Allocation for Edge Machine Learning: A GNN-Based Multi-Agent Reinforcement Learning Paradigm [J].
Li Y. ;
Zhang X. ;
Zeng T. ;
Duan J. ;
Wu C. ;
Wu D. ;
Chen X. .
IEEE Transactions on Parallel and Distributed Systems, 2023, 34 (12) :3073-3089
[47]   A Joint Resource Allocation, Security with Efficient Task Scheduling in Cloud Computing Using Hybrid Machine Learning Techniques [J].
Bal, Prasanta Kumar ;
Mohapatra, Sudhir Kumar ;
Das, Tapan Kumar ;
Srinivasan, Kathiravan ;
Hu, Yuh-Chung .
SENSORS, 2022, 22 (03)
[48]   A Machine-Learning Based Time Constrained Resource Allocation Scheme for Vehicular Fog Computing [J].
Chen, Xiaosha ;
Leng, Supeng ;
Zhang, Ke ;
Xiong, Kai .
CHINA COMMUNICATIONS, 2019, 16 (11) :29-41
[49]   A Machine-Learning Based Time Constrained Resource Allocation Scheme for Vehicular Fog Computing [J].
Xiaosha Chen ;
Supeng Leng ;
Ke Zhang ;
Kai Xiong .
中国通信, 2019, 16 (11) :29-41
[50]   Joint DNN Partitioning and Resource Allocation for Multiple Machine Learning-based Mobile Applications at the Network Edge [J].
Cheng, Cheng-Yu ;
Gazda, Robert ;
Liu, Hang .
ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, :3937-3942