Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks

被引：1

作者：

Shahhosseini, Sina ^{[1
]}

Hu, Tianyi ^{[1
]}

Seo, Dongjoo ^{[1
]}

Kanduri, Anil ^{[2
]}

Donyanavard, Bryan ^{[3
]}

Rahmani, Amir M. ^{[1
]}

Dutt, Nikil ^{[1
]}

机构：

[1] Univ Calif Irvine, Irvine, CA USA

[2] Univ Turku, Turku, Finland

[3] San Diego State Univ, San Diego, CA USA

来源：

PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022) | 2022年

关键词：

D O I：

10.1109/ISQED54688.2022.9806291

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Deep-learning-based intelligent services have become prevalent in cyber-physical applications including smart cities and health-care. Collaborative end-edge-cloud computing for deep learning provides a range of performance and efficiency that can address application requirements through computation offloading. The decision to offload computation is a communication-computation co-optimization problem that varies with both system parameters (e.g., network condition) and workload characteristics (e.g., inputs). Identifying optimal orchestration considering the cross-layer opportunities and requirements in the face of varying system dynamics is a challenging multi-dimensional problem. While Reinforcement Learning (RL) approaches have been proposed earlier, they suffer from a large number of trial-and-errors during the learning process resulting in excessive time and resource consumption. We present a Hybrid Learning orchestration framework that reduces the number of interactions with the system environment by combining model-based and model-free reinforcement learning. Our Deep Learning inference orchestration strategy employs reinforcement learning to find the optimal orchestration policy. Furthermore, we deploy Hybrid Learning (HL) to accelerate the RL learning process and reduce the number of direct samplings. We demonstrate efficacy of our HL strategy through experimental comparison with state-of-the-art RL-based inference orchestration, demonstrating that our HL strategy accelerates the learning process by up to 166.6 x.

引用

页码：1 / 6

页数：6

共 50 条

[1] Online Learning for Orchestration of Inference in Multi-user End-edge-cloud Networks
Shahhosseini, Sina
Seo, Dongjoo
Kanduri, Anil
Hu, Tianyi
Lim, Sung-Soo
Donyanavard, Bryan
Rahmani, Amir M.
Dutt, Nikil
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (06)
[2] Multi-user edge service orchestration based on Deep Reinforcement Learning
Quadri, Christian
Ceselli, Alberto
Rossi, Gian Paolo
COMPUTER COMMUNICATIONS, 2023, 203 : 30 - 47
[3] Toward Efficient Deep Learning Inference: On-Node Heterogeneous Scheduling in Edge-Cloud Infrastructure
Fefey, Elvis G.
Islam, Tanzima
2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 73 - 78
[4] Beamforming in Multi-User MISO Cellular Networks with Deep Reinforcement Learning
Chen, Hongchao
Zheng, Zhe
Liang, Xiaohui
Liu, Yupu
Zhao, Yi
2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
[5] Deep Reinforcement Learning for Multi-User Access Control in UAV Networks
Cao, Yang
Zhang, Lin
Liang, Ying-Chang
ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[6] CoEdge: Exploiting the Edge-Cloud Collaboration for Faster Deep Learning
Hu, Liangyan
Sun, Guodong
Ren, Yanlong
IEEE ACCESS, 2020, 8 : 100533 - 100541
[7] Hybrid SLM and LLM for Edge-Cloud Collaborative Inference
Hao, Zixu
Jiang, Huiqiang
Jiang, Shiqi
Ren, Ju
Cao, Ting
PROCEEDINGS OF THE 2024 WORKSHOP ON EDGE AND MOBILE FOUNDATION MODELS, EDGEFM 2024, 2024, : 36 - 41
[8] Optimizing Edge-Cloud Server Selection: A Multi-Objective Deep Reinforcement Learning Approach
Le, Huyen-Trang
Tran, Hai-Anh
Tran, Truong X.
2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 101 - 106
[9] Federated Deep Reinforcement Learning for Recommendation-Enabled Edge Caching in Mobile Edge-Cloud Computing Networks
Sun, Chuan
Li, Xiuhua
Wen, Junhao
Wang, Xiaofei
Han, Zhu
Leung, Victor C. M.
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (03) : 690 - 705
[10] Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches
Meng, Fan
Chen, Peng
Wu, Lenan
Cheng, Julian
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (10) : 6255 - 6267

← 1 2 3 4 5 →