Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks

Cited by: 1
Authors
Shahhosseini, Sina [1]
Hu, Tianyi [1]
Seo, Dongjoo [1]
Kanduri, Anil [2]
Donyanavard, Bryan [3]
Rahmani, Amir M. [1]
Dutt, Nikil [1]
Affiliations
[1] Univ Calif Irvine, Irvine, CA USA
[2] Univ Turku, Turku, Finland
[3] San Diego State Univ, San Diego, CA USA
Source
PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022) | 2022
DOI
10.1109/ISQED54688.2022.9806291
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Deep-learning-based intelligent services have become prevalent in cyber-physical applications, including smart cities and healthcare. Collaborative end-edge-cloud computing for deep learning offers a range of performance and efficiency trade-offs that can meet application requirements through computation offloading. The decision to offload computation is a communication-computation co-optimization problem that varies with both system parameters (e.g., network conditions) and workload characteristics (e.g., inputs). Identifying the optimal orchestration, considering cross-layer opportunities and requirements under varying system dynamics, is a challenging multi-dimensional problem. Although Reinforcement Learning (RL) approaches have been proposed, they require a large number of trial-and-error interactions during learning, resulting in excessive time and resource consumption. We present a Hybrid Learning (HL) orchestration framework that reduces the number of interactions with the system environment by combining model-based and model-free reinforcement learning. Our deep learning inference orchestration strategy employs RL to find the optimal orchestration policy and uses HL to accelerate the RL learning process and reduce the number of direct samplings. Experimental comparison with state-of-the-art RL-based inference orchestration shows that our HL strategy accelerates the learning process by up to 166.6x.
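The idea of combining model-based and model-free reinforcement learning to cut down on direct samplings can be illustrated with Dyna-Q, a textbook scheme. The abstract does not say which hybrid scheme the paper uses, so the sketch below is only an illustration and not the authors' framework. The two-state environment, the latency values, and every name in the code (ToyEnv, dyna_q, greedy_policy, LATENCY) are hypothetical and chosen purely for the example.

```python
import random

# Hypothetical toy problem (not from the paper): the state is the network
# condition (0 = weak, 1 = strong); action 0 runs inference locally and
# action 1 offloads it. The reward is the negative response time.
LATENCY = {(0, 0): 5.0, (0, 1): 9.0, (1, 0): 5.0, (1, 1): 2.0}
STATES, ACTIONS = (0, 1), (0, 1)


class ToyEnv:
    """Deterministic toy environment that counts real interactions."""

    def __init__(self):
        self.real_steps = 0

    def step(self, state, action):
        self.real_steps += 1
        reward = -LATENCY[(state, action)]
        next_state = 1 - state  # assumed: network condition alternates
        return reward, next_state


def dyna_q(env, real_steps=50, planning_steps=10, alpha=0.5, gamma=0.5,
           epsilon=0.2, seed=0):
    """Tabular Dyna-Q: each real sample is reused for simulated updates."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in STATES for a in ACTIONS}
    model = {}  # learned model: (state, action) -> (reward, next_state)

    def update(s, a, r, s2):
        best_next = max(q[(s2, b)] for b in ACTIONS)
        q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])

    state = 0
    for _ in range(real_steps):
        # Epsilon-greedy action selection on the real system.
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        reward, next_state = env.step(state, action)   # direct sample
        update(state, action, reward, next_state)      # model-free update
        model[(state, action)] = (reward, next_state)  # model learning
        # Planning: replay simulated transitions from the learned model.
        for _ in range(planning_steps):
            s, a = rng.choice(list(model))
            r, s2 = model[(s, a)]
            update(s, a, r, s2)
        state = next_state
    return q


def greedy_policy(q):
    """Return the highest-valued action for each state."""
    return {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in STATES}
```

Calling dyna_q(ToyEnv()) performs 50 real interactions and 10 simulated updates per interaction. With the toy latencies above, the greedy policy should run locally on a weak network and offload on a strong one. Setting planning_steps to 0 reduces the loop to plain Q-learning, which makes the trade-off between direct samples and model-based replay easy to compare.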
Pages: 1 - 6
Number of pages: 6
Related papers
  • [1] Online Learning for Orchestration of Inference in Multi-user End-edge-cloud Networks
    Shahhosseini, Sina
    Seo, Dongjoo
    Kanduri, Anil
    Hu, Tianyi
    Lim, Sung-Soo
    Donyanavard, Bryan
    Rahmani, Amir M.
    Dutt, Nikil
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (06)
  • [2] Multi-user edge service orchestration based on Deep Reinforcement Learning
    Quadri, Christian
    Ceselli, Alberto
    Rossi, Gian Paolo
    COMPUTER COMMUNICATIONS, 2023, 203 : 30 - 47
  • [3] Toward Efficient Deep Learning Inference: On-Node Heterogeneous Scheduling in Edge-Cloud Infrastructure
    Fefey, Elvis G.
    Islam, Tanzima
    2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024 : 73 - 78
  • [4] Beamforming in Multi-User MISO Cellular Networks with Deep Reinforcement Learning
    Chen, Hongchao
    Zheng, Zhe
    Liang, Xiaohui
    Liu, Yupu
    Zhao, Yi
    2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021
  • [5] Deep Reinforcement Learning for Multi-User Access Control in UAV Networks
    Cao, Yang
    Zhang, Lin
    Liang, Ying-Chang
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019
  • [6] CoEdge: Exploiting the Edge-Cloud Collaboration for Faster Deep Learning
    Hu, Liangyan
    Sun, Guodong
    Ren, Yanlong
    IEEE ACCESS, 2020, 8 : 100533 - 100541
  • [7] Hybrid SLM and LLM for Edge-Cloud Collaborative Inference
    Hao, Zixu
    Jiang, Huiqiang
    Jiang, Shiqi
    Ren, Ju
    Cao, Ting
    PROCEEDINGS OF THE 2024 WORKSHOP ON EDGE AND MOBILE FOUNDATION MODELS, EDGEFM 2024, 2024 : 36 - 41
  • [8] Optimizing Edge-Cloud Server Selection: A Multi-Objective Deep Reinforcement Learning Approach
    Le, Huyen-Trang
    Tran, Hai-Anh
    Tran, Truong X.
    2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024 : 101 - 106
  • [9] Federated Deep Reinforcement Learning for Recommendation-Enabled Edge Caching in Mobile Edge-Cloud Computing Networks
    Sun, Chuan
    Li, Xiuhua
    Wen, Junhao
    Wang, Xiaofei
    Han, Zhu
    Leung, Victor C. M.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (03) : 690 - 705
  • [10] Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches
    Meng, Fan
    Chen, Peng
    Wu, Lenan
    Cheng, Julian
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (10) : 6255 - 6267