Online Learning for Orchestration of Inference in Multi-user End-edge-cloud Networks

被引:13
|
作者
Shahhosseini, Sina [1 ]
Seo, Dongjoo [1 ]
Kanduri, Anil [2 ]
Hu, Tianyi [1 ]
Lim, Sung-Soo [3 ]
Donyanavard, Bryan [4 ]
Rahmani, Amir M. [1 ]
Dutt, Nikil [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92717 USA
[2] Univ Turku, Turku, Finland
[3] Kookmin Univ, Seoul, South Korea
[4] San Diego State Univ, San Diego, CA 92182 USA
关键词
Edge computing; online learning; computation offloading; neural network; NEURAL-NETWORKS; MOBILE EDGE; DEEP; OPTIMIZATION; INTERNET; AWARE;
D O I
10.1145/3520129
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep-learning-based intelligent services have become prevalent in cyber-physical applications, including smart cities and health-care. Deploying deep-learning-based intelligence near the end-user enhances privacy protection, responsiveness, and reliability. Resource-constrained end-devices must be carefully managed to meet the latency and energy requirements of computationally intensive deep learning services. Collaborative end-edge-cloud computing for deep learning provides a range of performance and efficiency that can address application requirements through computation offloading. The decision to offload computation is a communication-computation co-optimization problem that varies with both system parameters (e.g., network condition) and workload characteristics (e.g., inputs). However, deep learning model optimization provides another source of tradeoff between latency and model accuracy. An end-to-end decision-making solution that considers such computation-communication problem is required to synergistically find the optimal offloading policy and model for deep learning services. To this end, we propose a reinforcement-learning-based computation offloading solution that learns optimal offloading policy considering deep learning model selection techniques to minimize response time while providing sufficient accuracy. We demonstrate the effectiveness of our solution for edge devices in an end-edge-cloud system and evaluate with a real-setup implementation using multiple AWS and ARM core configurations. Our solution provides 35% speedup in the average response time compared to the state-of-the-art with less than 0.9% accuracy reduction, demonstrating the promise of our online learning framework for orchestrating DL inference in end-edge-cloud systems.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks
    Shahhosseini, Sina
    Hu, Tianyi
    Seo, Dongjoo
    Kanduri, Anil
    Donyanavard, Bryan
    Rahmani, Amir M.
    Dutt, Nikil
    PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022), 2022, : 1 - 6
  • [2] IoT intelligence empowered by end-edge-cloud orchestration
    Zhang, Yaoxue
    Lyu, Feng
    Yang, Peng
    Wu, Wen
    Gao, Jie
    CHINA COMMUNICATIONS, 2022, 19 (07) : 152 - 156
  • [3] Hyperdimensional Hybrid Learning on End-Edge-Cloud Networks
    Issa, Mariam
    Shahhosseini, Sina
    Ni, Yang
    Hu, Tianyi
    Abraham, Danny
    Rahmani, Amir M.
    Dutt, Nikil
    Imani, Mohsen
    2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 652 - 655
  • [4] Multi-user edge service orchestration based on Deep Reinforcement Learning
    Quadri, Christian
    Ceselli, Alberto
    Rossi, Gian Paolo
    COMPUTER COMMUNICATIONS, 2023, 203 : 30 - 47
  • [5] Towards Accurate and Fast Federated Learning in End-Edge-Cloud Orchestrated Networks
    Li, Mingze
    Sun, Peng
    Zhou, Huan
    Zhao, Liang
    Liu, Xuxun
    Leung, Victor C. M.
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 1079 - 1080
  • [6] Two-Stage Community Energy Trading Under End-Edge-Cloud Orchestration
    Li, Xiangyu
    Li, Chaojie
    Liu, Xuan
    Chen, Guo
    Dong, Zhao Yang
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) : 1961 - 1972
  • [7] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
    Liu, Guozhi
    Dai, Fei
    Xu, Xiaolong
    Fu, Xiaodong
    Dou, Wanchun
    Kumar, Neeraj
    Bilal, Muhammad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
  • [8] Task Offloading for End-Edge-Cloud Orchestrated Computing in Mobile Networks
    Sun, Chuan
    Li, Hui
    Li, Xiuhua
    Wen, Junhao
    Xiong, Qingyu
    Wang, Xiaofei
    Leung, Victor C. M.
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [9] End-Edge-Cloud Collaborative Computing for Deep Learning: A Comprehensive Survey
    Wang, Yingchao
    Yang, Chen
    Lan, Shulin
    Zhu, Liehuang
    Zhang, Yan
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2024, 26 (04): : 2647 - 2683
  • [10] Serial Distributed Detection in Multihop Multirelay Wireless Sensor Networks With End-Edge-Cloud Orchestration Under Graph-Powered Computing
    Zhang, Gaoyuan
    He, Xiaodong
    Mu, Yu
    Wang, Weiguang
    Li, Yiwei
    Ji, Baofeng
    Mumtaz, Shahid
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 3720 - 3733