Online Learning for Orchestration of Inference in Multi-user End-edge-cloud Networks

被引：13

作者：

Shahhosseini, Sina ^{[1
]}

Seo, Dongjoo ^{[1
]}

Kanduri, Anil ^{[2
]}

Hu, Tianyi ^{[1
]}

Lim, Sung-Soo ^{[3
]}

Donyanavard, Bryan ^{[4
]}

Rahmani, Amir M. ^{[1
]}

Dutt, Nikil ^{[1
]}

机构：

[1] Univ Calif Irvine, Irvine, CA 92717 USA

[2] Univ Turku, Turku, Finland

[3] Kookmin Univ, Seoul, South Korea

[4] San Diego State Univ, San Diego, CA 92182 USA

来源：

ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS | 2022年 / 21卷 / 06期

关键词：

Edge computing; online learning; computation offloading; neural network; NEURAL-NETWORKS; MOBILE EDGE; DEEP; OPTIMIZATION; INTERNET; AWARE;

D O I：

10.1145/3520129

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep-learning-based intelligent services have become prevalent in cyber-physical applications, including smart cities and health-care. Deploying deep-learning-based intelligence near the end-user enhances privacy protection, responsiveness, and reliability. Resource-constrained end-devices must be carefully managed to meet the latency and energy requirements of computationally intensive deep learning services. Collaborative end-edge-cloud computing for deep learning provides a range of performance and efficiency that can address application requirements through computation offloading. The decision to offload computation is a communication-computation co-optimization problem that varies with both system parameters (e.g., network condition) and workload characteristics (e.g., inputs). However, deep learning model optimization provides another source of tradeoff between latency and model accuracy. An end-to-end decision-making solution that considers such computation-communication problem is required to synergistically find the optimal offloading policy and model for deep learning services. To this end, we propose a reinforcement-learning-based computation offloading solution that learns optimal offloading policy considering deep learning model selection techniques to minimize response time while providing sufficient accuracy. We demonstrate the effectiveness of our solution for edge devices in an end-edge-cloud system and evaluate with a real-setup implementation using multiple AWS and ARM core configurations. Our solution provides 35% speedup in the average response time compared to the state-of-the-art with less than 0.9% accuracy reduction, demonstrating the promise of our online learning framework for orchestrating DL inference in end-edge-cloud systems.

引用

页数：25

共 50 条

[1] Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks
Shahhosseini, Sina
Hu, Tianyi
Seo, Dongjoo
Kanduri, Anil
Donyanavard, Bryan
Rahmani, Amir M.
Dutt, Nikil
PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022), 2022, : 1 - 6
[2] IoT intelligence empowered by end-edge-cloud orchestration
Zhang, Yaoxue
Lyu, Feng
Yang, Peng
Wu, Wen
Gao, Jie
CHINA COMMUNICATIONS, 2022, 19 (07) : 152 - 156
[3] Hyperdimensional Hybrid Learning on End-Edge-Cloud Networks
Issa, Mariam
Shahhosseini, Sina
Ni, Yang
Hu, Tianyi
Abraham, Danny
Rahmani, Amir M.
Dutt, Nikil
Imani, Mohsen
2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 652 - 655
[4] Multi-user edge service orchestration based on Deep Reinforcement Learning
Quadri, Christian
Ceselli, Alberto
Rossi, Gian Paolo
COMPUTER COMMUNICATIONS, 2023, 203 : 30 - 47
[5] Towards Accurate and Fast Federated Learning in End-Edge-Cloud Orchestrated Networks
Li, Mingze
Sun, Peng
Zhou, Huan
Zhao, Liang
Liu, Xuxun
Leung, Victor C. M.
2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 1079 - 1080
[6] Two-Stage Community Energy Trading Under End-Edge-Cloud Orchestration
Li, Xiangyu
Li, Chaojie
Liu, Xuan
Chen, Guo
Dong, Zhao Yang
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) : 1961 - 1972
[7] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
Liu, Guozhi
Dai, Fei
Xu, Xiaolong
Fu, Xiaodong
Dou, Wanchun
Kumar, Neeraj
Bilal, Muhammad
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
[8] Task Offloading for End-Edge-Cloud Orchestrated Computing in Mobile Networks
Sun, Chuan
Li, Hui
Li, Xiuhua
Wen, Junhao
Xiong, Qingyu
Wang, Xiaofei
Leung, Victor C. M.
2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
[9] End-Edge-Cloud Collaborative Computing for Deep Learning: A Comprehensive Survey
Wang, Yingchao
Yang, Chen
Lan, Shulin
Zhu, Liehuang
Zhang, Yan
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2024, 26 (04): : 2647 - 2683
[10] Serial Distributed Detection in Multihop Multirelay Wireless Sensor Networks With End-Edge-Cloud Orchestration Under Graph-Powered Computing
Zhang, Gaoyuan
He, Xiaodong
Mu, Yu
Wang, Weiguang
Li, Yiwei
Ji, Baofeng
Mumtaz, Shahid
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 3720 - 3733

← 1 2 3 4 5 →