Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing

被引:1
|
作者
Li, Rui [1 ]
Ouyang, Tao [1 ]
Zeng, Liekang [1 ]
Liao, Guocheng [2 ]
Zhou, Zhi [1 ]
Chen, Xu [1 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510275, Peoples R China
[2] Sun Yat Sen Univ, Sch Software Engn, Guangzhou 510275, Peoples R China
基金
美国国家科学基金会;
关键词
Task analysis; Routing; Computational modeling; Resource management; Optimization; Collaboration; Costs; Collaborative edge computing; workload allocation; unknown utility function; request routing; online mirror descent; ALLOCATION; ALGORITHMS;
D O I
10.1109/TNET.2024.3421356
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Collaborative Edge Computing (CEC) is an emerging paradigm that collaborates heterogeneous edge devices as a resource pool to compute DNN inference tasks in proximity such as edge video analytics. Nevertheless, as the key knob to improve network utility in CEC, existing works mainly focus on the workload routing strategies among edge devices with the aim of minimizing the routing cost, remaining an open question for joint workload allocation and routing optimization problem from a system perspective. To this end, this paper presents a holistic, learned optimization for CEC towards maximizing the total network utility in an online manner, even though the utility functions of task input rates are unknown a priori. In particular, we characterize the CEC system in a flow model and formulate an online learning problem in a form of cross-layer optimization. We propose a nested-loop algorithm to solve workload allocation and distributed routing iteratively, using the tools of gradient sampling and online mirror descent. To improve the convergence rate over the nested-loop version, we further devise a single-loop algorithm. Rigorous analysis is provided to show its inherent convexity, efficient convergence, as well as algorithmic optimality. Finally, extensive numerical simulations demonstrate the superior performance of our solutions.
引用
收藏
页码:4414 / 4426
页数:13
相关论文
共 50 条
  • [1] Efficient Online DNN Inference with Continuous Learning in Edge Computing
    Zeng, Yifan
    Zhou, Ruiting
    Jia, Lei
    Han, Ziyi
    Yu, Jieling
    Ma, Yue
    2024 IEEE/ACM 32ND INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE, IWQOS, 2024,
  • [2] DNN Placement and Inference in Edge Computing
    Bensalem, Mounir
    Dizdarevic, Jasenka
    Jukan, Admela
    2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 479 - 484
  • [3] DNN Real-Time Collaborative Inference Acceleration with Mobile Edge Computing
    Yang, Run
    Li, Yan
    He, Hui
    Zhang, Weizhe
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] SAFE: Intelligent Online Scheduling for Collaborative DNN Inference in Vehicular Network
    Zhou, Ruiting
    Han, Ziyi
    Zeng, Yifan
    Zhou, Zhi
    Wu, Libing
    Wang, Wei
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 3230 - 3236
  • [5] Modeling of Deep Neural Network (DNN) Placement and Inference in Edge Computing
    Bensalem, Mounir
    Dizdarevic, Jasenka
    Jukan, Admela
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [6] A Survey on Collaborative DNN Inference for Edge Intelligence
    Wei-Qing Ren
    Yu-Ben Qu
    Chao Dong
    Yu-Qian Jing
    Hao Sun
    Qi-Hui Wu
    Song Guo
    Machine Intelligence Research, 2023, 20 : 370 - 395
  • [7] A Survey on Collaborative DNN Inference for Edge Intelligence
    Ren, Wei-Qing
    Qu, Yu-Ben
    Dong, Chao
    Jing, Yu-Qian
    Sun, Hao
    Wu, Qi-Hui
    Guo, Song
    MACHINE INTELLIGENCE RESEARCH, 2023, 20 (03) : 370 - 395
  • [8] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
    Liu, Guozhi
    Dai, Fei
    Xu, Xiaolong
    Fu, Xiaodong
    Dou, Wanchun
    Kumar, Neeraj
    Bilal, Muhammad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
  • [9] Collaborative Inference Acceleration Integrating DNN Partitioning and Task Offloading in Mobile Edge Computing
    Xu, Wenxiu
    Yin, Yin
    Chen, Ningjiang
    Tu, Huan
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (11N12) : 1835 - 1863
  • [10] Elastic DNN Inference With Unpredictable Exit in Edge Computing
    Huang, Jiaming
    Gao, Yi
    Dong, Wei
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 14005 - 14016