Joint Optimization With DNN Partitioning and Resource Allocation in Mobile Edge Computing

Cited by: 26
Authors
Dong, Chongwu [1]
Hu, Sheng [1]
Chen, Xi [1]
Wen, Wushao [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
Source
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2021 / Vol. 18 / Issue 04
Funding
National Natural Science Foundation of China;
Keywords
Task analysis; Computational modeling; Costs; Resource management; Optimization; Artificial intelligence; Hardware; Computation offloading; Lyapunov Optimization; edge intelligence; mobile edge computing; deep learning; CLOUD; INTELLIGENCE; NETWORKS; MODEL;
DOI
10.1109/TNSM.2021.3116665
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
With the rapid development of computing power and artificial intelligence, IoT devices equipped with ubiquitous sensors are gradually installed with intelligence. People can enjoy many conveniences with intelligent devices, such as face recognition, video understanding, and motion estimation. Currently, deep neural networks are the mainstream technology in intelligent mobile applications. Inspired by DNN model partition schemes, the paradigm of edge computing could be utilized collaboratively to improve the effectiveness of intelligent task execution in IoT devices. However, due to the dynamics of the wireless network environment and the increasing number of IoT devices, a DNN partition policy without adequate consideration would pose a significant challenge to the efficiency of task inference. Moreover, the shortage and high rental cost of edge computing resources make the optimization of DNN-based task execution more difficult. To cope with those situations, we propose a joint method by a self-adaptive DNN partition with cost-effective resource allocation to facilitate collaborative computation between IoT devices and edge servers. Our proposed online algorithm can be proved to ensure the overall rental cost within an upper bound above the optimal solution while guaranteeing the latency for DNN-based task inference. To evaluate the performance of our strategy, we conduct extensive trace-driven illustrative studies and show that the proposed method can achieve sub-optimal results and outperforms other alternative methods.
Pages: 3973 - 3986
Number of pages: 14
Related papers
50 in total
  • [21] End-to-End Delay Minimization based on Joint Optimization of DNN Partitioning and Resource Allocation for Cooperative Edge Inference
    Ye, Xinrui
    Sun, Yanzan
    Wen, Dingzhu
    Pan, Guanjin
    Zhang, Shunqing
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [22] A Bilevel Optimization Approach for Joint Offloading Decision and Resource Allocation in Cooperative Mobile Edge Computing
    Huang, Pei-Qiu
    Wang, Yong
    Wang, Kezhi
    Liu, Zhi-Zhong
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (10) : 4228 - 4241
  • [23] A joint optimization scheme of content caching and resource allocation for internet of vehicles in mobile edge computing
    Mu Zhang
    Song Wang
    Qing Gao
    Journal of Cloud Computing, 9
  • [24] Multiobjective Optimization for Joint Task Offloading, Power Assignment, and Resource Allocation in Mobile Edge Computing
    Wang, Peng
    Li, Kenli
    Xiao, Bin
    Li, Keqin
    IEEE INTERNET OF THINGS JOURNAL, 2021, 9 (14) : 11737 - 11748
  • [25] A joint optimization scheme of content caching and resource allocation for internet of vehicles in mobile edge computing
    Zhang, Mu
    Wang, Song
    Gao, Qing
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2020, 9 (01):
  • [26] Service Characteristics-Oriented Joint Optimization of Radio and Computing Resource Allocation in Mobile-Edge Computing
    Feng, Jie
    Liu, Lei
    Pei, Qingqi
    Hou, Fen
    Yang, Tingting
    Wu, Jinsong
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11) : 9407 - 9421
  • [27] A Novel Joint Offloading and Resource Allocation Scheme for Mobile Edge Computing
    Dab, Boutheina
    Aitsaadi, Nadjib
    Langar, Rami
    2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
  • [28] Joint DNN partitioning and task offloading in mobile edge computing via deep reinforcement learning
    Jianbing Zhang
    Shufang Ma
    Zexiao Yan
    Jiwei Huang
    Journal of Cloud Computing, 12
  • [29] Joint DNN partitioning and task offloading in mobile edge computing via deep reinforcement learning
    Zhang, Jianbing
    Ma, Shufang
    Yan, Zexiao
    Huang, Jiwei
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):
  • [30] Bayesian Optimization for Task Offloading and Resource Allocation in Mobile Edge Computing
    Yan, Jia
    Lu, Qin
    Giannakis, Georgios B.
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 1086 - 1090