Deep Reinforcement Learning Based Resource Management for DNN Inference in IIoT

被引:0
|
作者
Zhang, Weiting [1 ]
Yang, Dong [1 ]
Peng, Haixia [2 ]
Wu, Wen [2 ]
Quan, Wei [1 ]
Zhang, Hongke [1 ]
Shen, Xuemin [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
来源
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2020年
基金
中国国家自然科学基金; 加拿大自然科学与工程研究理事会;
关键词
DNN inference; IIoT; resource management; deep deterministic policy gradient; EDGE; NETWORKS; INTERNET;
D O I
10.1109/GLOBECOM42002.2020.9322223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the joint task assignment and resource allocation for deep neural network (DNN) inference in the device-edge-cloud based industrial Internet of things (IIoT) networks. To efficiently orchestrate the limited spectrum and computing resources in IIoT networks for massive DNN inference tasks, a resource management problem is formulated with the objective of maximizing the average inference accuracy while satisfying the quality-of-service of DNN inference tasks. Considering the strict delay requirements of inference tasks, we transform the formulated problem into a Markov decision process, and propose a deep deterministic policy gradient based learning algorithm to obtain the solution rapidly. Simulation results show that the proposed algorithm can achieve high average inference accuracy.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Deep Reinforcement Learning Based Resource Management for DNN Inference in Industrial IoT
    Zhang, Weiting
    Yang, Dong
    Haixia, Peng
    Wu, Wen
    Quan, Wei
    Zhang, Hongke
    Shen, Xuemin
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7605 - 7618
  • [2] Resource Management with Deep Reinforcement Learning
    Mao, Hongzi
    Alizadeh, Mohammad
    Menache, Ishai
    Kandula, Srikanth
    PROCEEDINGS OF THE 15TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS '16), 2016, : 50 - 56
  • [3] Deep Reinforcement Learning Based Cooperative Partial Task Offloading and Resource Allocation for IIoT Applications
    Zhang, Fan
    Han, Guangjie
    Liu, Li
    Martinez-Garcia, Miguel
    Peng, Yan
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2991 - 3006
  • [4] Deep Reinforcement Learning Assisted Federated Learning Algorithm for Data Management of IIoT
    Zhang, Peiying
    Wang, Chao
    Jiang, Chunxiao
    Han, Zhu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) : 8475 - 8484
  • [5] Deep Reinforcement Learning Algorithm for Latency-Oriented IIoT Resource Orchestration
    Zhang, Peiying
    Zhang, Yi
    Kumar, Neeraj
    Hsu, Ching-Hsien
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (08) : 7153 - 7163
  • [6] DNN Deployment, Task Offloading, and Resource Allocation for Joint Task Inference in IIoT
    Fan, Wenhao
    Chen, Zeyu
    Hao, Zhibo
    Su, Yi
    Wu, Fan
    Tang, Bihua
    Liu, Yuan'an
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (02) : 1634 - 1646
  • [7] Cooperative Partial Task Offloading and Resource Allocation for IIoT Based on Decentralized Multiagent Deep Reinforcement Learning
    Zhang, Fan
    Han, Guangjie
    Liu, Li
    Zhang, Yu
    Peng, Yan
    Li, Chao
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (03) : 5526 - 5544
  • [8] DNN Inference Acceleration for Smart Devices in Industry 5.0 by Decentralized Deep Reinforcement Learning
    Dong, Chongwu
    Shafiq, Muhammad
    Al Dabel, Maryam M.
    Sun, Yanbin
    Tian, Zhihong
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 1519 - 1530
  • [9] Deep Reinforcement Learning for Intelligent Cloud Resource Management
    Zhou, Zhi
    Luo, Ke
    Chen, Xu
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [10] Deep Reinforcement Learning for Resource Management in Network Slicing
    Li, Rongpeng
    Zhao, Zhifeng
    Sun, Qi
    I, Chih-Lin
    Yang, Chenyang
    Chen, Xianfu
    Zhao, Minjian
    Zhang, Honggang
    IEEE ACCESS, 2018, 6 : 74429 - 74441