Deep Reinforcement Learning Based Resource Management for DNN Inference in IIoT

被引:0
作者
Zhang, Weiting [1 ]
Yang, Dong [1 ]
Peng, Haixia [2 ]
Wu, Wen [2 ]
Quan, Wei [1 ]
Zhang, Hongke [1 ]
Shen, Xuemin [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
来源
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2020年
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
DNN inference; IIoT; resource management; deep deterministic policy gradient; EDGE; NETWORKS; INTERNET;
D O I
10.1109/GLOBECOM42002.2020.9322223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the joint task assignment and resource allocation for deep neural network (DNN) inference in the device-edge-cloud based industrial Internet of things (IIoT) networks. To efficiently orchestrate the limited spectrum and computing resources in IIoT networks for massive DNN inference tasks, a resource management problem is formulated with the objective of maximizing the average inference accuracy while satisfying the quality-of-service of DNN inference tasks. Considering the strict delay requirements of inference tasks, we transform the formulated problem into a Markov decision process, and propose a deep deterministic policy gradient based learning algorithm to obtain the solution rapidly. Simulation results show that the proposed algorithm can achieve high average inference accuracy.
引用
收藏
页数:6
相关论文
共 50 条
[41]   English information teaching resource sharing based on deep reinforcement learning [J].
Han, Chao .
INTERNATIONAL JOURNAL OF CONTINUING ENGINEERING EDUCATION AND LIFE-LONG LEARNING, 2024, 34 (01) :53-65
[42]   Deep Reinforcement Learning based Computation Offloading and Resource Allocation for MEC [J].
Li, Ji ;
Gao, Hui ;
Lv, Tiejun ;
Lu, Yueming .
2018 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2018,
[43]   Joint Task Offloading and Resource Allocation in Multi-UAV Multi-Server Systems: An Attention-Based Deep Reinforcement Learning Approach [J].
Wu, Guohua ;
Liu, Zelin ;
Fan, Mingfeng ;
Wu, Keyu .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (08) :11964-11978
[44]   An intelligent resource management method in SDN based fog computing using reinforcement learning [J].
Anoushee, Milad ;
Fartash, Mehdi ;
Torkestani, Javad Akbari .
COMPUTING, 2024, 106 (04) :1051-1080
[45]   Reinforcement learning-based solution for resource management in fog computing: A comprehensive survey [J].
Ghafari, Reyhane ;
Mansouri, Najme .
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 276
[46]   Deep Reinforcement Learning for Optimal Resource Allocation in Blockchain-based IoV Secure Systems [J].
Xiao, Hongzhi ;
Qiu, Chen ;
Yang, Qinglin ;
Huang, Huakun ;
Wang, Junbo ;
Su, Chunhua .
2020 16TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2020), 2020, :137-144
[47]   Deep Reinforcement Learning Based Resource Allocation in Multi-UAV-Aided MEC Networks [J].
Chen, Jingxuan ;
Cao, Xianbin ;
Yang, Peng ;
Xiao, Meng ;
Ren, Siqiao ;
Zhao, Zhongliang ;
Wu, Dapeng Oliver .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) :296-309
[48]   Blockchain-Based Edge Computing Resource Allocation in IoT: A Deep Reinforcement Learning Approach [J].
He, Ying ;
Wang, Yuhang ;
Qiu, Chao ;
Lin, Qiuzhen ;
Li, Jianqiang ;
Ming, Zhong .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) :2226-2237
[49]   Optimization of resource allocation strategy for high-speed railway based on deep reinforcement learning [J].
Gao, Xu ;
Zhao, Junhui ;
Zhang, Qingmiao ;
Han, Haitao .
PHYSICAL COMMUNICATION, 2024, 66
[50]   Joint resource allocation and security redundancy for autonomous driving based on deep reinforcement learning algorithm [J].
Zhang, Han ;
Liang, Hongbin ;
Wang, Lei ;
Yao, Yiting ;
Lin, Bin ;
Zhao, Dongmei .
IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (06) :1109-1120