DNN Deployment, Task Offloading, and Resource Allocation for Joint Task Inference in IIoT

被引:30
作者
Fan, Wenhao [1 ,2 ]
Chen, Zeyu [1 ,2 ]
Hao, Zhibo [1 ,2 ]
Su, Yi [1 ,2 ]
Wu, Fan [1 ,2 ]
Tang, Bihua [1 ,2 ]
Liu, Yuan'an [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Work Safety Intelligent Monitoring, Beijing 100876, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Deep neural network (DNN) inference; edge computing; industrial Internet of Things (IIoT); resource management; task offloading; EDGE; IOT;
D O I
10.1109/TII.2022.3192882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Joint task inference, which fully utilizes end edge cloud cooperation, can effectively enhance the performance of deep neural network (DNN) inference services in the industrial internet of things (IIoT) applications. In this paper, we propose a novel joint resource management scheme for a multi task and multi service scenario consisting of multiple sensors, a cloud server, and a base station equipped with an edge server . A time slotted system model is proposed, incorporating DNN deployment, data size control, task offloading, computing resource allocation, and wireless channel allocation. Among them, the DNN deployment is to deploy proper DNNs on the edge server under its total resource constraint, and the data size control is to make trade off between task inference accuracy and task transmission delay through changing task da ta size. Our goal is to minimize the total cost including total task processing delay and total error inference penalty while guaranteeing long term task queue stability and all task inference accuracy requirements. Leveraging the Lyapunov optimization, we first transform the optimization problem into a deterministic problem for each time slot. Then, a deep deterministic policy gradient (DDPG) based deep reinforcement learning (DRL) algorithm is designed to provide the near optimal solution. We further desi gn a fast numerical method for the data size control sub problem to reduce the training complexity of the DRL model, and design a penalty mechanism to prevent frequent optimizations of DNN deployment. Extensive experiments are conducted by varying differen t crucial parameters. The superiority of our scheme is demonstrated in comparison with 3 other schemes.
引用
收藏
页码:1634 / 1646
页数:13
相关论文
共 31 条
[1]  
[Anonymous], 2007, DAGM
[2]  
[Anonymous], 2021, YOLOV5 GITHUB
[3]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4]   Service Placement and Bandwidth Allocation for MEC-enabled Mobile Cloud Gaming [J].
Cao, Tuo ;
Qian, Zhuzhong ;
Wu, Kun ;
Zhou, Mingxian ;
Jin, Yibo .
2021 IEEE 22ND INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM 2021), 2021, :179-188
[5]   Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments [J].
Chen, Xing ;
Zhang, Jianshan ;
Lin, Bing ;
Chen, Zheyi ;
Wolter, Katinka ;
Min, Geyong .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (03) :683-697
[6]   Collaborative Data Caching and Computation Offloading for Multi-Service Mobile Edge Computing [J].
Feng, Hao ;
Guo, Songtao ;
Yang, Li ;
Yang, Yuanyuan .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (09) :9408-9422
[7]   Real-Time Fault Detection for IIoT Facilities Using GBRBM-Based DNN [J].
Huang, Huakun ;
Ding, Shuxue ;
Zhao, Lingjun ;
Huang, Huawei ;
Chen, Liang ;
Gao, Honghao ;
Ahmed, Syed Hassan .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) :5713-5722
[8]   Surface defect saliency of magnetic tile [J].
Huang, Yibin ;
Qiu, Congying ;
Yuan, Kui .
VISUAL COMPUTER, 2020, 36 (01) :85-96
[9]   Collaborative Cloud-Edge-End Task Offloading in Mobile-Edge Computing Networks With Limited Communication Capability [J].
Kai, Caihong ;
Zhou, Hao ;
Yi, Yibo ;
Huang, Wei .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (02) :624-634
[10]   Intelligent Fault Diagnosis for Large-Scale Rotating Machines Using Binarized Deep Neural Networks and Random Forests [J].
Li, Huifang ;
Hu, Guangzheng ;
Li, Jianqiang ;
Zhou, Mengchu .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (02) :1109-1119