Deep Reinforcement Learning-Based Multichannel Access for Industrial Wireless Networks With Dynamic Multiuser Priority

被引：12

作者：

Liu, Xiaoyu ^{[1
,2
,3
,4
]}

Xu, Chi ^{[1
,2
,3
]}

Yu, Haibin ^{[1
,2
,3
]}

Zeng, Peng ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China

[2] Chinese Acad Sci, Key Lab Networked Control Syst, Shenyang 110016, Peoples R China

[3] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China

[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2022年 / 18卷 / 10期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Deep reinforcement learning; dynamic priority; industrial wireless networks (IWNs); multichannel access; quality of service; SPECTRUM ACCESS; COMMUNICATION; TECHNOLOGY; ALLOCATION;

D O I：

10.1109/TII.2021.3139349

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In Industry 4.0, massive heterogeneous industrial devices generate a great deal of data with different quality of service requirements, and communicate via industrial wireless networks (IWNs). However, the limited time-frequency resources of IWNs cannot well support the high concurrent access of massive industrial devices with strict real-time and reliable communication requirements. To address this problem, a deep reinforcement learning-based dynamic priority multichannel access (DRL-DPMCA) algorithm is proposed in this article. Firstly, according to the time-sensitivity of industrial data, industrial devices are assigned with different priorities, based on which their channel access probabilities are dynamically adjusted. Then, the Markov decision process is utilized to model the dynamic priority multichannel access problem. To cope with the explosion of state space caused by the multichannel access of massive industrial devices with dynamic priorities, DRL is used to establish the mapping from states to actions. Next, the long-term cumulative reward is maximized to obtain an effective policy. Especially, with joint consideration of the access reward and priority reward, a compound reward for multichannel access and dynamic priority is designed. For breaking the time correlation of training data while accelerating the convergence of DRL-DPMCA, an experience replay with experience-weight is proposed to store and sample experiences categorically. Besides, the gated recurrent unit, dueling architecture and step-by-step epsilon-greedy method are employed to make states more comprehensive and reduce model oscillation. Extensive experiments show that, compared with slotted-Aloha and deep Q network algorithms, DRL-DPMCA converges quickly, and guarantees the highest channel access probability and the minimum queuing delay for high-priority industrial devices in the context of minimum access conflict and nearly 100% channel utilization.

引用

页码：7048 / 7058

页数：11

共 50 条

[21] Model-Based Deep Reinforcement Learning Framework for Channel Access in Wireless Networks
Park, Jong In
Chae, Jun Byung
Choi, Kae Won
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 10150 - 10167
[22] Deep Reinforcement Learning Based Resource Management for Multi-Access Edge Computing in Vehicular Networks
Peng, Haixia
Shen, Xuemin
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (04): : 2416 - 2428
[23] Deep Reinforcement Learning-Based Dynamic Spectrum Access for D2D Communication Underlay Cellular Networks
Huang, Jingfei
Yang, Yang
He, Gang
Xiao, Yang
Liu, Jun
IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2614 - 2618
[24] Deep Reinforcement Learning-Based Routing on Software-Defined Networks
Kim, Gyungmin
Kim, Yohan
Lim, Hyuk
IEEE ACCESS, 2022, 10 : 18121 - 18133
[25] Deep-Reinforcement Learning Multiple Access for Heterogeneous Wireless Networks
Yu, Yiding
Wang, Taotao
Liew, Soung Chang
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (06) : 1277 - 1290
[26] Dynamic multiple access based on deep reinforcement learning for Internet of Things
Liu, Xin
Li, Zengqi
COMPUTER COMMUNICATIONS, 2023, 210 : 331 - 341
[27] Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks
Luong, Phuong
Gagnon, Francois
Tran, Le-Nam
Labeau, Fabrice
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) : 7610 - 7625
[28] Deep Reinforcement Learning Based Dynamic Resource Allocation in Cloud Radio Access Networks
Rodoshi, Rehenuma Tasnim
Kim, Taewoon
Choi, Wooyeol
11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 618 - 623
[29] Unveiling the Effects of Experience Replay on Deep Reinforcement Learning-based Power Allocation in Wireless Networks
Kopic, Amna
Perenda, Erma
Gacanin, Haris
2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
[30] Multi-Agent Deep Reinforcement Learning-Based Resource Allocation for Cognitive Radio Networks
Mei, Ruru
Wang, Zhugang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (03) : 4744 - 4757

← 1 2 3 4 5 →