Deep Reinforcement Learning-Based Multichannel Access for Industrial Wireless Networks With Dynamic Multiuser Priority

被引:12
作者
Liu, Xiaoyu [1 ,2 ,3 ,4 ]
Xu, Chi [1 ,2 ,3 ]
Yu, Haibin [1 ,2 ,3 ]
Zeng, Peng [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Chinese Acad Sci, Key Lab Networked Control Syst, Shenyang 110016, Peoples R China
[3] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Deep reinforcement learning; dynamic priority; industrial wireless networks (IWNs); multichannel access; quality of service; SPECTRUM ACCESS; COMMUNICATION; TECHNOLOGY; ALLOCATION;
D O I
10.1109/TII.2021.3139349
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Industry 4.0, massive heterogeneous industrial devices generate a great deal of data with different quality of service requirements, and communicate via industrial wireless networks (IWNs). However, the limited time-frequency resources of IWNs cannot well support the high concurrent access of massive industrial devices with strict real-time and reliable communication requirements. To address this problem, a deep reinforcement learning-based dynamic priority multichannel access (DRL-DPMCA) algorithm is proposed in this article. Firstly, according to the time-sensitivity of industrial data, industrial devices are assigned with different priorities, based on which their channel access probabilities are dynamically adjusted. Then, the Markov decision process is utilized to model the dynamic priority multichannel access problem. To cope with the explosion of state space caused by the multichannel access of massive industrial devices with dynamic priorities, DRL is used to establish the mapping from states to actions. Next, the long-term cumulative reward is maximized to obtain an effective policy. Especially, with joint consideration of the access reward and priority reward, a compound reward for multichannel access and dynamic priority is designed. For breaking the time correlation of training data while accelerating the convergence of DRL-DPMCA, an experience replay with experience-weight is proposed to store and sample experiences categorically. Besides, the gated recurrent unit, dueling architecture and step-by-step epsilon-greedy method are employed to make states more comprehensive and reduce model oscillation. Extensive experiments show that, compared with slotted-Aloha and deep Q network algorithms, DRL-DPMCA converges quickly, and guarantees the highest channel access probability and the minimum queuing delay for high-priority industrial devices in the context of minimum access conflict and nearly 100% channel utilization.
引用
收藏
页码:7048 / 7058
页数:11
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks
    Wang, Shangxing
    Liu, Hanpeng
    Gomes, Pedro Henrique
    Krishnamachari, Bhaskar
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2018, 4 (02) : 257 - 265
  • [2] Deep-Reinforcement-Learning-Based Distributed Dynamic Spectrum Access in Multiuser Multichannel Cognitive Radio Internet of Things Networks
    Zhang, Xiaohui
    Chen, Ze
    Zhang, Yinghui
    Liu, Yang
    Jin, Minglu
    Qiu, Tianshuang
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (10): : 17495 - 17509
  • [3] Deep Reinforcement Learning Based Dynamic Multichannel Access in HetNets
    Wang, Shaoyang
    Lv, Tiejun
    2019 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2019,
  • [4] Deep Reinforcement Learning for Dynamic Spectrum Access in Wireless Networks
    Xu, Y.
    Yu, J.
    Headley, W. C.
    Buehrer, R. M.
    2018 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2018), 2018, : 207 - 212
  • [5] Deep Reinforcement Learning-Based Edge Caching in Wireless Networks
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2020, 6 (01) : 48 - 61
  • [6] Deep Reinforcement Learning for Dynamic Radio Access Selection over Future Wireless Networks
    Carballo Gonzalez, Claudia
    Fontes Pupo, Ernesto
    Pereira-Ruisanchez, Dariel
    Atzori, Luigi
    Murroni, Maurizio
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2022,
  • [7] Wireless Access Control in Edge-Aided Disaster Response: A Deep Reinforcement Learning-Based Approach
    Zhou, Hang
    Wang, Xiaoyan
    Umehira, Masahiro
    Chen, Xianfu
    Wu, Celimuge
    Ji, Yusheng
    IEEE ACCESS, 2021, 9 : 46600 - 46611
  • [8] Adversarial Jamming Attacks on Deep Reinforcement Learning Based Dynamic Multichannel Access
    Zhong, Chen
    Wang, Feng
    Gursoy, M. Cenk
    Velipasalar, Senem
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [9] RDRL: A Recurrent Deep Reinforcement Learning Scheme for Dynamic Spectrum Access in Reconfigurable Wireless Networks
    Chen, Miaojiang
    Liu, Anfeng
    Liu, Wei
    Ota, Kaoru
    Dong, Mianxiong
    Xiong, N. Neal
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (02): : 364 - 376
  • [10] Multiple Access for Heterogeneous Wireless Networks with Imperfect Channels Based on Deep Reinforcement Learning
    Xu, Yangzhou
    Lou, Jia
    Wang, Tiantian
    Shi, Junxiao
    Zhang, Tao
    Paul, Agyemang
    Wu, Zhefu
    ELECTRONICS, 2023, 12 (23)