Subcarrier power control for URLLC communication system via multi-agent deep reinforcement learning in IoT network

被引:0
作者
Wang, Haiyan [1 ]
Li, Xinmin [2 ,3 ]
Luo, Feiying [4 ,5 ]
Li, Jiahui [5 ]
Zhang, Xiaoqiang [5 ]
机构
[1] Jiangsu Vocat Inst Commerce, Sch Internet Things & Intelligent Engn, Nanjing, Peoples R China
[2] Chengdu Univ, Key Lab Meid & Edible Plant Resources Dev, Sichuan Educ Dept, Chengdu 610106, Peoples R China
[3] Chinese Univ Hong Kong, Guangdong Prov Key Lab Future Networks Intelligenc, Shenzhen, Peoples R China
[4] CEC Jinjiang Informat Ind Co Ltd, Chengdu, Peoples R China
[5] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang, Peoples R China
基金
中国国家自然科学基金;
关键词
ultra-reliable low-latency communication; URLLC; blocklength allocation; power control; deep reinforcement learning; RESOURCE-ALLOCATION; OPTIMIZATION;
D O I
10.1504/IJCNDS.2024.138252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Designing an intelligent resource allocation scheme to achieve the performance requirements of internet of things (IoT) devices for the future ultra-reliable low-latency communication (URLLC) network is a challenging task. In this paper, we formulate a joint blocklength allocation and power control optimisation problem to maximise the sum-rate performance with the short data packet in an uplink URLLC communication system. To alleviate this non-convex optimisation problem under the subcarrier power, blocklength and rate constraints, we firstly transfer it into a multi-agent reinforcement learning (RL) problem, in which each subcarrier works as the agent to decide its own power intelligently. Then a distributed blocklength allocation and power control scheme is proposed based on deep Q-network (DQN). To improve the rate performance in the dynamic communication environment, we design the segmented reward function depending on the communication rate and blocklength under different conditions, and adopt the experience replay strategy to avoid the dependency of training data. Finally, the simulation results show that the proposed scheme achieve the effectiveness and convergence under different settings compared to benchmark schemes.
引用
收藏
页码:374 / 392
页数:20
相关论文
共 49 条
  • [1] 6G FLAGSHIP, 2019, 6G White Paper: Key Drivers and Research Challenges for 6G UbiquitousWireless Intelligenceonline
  • [2] Al Ayidh A, 2020, UEEE INT SYM PERS IN
  • [3] [Anonymous], 2016, 3GPP TR38.912
  • [4] [Anonymous], 2018, 3GPP TS38.211
  • [5] [Anonymous], 2017, 3GPP TR38.913
  • [6] Spectrum allocation and power control for D2D communication underlay 5G cellular networks
    Benbraika, Mohamed Kamel
    Bitam, Salim
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2021, 27 (03) : 299 - 322
  • [7] A Multi-Objective Optimization Framework for URLLC With Decoding Complexity Constraints
    Celebi, Hasan Basri
    Pitarokoilis, Antonios
    Skoglund, Mikael
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (04) : 2786 - 2798
  • [8] Optimizing Resource Allocation in URLLC for Real-Time Wireless Control Systems
    Chang, Bo
    Zhang, Lei
    Li, Liying
    Zhao, Guodong
    Chen, Zhi
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (09) : 8916 - 8927
  • [9] Relay-Assisted Uplink Transmission Design of URLLC Packets
    Cheng, Jing
    Shen, Chao
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19) : 18839 - 18853
  • [10] Distributed DRL-Based Downlink Power Allocation for Hybrid RF/VLC Networks
    Ciftler, Bekir Sait
    Alwarafy, Abdulmalik
    Abdallah, Mohamed
    [J]. IEEE PHOTONICS JOURNAL, 2022, 14 (03):