Subcarrier power control for URLLC communication system via multi-agent deep reinforcement learning in IoT network

被引：0

作者：

Wang, Haiyan ^{[1
]}

Li, Xinmin ^{[2
,3
]}

Luo, Feiying ^{[4
,5
]}

Li, Jiahui ^{[5
]}

Zhang, Xiaoqiang ^{[5
]}

机构：

[1] Jiangsu Vocat Inst Commerce, Sch Internet Things & Intelligent Engn, Nanjing, Peoples R China

[2] Chengdu Univ, Key Lab Meid & Edible Plant Resources Dev, Sichuan Educ Dept, Chengdu 610106, Peoples R China

[3] Chinese Univ Hong Kong, Guangdong Prov Key Lab Future Networks Intelligenc, Shenzhen, Peoples R China

[4] CEC Jinjiang Informat Ind Co Ltd, Chengdu, Peoples R China

[5] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS | 2024年 / 30卷 / 03期

基金：

中国国家自然科学基金;

关键词：

ultra-reliable low-latency communication; URLLC; blocklength allocation; power control; deep reinforcement learning; RESOURCE-ALLOCATION; OPTIMIZATION;

D O I：

10.1504/IJCNDS.2024.138252

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Designing an intelligent resource allocation scheme to achieve the performance requirements of internet of things (IoT) devices for the future ultra-reliable low-latency communication (URLLC) network is a challenging task. In this paper, we formulate a joint blocklength allocation and power control optimisation problem to maximise the sum-rate performance with the short data packet in an uplink URLLC communication system. To alleviate this non-convex optimisation problem under the subcarrier power, blocklength and rate constraints, we firstly transfer it into a multi-agent reinforcement learning (RL) problem, in which each subcarrier works as the agent to decide its own power intelligently. Then a distributed blocklength allocation and power control scheme is proposed based on deep Q-network (DQN). To improve the rate performance in the dynamic communication environment, we design the segmented reward function depending on the communication rate and blocklength under different conditions, and adopt the experience replay strategy to avoid the dependency of training data. Finally, the simulation results show that the proposed scheme achieve the effectiveness and convergence under different settings compared to benchmark schemes.

引用

页码：374 / 392

页数：20

共 49 条

[1]

6G FLAGSHIP, 2019, 6G White Paper: Key Drivers and Research Challenges for 6G UbiquitousWireless Intelligenceonline

[2]

Al Ayidh A, 2020, UEEE INT SYM PERS IN

[3]

[Anonymous], 2016, 3GPP TR38.912

[4]

[Anonymous], 2018, 3GPP TS38.211

[5]

[Anonymous], 2017, 3GPP TR38.913

[6] Spectrum allocation and power control for D2D communication underlay 5G cellular networks [J].

Benbraika, Mohamed Kamel ;

Bitam, Salim .

INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2021, 27 (03) :299-322

[7] A Multi-Objective Optimization Framework for URLLC With Decoding Complexity Constraints [J].

Celebi, Hasan Basri ;

Pitarokoilis, Antonios ;

Skoglund, Mikael .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (04) :2786-2798

[8] Optimizing Resource Allocation in URLLC for Real-Time Wireless Control Systems [J].

Chang, Bo ;

Zhang, Lei ;

Li, Liying ;

Zhao, Guodong ;

Chen, Zhi .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (09) :8916-8927

[9] Relay-Assisted Uplink Transmission Design of URLLC Packets [J].

Cheng, Jing ;

Shen, Chao .

IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (19) :18839-18853

[10] Distributed DRL-Based Downlink Power Allocation for Hybrid RF/VLC Networks [J].

Ciftler, Bekir Sait ;

Alwarafy, Abdulmalik ;

Abdallah, Mohamed .

IEEE PHOTONICS JOURNAL, 2022, 14 (03)

← 1 2 3 4 5 →