Multi-Agent Reinforcement Learning Based Uplink OFDMA for IEEE 802.11ax Networks

被引：2

作者：

Han, Mingqi ^{[1
]}

Sun, Xinghua ^{[1
]}

Zhan, Wen ^{[1
]}

Gao, Yayu ^{[2
]}

Jiang, Yuan ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China

[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2024年 / 23卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Heuristic algorithms; Throughput; Uplink; Computational complexity; Sun; IEEE 802.11ax Standard; Optimization; Multiple access; multi-agent reinforcement learning; multi-objective reinforcement learning; mean-field reinforcement learning; DYNAMIC MULTICHANNEL ACCESS; MINIMIZATION; INFORMATION;

D O I：

10.1109/TWC.2024.3355276

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the IEEE 802.11ax Wireless Local Area Networks (WLANs), Orthogonal Frequency Division Multiple Access (OFDMA) has been applied to enable the high-throughput WLAN amendment. However, with the growth of the number of devices, it is difficult for the Access Point (AP) to schedule uplink transmissions, which calls for an efficient access mechanism in the OFDMA uplink system. Based on Multi-Agent Proximal Policy Optimization (MAPPO), we propose a Mean-Field Multi-Agent Proximal Policy Optimization (MFMAPPO) algorithm to improve the throughput and guarantee the fairness. Motivated by the Mean-Field games (MFGs) theory, a novel global state and action design are proposed to ensure the convergence of MFMAPPO in the massive access scenario. The Multi-Critic Single-Policy (MCSP) architecture is deployed in the proposed MFMAPPO so that each agent can learn the optimal channel access strategy to improve the throughput while satisfying fairness requirement. Extensive simulation experiments are performed to show that the MFMAPPO algorithm 1) has low computational complexity that increases linearly with respect to the number of stations 2) achieves nearly optimal throughput and fairness performance in the massive access scenario, 3) can adapt to various diverse and dynamic traffic conditions without retraining, as well as the traffic condition different from training traffic.

引用

页码：8868 / 8882

页数：15

共 50 条

[31] Deep learning based adaptive modulation and coding for uplink multi-user SIMO transmissions in IEEE 802.11ax WLANs
Elwekeil, Mohamed
Wang, Taotao
Zhang, Shengli
WIRELESS NETWORKS, 2021, 27 (08) : 5217 - 5227
[32] Deep learning based adaptive modulation and coding for uplink multi-user SIMO transmissions in IEEE 802.11ax WLANs
Mohamed Elwekeil
Taotao Wang
Shengli Zhang
Wireless Networks, 2021, 27 : 5217 - 5227
[33] Distributed Multi-Agent Deep Q-Learning for Fast Roaming in IEEE 802.11ax Wi-Fi Systems
Wang, Ting-Hui
Shen, Li-Hsiang
Feng, Kai-Ten
2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 433 - 438
[34] Performance Evaluation of IEEE 802.11ax for Residential Networks
Sandoval, Jorge
Cespedes, Sandra
2021 IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM 2021), 2021,
[35] Adaptive multi-user uplink resource allocation based on access delay analysis in IEEE 802.11ax
Min Peng
Qiqi Yin
Kai Zhang
Caihong Kai
Wireless Networks, 2023, 29 : 1223 - 1235
[36] Symbol Timing Synchronization for Uplink Multi-User Transmission in IEEE 802.11ax WLAN
Son, Youngwook
Kim, Seongwon
Byeon, Seongho
Choi, Sunghyun
IEEE ACCESS, 2018, 6 : 72962 - 72977
[37] Improving IEEE 802.11ax UORA Performance: Comparison of Reinforcement Learning and Heuristic Approaches
Kosek-Szott, Katarzyna
Szott, Szymon
Dressler, Falko
IEEE ACCESS, 2022, 10 : 120285 - 120295
[38] Adaptive multi-user uplink resource allocation based on access delay analysis in IEEE 802.11ax
Peng, Min
Yin, Qiqi
Zhang, Kai
Kai, Caihong
WIRELESS NETWORKS, 2023, 29 (03) : 1223 - 1235
[39] Improving IEEE 802.11ax UORA Performance: Comparison of Reinforcement Learning and Heuristic Approaches
Kosek-Szott, Katarzyna
Szott, Szymon
Dressler, Falko
IEEE Access, 2022, 10 : 120285 - 120295
[40] Uplink Frame Transmission with Functions of Adaptive Triggering and Resource Allocation of OFDMA in Interfering IEEE 802.11ax Wireless LANs
Takahashi, Ryoichi
Tanigawa, Yosuke
Tode, Hideki
IEICE TRANSACTIONS ON COMMUNICATIONS, 2021, E104B (06) : 664 - 674

← 1 2 3 4 5 →